Personal tools

WarcManager:Urls: Difference between revisions

From Adapt

Jump to: navigation, search
No edit summary
No edit summary
 
Line 9: Line 9:
*** urlID - db id of url to retrieve
*** urlID - db id of url to retrieve
* retrieve files from the archive
* retrieve files from the archive
** /rest/retrieve/{currentdate}/{url }
** /rest/retrieve/{currentdate}/{url}
*** currentdate - point in time to return
*** currentdate - point in time to return
*** url - page url
*** url - page url

Latest revision as of 19:49, 16 April 2010

Urls available for the warc manager

  • list collections
    • /rest/collection - returns a list of Collections
  • list entries for a url - returns a list of entries for the given url or urlID
    • /rest/url/{url}
      • url - url of page to retrieve
    • /rest/url/{urlID }
      • urlID - db id of url to retrieve
  • retrieve files from the archive
    • /rest/retrieve/{currentdate}/{url}
      • currentdate - point in time to return
      • url - page url
    • /rest/retrieve/{entryId}
      • entryId - dat entry id to return (id from list entries)
  • list entries in a dat/arc file
    • /rest/warc/{id}
      • id - db id of dat file