Personal tools

WarcManager:Urls: Difference between revisions

From Adapt

Jump to: navigation, search
No edit summary
 
No edit summary
Line 4: Line 4:
**/rest/collection - returns a list of Collections
**/rest/collection - returns a list of Collections
* list entries for a url - returns a list of entries for the given url or urlID
* list entries for a url - returns a list of entries for the given url or urlID
** /rest/url/url
** /rest/url/{url}
*** url - url of page to retrieve
*** url - url of page to retrieve
** /rest/url/urlID  
** /rest/url/{urlID }
*** urlID - db id of url to retrieve
*** urlID - db id of url to retrieve
* retrieve files from the archive
* retrieve files from the archive
** /rest/retrieve/currentdate/url  
** /rest/retrieve/{currentdate}/{url }
*** currentdate - point in time to return
*** currentdate - point in time to return
*** url - page url
*** url - page url
** /rest/retrieve/entryId
** /rest/retrieve/{entryId}
*** entryId - dat entry id to return (id from list entries)
*** entryId - dat entry id to return (id from list entries)
* list entries in a dat/arc file
* list entries in a dat/arc file
** /rest/warc/{id}
** /rest/warc/{id}
*** id - db id of dat file
*** id - db id of dat file

Revision as of 19:49, 16 April 2010

Urls available for the warc manager

  • list collections
    • /rest/collection - returns a list of Collections
  • list entries for a url - returns a list of entries for the given url or urlID
    • /rest/url/{url}
      • url - url of page to retrieve
    • /rest/url/{urlID }
      • urlID - db id of url to retrieve
  • retrieve files from the archive
    • /rest/retrieve/{currentdate}/{url }
      • currentdate - point in time to return
      • url - page url
    • /rest/retrieve/{entryId}
      • entryId - dat entry id to return (id from list entries)
  • list entries in a dat/arc file
    • /rest/warc/{id}
      • id - db id of dat file