WarcManager:Urls: Difference between revisions
From Adapt
No edit summary |
No edit summary |
||
Line 9: | Line 9: | ||
*** urlID - db id of url to retrieve | *** urlID - db id of url to retrieve | ||
* retrieve files from the archive | * retrieve files from the archive | ||
** /rest/retrieve/{currentdate}/{url } | ** /rest/retrieve/{currentdate}/{url} | ||
*** currentdate - point in time to return | *** currentdate - point in time to return | ||
*** url - page url | *** url - page url |
Latest revision as of 19:49, 16 April 2010
Urls available for the warc manager
- list collections
- /rest/collection - returns a list of Collections
- list entries for a url - returns a list of entries for the given url or urlID
- /rest/url/{url}
- url - url of page to retrieve
- /rest/url/{urlID }
- urlID - db id of url to retrieve
- /rest/url/{url}
- retrieve files from the archive
- /rest/retrieve/{currentdate}/{url}
- currentdate - point in time to return
- url - page url
- /rest/retrieve/{entryId}
- entryId - dat entry id to return (id from list entries)
- /rest/retrieve/{currentdate}/{url}
- list entries in a dat/arc file
- /rest/warc/{id}
- id - db id of dat file
- /rest/warc/{id}