There are two ways to view and understand the content archived in your collections.
To get an authoritative listing of what was captured, consult the Hosts report for any given crawl, specifically the 'Docs' column in that report will allow you to view exactly what URLs were archived from each host. For more complete guidance on reading this and other reports, see: How to read your crawl's report.
You can also browse the archived websites directly to get a feel for your collection and its contents. Archived content is available to browse in Wayback 24 hours after a production crawl finishes. From inside the application, go to the "Archives" section and select the "Wayback" tab. Now enter the seed URL you want to see and click "Go". To view the seed URLs for a specific collection, select that collection from the drop-down menu under the search box. You can also do a keyword text search under the "Search" tab while logged into the application or from the public Archive-It site (www.archive-it.org). Archived content is available for full text search 7 days after a production crawl finishes or a test crawl is saved. Enter any keyword and the search engine will search for it in the text of your archived web pages.
Publicly accessible collections can also be browsed any time from the public site (www.archive-it.org). From the homepage, partners can access their collections by searching for a specific collection in the "Explore Collections" search box or for their organization in the "Explore Collecting Organizations" search box. You can also view collection information by searching or browsing from the Explore page: http://www.archive-it.org/explore
Comments
0 comments
Please sign in to leave a comment.