Archive-It User Guide
Getting Started
- Guide for new Archive-It users
- Archive-It Video Curriculum
- Known Web Archiving Challenges
- What is web archiving?
- Support Ticket Submission
- Set up and administer your account
Collections
- How seeds, documents, and collections work together
- Create and manage a collection
- Select Seed URLs
- Add, edit, and manage your metadata
- Add and edit collection information
- Add and Edit Seed Level Metadata
Scoping
- How Archive-It crawlers determine scope
- Modify your collection or seed scope
- Assign and edit a "seed type"
- Adding Seed Level Scoping Rules in Bulk
- Identify and avoid crawler traps
- Limit your crawl
Scoping crawls for specific types of sites
- Scoping guidance for specific types of sites
- Archiving ArcGIS
- Archiving Blogspot sites
- Archiving Facebook
- Archiving Flickr streams
- Archiving Google Docs
Crawling
- Run, monitor, and save a test crawl
- Manually start test and one-time crawls
- Crawl new seeds immediately with InstaCrawl
- Schedule crawls
- Monitor crawls
- Resume a finished crawl
Reviewing
Quality Assurance (QA)
Access
- Access your account with the Archive-It Partner API
- Archive-It APIs and integrations
- Controlling access to your web archives
- Controlling the public visibility of specific metadata fields
- Browse and search on archive-it.org
- Google Analytics for your archive-it.org content
Storage and preservation
- Request and download web archive derivatives with WASAPI
- Archive-It Storage and Preservation Policy
- WARC Naming Conventions
- Derivative Data Sets
- Partner Guide to Downloading Archive-It Data
- Find and download your WARC files with WASAPI
Resources
- Archive-It System Status
- Free trial Archive-It accounts
- Troubleshooting browser issues
- Frequently Asked Questions
- Live Archive-It Training Sessions and Webinars
- Archive-It Office Hours