The Archive-It workflow is a model for web archiving when using the Archive-It service. This workflow is not linear; a step, or a sequence of steps, may need to be repeated in order to achieve the best results. These steps provide a framework for moving web content into an archived format. They work best in the Archive-It ecosystem, but fit into a broader web archiving lifecycle.
Each section represents a distinct step in the workflow:
- Collections are a group of URLs curated around a common theme, topic, or domain.
- Scope determines what the crawler will capture and what it won’t. Scoping is the process of, and use of tools, to tell the crawler how to adjust the scope. This includes general scoping, as well as scoping for specific web platforms.
- Crawling is the use of software, called crawlers, to visit websites and index the information included therein.
- Reviewing is the activity of evaluating completed captures.
- Quality Assurance includes the use of tools and articles related to improving the quality of captures.
- Access is the step of sharing content, by either making is publicly available, or sharing the private collection link, if applicable.