On this page:
- Getting Started
- Post Crawl Analysis
- Advanced Training Webinars
- Advanced Scoping
- Archiving Video Content
- Archiving Social Media
- Advanced Quality Assurance
- Access to Archive-It Collections
- Under the Hood
- Describing Web Archives
- Intro to Brozzler
- WARC Tools for Management and Preservation
- Archive-It as a Reference Tool
- The Web Archiving Systems API (WASAPI)
- The Archive-It Partner API
- Teaching Archive-It
- Collection development policies
- The Wayback API
- Intro to the WARC
- Lone Arrangers
- The State of Social Media Web Archiving
- Reassess Your Access: Waybackfill and Redirection Service
Getting Started
Navigating Archive-It
New to Archive-It or just need a refresher? This video will help you get acquainted with the Archive-It web application, pointing out where features are located and why you might want to use them.
Navigating Archive-It Section |
Time |
Public Website | 01:08 |
Partner Account Application | 02:32 |
Collection Overview Tab | 05:11 |
Seeds Tab | 06:06 |
Crawls Tab | 07:19 |
Crawl Scope Tab | 08:12 |
Metadata Tab | 08:37 |
Wayback QA Tab | 08:56 |
Administrative Functions
Are you the administrator of your Archive-It account? If so, watch this video to find out how to add and edit users, and how to use the other administrative features available in your account. Features include: URL customization, account level metadata, and Google Analytics [replaced by Plausible Analytics].
Administrative Functions Section |
Time |
Add and Edit Users | 00:24 |
Customize URL, Logo, Description | 01:17 |
Set Metadata Fields to Private | 05:11 |
Google Analytics [deprecated - replacement] | 02:09 |
Pre-crawl Scoping
Make sure your seeds are set up correctly before you start your crawls. In this video you'll learn tips for selecting, formatting, and administering your seed URLs before you run a crawl to help capture the data you're looking for.
Pre-Crawl Scoping Section |
Time |
Selecting and Formatting Seeds | 00:25 |
Seed Types | 04:10 |
Standard Scoping Rules | 05:12 |
Test Crawls
Adding new seeds or scoping rules? You'll want to make sure you run a test crawl! This video will give you tips on how and why to run test crawls in your collections.
Test Crawls Section |
Time |
Run a Test Crawl | 00:09 |
Review a Test Crawl | 01:14 |
Save a Test Crawl | 02:08 |
PDF Only Crawls
Learn how to run a PDF Only crawl and how to access the archived PDFs
PDF Only Crawl Section |
Time |
One-Time or Test Crawls | 00:03 |
Scheduled Crawls | 00:39 |
Review Crawls | 01:03 |
Post-crawl Analysis
Getting the most from your post crawl reports
So you've run a crawl, now what? This video walks through each report to provide detail on why each post-crawl report is necessary, and the information you can glean from them.
Getting the Most Section |
Time |
Crawl Overview | 00:55 |
Seeds Report | 02:46 |
Seeds' Hosts Report | 03:59 |
Hosts Report | 05:13 |
File Types Report | 05:38 |
Understanding your Hosts Report
Don't be overwhelmed by the information in your Hosts report! Find out all of the different ways you can use it to identify crawler traps, block hosts, add data limits, run patch crawls, and more!
Understanding Hosts Reports Section |
Time |
Blocked Hosts | 01:49 |
Patch Crawl Blocked Hosts | 02:54 |
Queued Hosts | 03:27 |
Out of Scope Hosts | 05:39 |
Quality Assurance
What can you do if your archived websites don't look quite right in Wayback? Following these steps may help you improve the capture and replay of your Wayback pages.
Quality Assurance Section |
Time |
Checking Reports | 00:35 |
Wayback QA Tool | 02:19 |
Crawl page as a seed | 06:17 |
Proxy Mode (deprecated) | 04:25 |
Advanced Training Webinars
These recordings of our advanced training webinars are recommended for users who are familiar and comfortable with the content outlined in our “Getting Started” and “Post Crawl Analysis” video curricula.
Advanced Scoping
This live session on advanced crawl scoping tools and techniques will empower you with a toolbox of tips and tricks for you to use as you crawl. Recorded August 28, 2018.
Archiving Video Content
This webinar explains how general archiving workflows apply to video content. A look at both capture and replay, as related to YouTube, Vimeo, and streaming video platforms, followed by a live review of results from a YouTube crawl. Recorded February 7, 2017.
Archiving Social Media
In this webinar, the focus is on Facebook, Twitter, Instagram and YouTube. We also cover how to scope embedded social content and quality assurance strategies relevant to social media sites. Recorded November 14, 2017.
Web Archiving Quality Assurance
This recording takes an in-depth look at quality assurance strategies that will strengthen your ability to assess and improve your crawl results. Recorded August 8, 2017.
Access to Archive-It Collections
This webinar reviews different strategies used by partners for providing and boosting access to their Archive-It collections. Recorded May 2, 2017.
Under the Hood: Tips & Tools
This webinar takes a look at the tips and tools the Archive-It team uses most in their own web archiving and quality assurance workflows. Recorded February 13, 2018.
Under the Hood Section |
Time |
Collections and Seeds | 02:00 |
CDX | 07:05 |
Browser Tips | 19:10 |
Wayback QA | 23:35 |
Search | 30:20 |
Describing Web Archives
This webinar takes a look at some ideas and methods for descriptive metadata practice and features Archive-It partners and peers. Recorded May 22, 2018. Presentation materials and further discussion about this topic may be found here in the Archive-It Community Forum.
Intro to Brozzler
This webinar describes and demonstrates the new browser-based web capture technology available to Archive-It partners. Recorded July 10, 2018.
WARC Tools for Management and Preservation
This webinar takes a look at the tools some Archive-It partners use in their own web archiving workflows for WARC management and preservation. Recorded November 20, 2018.
Archive-It as a Reference Tool
This webinar demonstrates how you can use Archive-It and other web archives as a reference tool. Recorded February 26, 2019.
The Web Archiving Systems API (WASAPI)
This webinar demonstrates how to use WASAPI as a tool to access, download, and transfer WARC file data and metadata. Recorded May 29, 2019.
The Archive-It Partner API
This webinar introduces the Archive-It Partner API and describes the way you can use it to develop custom access layers, manage administrative metadata, or to preserve technical and descriptive metadata, among other use-cases. Recorded August 21, 2019.
Teaching Archive-It
A webinar with Archive-It partners from Library and Archives Canada, New York University, and the Internet Archive about their web archiving training and education practices and resources for new web archivists as they get started. Recorded November 19, 2019.
Collection development policies
A webinar with Archive-It partners from Queens Public Library, Austin Presbyterian Theological Seminary, and UC San Diego about making, following, and updating the policies that guide their collecting decisions. Recorded February 18, 2020.
The Wayback API
A webinar about the APIs that anyone can use in order to retrieve metadata about web captures in the Internet Archive's Wayback Machine and Archive-It partners' collections, including access, quality assurance, and development applications. Recorded April 1, 2020.
Introduction to the WARC
A recording of the live webinar introduction to the Web ARChive (WARC) file format for Archive-It partners, web archivists, and peers in digital preservation. Recorded May 5, 2020.
Sample WARC file for Archive-It Advanced Training (available for download)
The State of Social Media Web Archiving
A webinar discussing the state of web archiving social media. Recorded December 16, 2020.
Reassess Your Access: Waybackfill and Redirection Service
A webinar discussing the new Archive-It services Waybackfill and Redirection Service. Recorded March 31, 2021.
Comments
3 comments
Dear,..
I understand its a little, but I'm deaf, can't ear by sound. Will help me & traning to understand one by one in a few week then done. If can do....
Thanks,....
I would like more example documentation regarding calling APIs with javascript, eg. in order to display collections on my HTML home page. There's little/no information as to how to do this.
Does IA have captioned versions of these videos available anywhere?
Please sign in to leave a comment.