Archive-It offers partners several ways to integrate their data into their own systems. Using one or more of them can connect your Archive-It data to your other access points. Some example integrations are listed below; please reach out to us if you have additional examples of how you are integrating, or would like to integrate, your data into external systems.
OpenSearch is an API to exchange search results between search engines. It’s possible to use OpenSearch to integrate search functionality, complete with a search box, or to create a custom integration.
- NYARC uses the OpenSearch API to connect to their custom-built Primo federated search so that archived web results appear in their catalog alongside physical and other digital results.
- Princeton Theological Seminary integrates a search box to search and display results from their archived web content.
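As a rough illustration of a custom OpenSearch integration, the sketch below builds a search URL and reads results out of an RSS-style OpenSearch response. The base URL, the `q` and `i` (collection ID) parameter names, and the sample response are assumptions made for illustration, not confirmed details of the Archive-It endpoint; check the OpenSearch documentation for the actual values.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespace defined by the OpenSearch 1.1 specification.
OPENSEARCH_NS = {"opensearch": "http://a9.com/-/spec/opensearch/1.1/"}

# Hypothetical response, shaped like an RSS 2.0 OpenSearch result feed.
SAMPLE_RSS = """<rss version="2.0" xmlns:opensearch="http://a9.com/-/spec/opensearch/1.1/">
  <channel>
    <title>Search results</title>
    <opensearch:totalResults>1</opensearch:totalResults>
    <item>
      <title>Example archived page</title>
      <link>https://example.org/archived/page</link>
    </item>
  </channel>
</rss>"""


def build_search_url(base_url, query, collection_id=None):
    """Build a search URL; the parameter names "q" and "i" are assumed."""
    params = {"q": query}
    if collection_id is not None:
        params["i"] = collection_id
    return f"{base_url}?{urlencode(params)}"


def parse_results(rss_text):
    """Return (total_results, [{"title": ..., "link": ...}, ...])."""
    channel = ET.fromstring(rss_text).find("channel")
    total = channel.findtext(
        "opensearch:totalResults", default="0", namespaces=OPENSEARCH_NS
    )
    items = [
        {
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
        }
        for item in channel.findall("item")
    ]
    return int(total), items
```

In a real integration, the text returned by fetching `build_search_url(...)` would be passed to `parse_results`, and the resulting titles and links rendered next to your other catalog results.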
OAI-PMH is an API that moves metadata between Archive-It and an endpoint; many partners use it to connect to their catalogs or other access points. Our collection-level OAI-PMH feed is automatically harvested by OCLC’s WorldCat Gateway service, so its records show up in searches on WorldCat.
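For partners writing their own harvester, a minimal sketch of the OAI-PMH side might look like the following. The `ListRecords` verb, the `oai_dc` metadata prefix, `resumptionToken`, and the XML namespaces come from the OAI-PMH standard; the base URL and the sample response are placeholders, so substitute the feed address given in the Archive-It OAI-PMH documentation.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Namespaces defined by the OAI-PMH 2.0 and Dublin Core standards.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical response containing one record and a paging token.
SAMPLE_XML = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:collection-1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example Web Collection</dc:title>
        </oai_dc:dc>
      </metadata>
    </record>
    <resumptionToken>page-2</resumptionToken>
  </ListRecords>
</OAI-PMH>"""


def list_records_url(base_url, resumption_token=None):
    """Build a ListRecords request; follow-up pages send only the token."""
    if resumption_token:
        params = {"verb": "ListRecords", "resumptionToken": resumption_token}
    else:
        params = {"verb": "ListRecords", "metadataPrefix": "oai_dc"}
    return f"{base_url}?{urlencode(params)}"


def parse_records(xml_text):
    """Return (records, next_token); next_token is None on the last page."""
    root = ET.fromstring(xml_text)
    records = []
    for record in root.iterfind(".//oai:record", NS):
        records.append({
            "identifier": record.findtext(
                "oai:header/oai:identifier", default="", namespaces=NS
            ),
            "titles": [el.text or "" for el in record.iterfind(".//dc:title", NS)],
        })
    token = root.findtext(".//oai:resumptionToken", default="", namespaces=NS)
    return records, (token or None)
```

A harvester would repeat the request with the returned token until `parse_records` reports `None`, then load the collected records into the catalog.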
Partner Data API
The Partner Data API connects to partner-centric data, from content to metadata. It’s possible to tap the Partner Data API to build custom integrations.
- SUNY Albany uses the Partner Data API to populate university archives search results with archived pages from Albany’s web domain. They include relevant information from their Archive-It account such as crawl start and end times, duration, and any rules that were in place for a crawl.
- Princeton Theological Seminary uses the Partner Data API to pull in XML metadata about collections and seeds, which lets them share descriptive metadata about their archived content and offer browsing of their web archives on their own website.
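To give a sense of what a custom integration might do with crawl information such as start time, end time, and duration, here is a small sketch that summarizes crawl records from a JSON payload. The field names (`id`, `start_date`, `end_date`), the timestamp format, and the sample payload are all hypothetical; the real Partner Data API response may be shaped differently, so treat this only as a pattern.

```python
import json
from datetime import datetime

# Hypothetical payload; real field names and formats may differ.
SAMPLE_JSON = """[
  {"id": 1, "start_date": "2020-01-01T00:00:00Z", "end_date": "2020-01-01T02:30:00Z"},
  {"id": 2, "start_date": "2020-02-01T12:00:00Z", "end_date": "2020-02-02T12:00:00Z"}
]"""

TIME_FORMAT = "%Y-%m-%dT%H:%M:%SZ"  # assumed timestamp format


def summarize_crawls(json_text):
    """Return one summary row per crawl, with the duration in hours."""
    rows = []
    for job in json.loads(json_text):
        start = datetime.strptime(job["start_date"], TIME_FORMAT)
        end = datetime.strptime(job["end_date"], TIME_FORMAT)
        rows.append({
            "crawl_id": job["id"],
            "start": start.isoformat(),
            "end": end.isoformat(),
            "duration_hours": (end - start).total_seconds() / 3600,
        })
    return rows
```

Rows like these could then be merged into an existing search index or finding aid, so that crawl details appear next to the archived pages they describe.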
Archive-It’s Wayback CDX is the index of all archived content. The Wayback browsing interface uses it to look up and serve the specific captures requested by an end user, such as from the Wayback calendar page.
- SUNY Albany uses the CDX to populate university archives search results with archived pages from Albany’s web domain.
- Princeton Theological Seminary pulls from the CDX to show a custom display of capture dates of their crawled content.
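A custom display of capture dates, like the one described above, could start from something like this sketch, which reads CDX index lines and returns the capture times in order. It assumes the common space-separated CDX layout in which the second field is a 14-digit timestamp and the third is the original URL; the sample lines are invented, and the field order of the index you query should be verified first.

```python
from datetime import datetime

# Invented sample lines in an assumed space-separated CDX layout:
# urlkey, timestamp, original URL, MIME type, status code, digest, ...
SAMPLE_CDX = """\
 CDX N b a m s k r M S V g
org,example)/ 20200607080910 http://example.org/ text/html 200 BBBB - - 2345 6789 file2.warc.gz
org,example)/ 20190102030405 http://example.org/ text/html 200 AAAA - - 1234 5678 file1.warc.gz
"""


def capture_dates(cdx_text):
    """Return capture datetimes from CDX text, oldest first."""
    dates = []
    for line in cdx_text.splitlines():
        fields = line.split()
        # Skip blank or short lines and the optional "CDX ..." header line.
        if len(fields) < 3 or fields[0] == "CDX":
            continue
        dates.append(datetime.strptime(fields[1], "%Y%m%d%H%M%S"))
    return sorted(dates)
```

Each returned datetime can be formatted for display, for example grouped by year, to produce a simple capture calendar for a page.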
Below is a list of partners who have built a custom portal that integrates their archived web content into their own websites:
- Washingtoniana Web Archive (DC Public Library)
- Clark Art Institute Library
- Princeton Theological Seminary Web Archive
- Emory Web Archives (Emory University)
- Virginia Memory (Library of Virginia)
- Human Rights Documentation Initiative (University of Texas at Austin)
- Bryn Mawr, Haverford and Swarthmore Colleges (the Tri-College Consortium)
- Latin American Government Documents Archive (University of Texas at Austin)
- Columbia University Human Rights Web Archive
- National Institutes of Health Web Archive
- North Carolina State Government Web Site Archives (North Carolina Office of Archives and History)
- Tennessee State Government Websites (Tennessee State Library and Archives)
- TRAIL: Texas Records and Information Locator (Texas State Library and Archives Commission)
- Web Archives at Indiana University
- Web Archiving @ the University of Melbourne
- University of Minnesota
- Learning and Teaching Scotland Archive Search Page
- Bodleian Libraries (University of Oxford)
- Washington and Lee University School of Law
- Autorenhomepages (Author Homepages) (University of Innsbruck)
- Maryland State Government Website Archive (Maryland State Archives)
- Web Archives at Michigan State University
- University of Alberta Archive-It Collections
- Web Archiving Project for the Pacific Islands (University of Hawaii at Manoa)
- University of Texas at San Antonio Web Collections (University of Texas at San Antonio)
- University of Wisconsin Milwaukee