Web Archiving but not on displayed on our ArchiveIT collection
Hi-- I learned that our state library has to delete some pages due to a change in law on allowing what is considered CRT on state education pages. I manage the website for our state's library association so I will be able to host the information in question on the library association's website.
In preparation for the changes, I saved the entire state library page and a couple of pages in question in my library's Archive-It account with a test crawl. I also saved them using the Save Page Now function on the Wayback Machine.
Now--
#1 I don't really want to save these pages as a collection on my library's web archive because they are not our pages or in our library's collection policy.
#2 I don't have enough space in the allotted ArchiveIT subscription
How can I make sure these pages are well archived in the Way Back Machine-- is using the Save Page Now sufficient or is there something I can do with my test crawls apart from saving them in my institution account?
-
Hello Kelly,
That's good thinking to use Save Page Now. If I understand correctly and the priority is to make sure that a few select pages are preserved in the Wayback Machine collection on archive.org, then I think you're all set. These pages are stored in the same standard WARC file format as Archive-It crawls, stored and maintained by the Internet Archive, and should remain accessible perpetually.
When/if you need more, I might recommend first checking in to see if the pages are in the collecting scope of our partners at the South Dakota State Archives and South Dakota State Library here.
Or, if you need to collect them yourself ultimately, we could check for any efficiencies that could pare down the volume of data so that it is more manageable on your budget. Selecting the right seed type and scope constraints can go a long way to elide superfluous data that you may not want or need to preserve, for instance.
Let us know here if we can help further, though!
Please sign in to leave a comment.
Comments
1 comment