Test crawls URLs not showing up in Wayback

Comments

    Karl-Rainer Blumenthal

    Hi, Sarah! Apologies that this wasn't clearer as you reviewed your crawls, but the good news is that your test captures are indeed available to view in Wayback here:

    https://wayback.archive-it.org/10614-test/*/https://twitter.com/slavresistance/status/1016697918970105857/

    https://wayback.archive-it.org/10614-test/*/https://twitter.com/MosesSumney/status/1014235370521673728/

    Note the -test suffix next to the collection number in the URLs above, which does not appear in the links that you followed in your post. Links in this -test style should appear in your test crawl's Seeds report, so please let us know via the support channel if you see anything to the contrary and we'll check it out. Wayback URLs like the ones that you provided appear in your overarching collection's Seeds tab and will work only if and when you elect to permanently save the contents of your tests.
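
    As a rough illustration of the difference between the two link styles, the sketch below switches a Wayback URL between its test form and its permanent form. It assumes only the URL pattern visible in the links above (host, collection number, optional -test suffix, then the rest of the path); the function name toggle_test_suffix is hypothetical and is not part of any Archive-It tool.

```python
import re

# Pattern inferred from the example links in this thread (an assumption,
# not an official specification): host, collection number, optional
# "-test" suffix, then the remainder of the path.
WAYBACK_PATTERN = re.compile(
    r"^(https://wayback\.archive-it\.org/)(\d+)(-test)?(/.*)$"
)


def toggle_test_suffix(wayback_url: str, test: bool) -> str:
    """Return the URL with the "-test" suffix added (test=True) or removed."""
    match = WAYBACK_PATTERN.match(wayback_url)
    if match is None:
        raise ValueError(f"Unrecognized Wayback URL: {wayback_url}")
    prefix, collection, _suffix, rest = match.groups()
    suffix = "-test" if test else ""
    return f"{prefix}{collection}{suffix}{rest}"


# A collection-level link, rewritten to point at the test captures.
permanent = (
    "https://wayback.archive-it.org/10614/*/"
    "https://twitter.com/MosesSumney/status/1014235370521673728/"
)
print(toggle_test_suffix(permanent, test=True))
```

    Keep in mind that the permanent form will resolve only after the test crawl's contents have been saved.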

    In this case, I think that you will find little difference between crawling your Twitter post seed as a "standard" or "one page" seed, because the way the seed is formatted, with a trailing slash after the post ID number, effectively restricts the scope of the crawl to that one post alone. Let us know if that raises any new questions about formatting or scoping strategy though, of course!
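
    To make the trailing-slash effect concrete, here is a deliberately simplified model of prefix-based scoping. It assumes that a seed's scope is everything sharing the seed's path up to its last slash; the actual crawler applies more rules than this, and the function names implied_scope_prefix and in_scope are hypothetical.

```python
def implied_scope_prefix(seed: str) -> str:
    """Return the URL prefix that this simplified model treats as in scope."""
    scheme_end = seed.find("://") + 3
    last_slash = seed.rfind("/")
    if last_slash < scheme_end:
        # No slash after the host name: scope is the whole host.
        return seed + "/"
    # Otherwise keep everything up to and including the last slash.
    return seed[: last_slash + 1]


def in_scope(seed: str, url: str) -> bool:
    """True if url falls under the seed's implied prefix (simplified model)."""
    return url.startswith(implied_scope_prefix(seed))


with_slash = "https://twitter.com/MosesSumney/status/1014235370521673728/"
without_slash = "https://twitter.com/MosesSumney/status/1014235370521673728"
# Made-up status ID, used only to represent a different post.
other_post = "https://twitter.com/MosesSumney/status/1"

print(in_scope(with_slash, other_post))     # False
print(in_scope(without_slash, other_post))  # True
```

    Under this model, the seed with the trailing slash covers only URLs beneath that one post, while the same seed without the slash would also cover the account's other posts.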

     
