Twitter Hashtag Crawls not producing results

Comments

  • Sean Volke

    This probably doesn't help, but I usually use twarc for my Twitter harvests. Now that I think about it, Webrecorder would probably work well too. Dunno how to get the captures from there into Wayback, though.

  • Aimee Everrett

    I'm running into the same issue - have you found a fix for this?

  • Sean Volke

    I have successfully uploaded Webrecorder captures into my Archive-It account, since they are also in WARC format.

  • Ely Sheinfeld

    Unfortunately, it seems that I am in the same boat.

    I have also tried adjusting the date elements in the search parameters and appending them to the seed URL, to try to force the crawl to capture only the few specific tweets from the time period we are interested in, to no avail.

  • Karl-Rainer Blumenthal (Edited)

    Hi all. Sorry for the delay on this one. We're untangling a few interrelated issues with recent Twitter captures on our end. I believe that the issue causing most of your trouble is on our Wayback/replay side to fix, so I do not yet advise changing your crawling strategies. One important exception: we are not yet capable of logging into Twitter. In fact, attempting to do so has been known to cause the error message that you see, Elizabeth. So I would advise removing any login credentials from the seeds that you archive regularly.

    Apologies again for not having a fix yet, but we're on the case and eager to see this one resolved for everyone! Stay tuned and I'll post an update here when there is new information and, hopefully, a fix to review.

     

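For readers unfamiliar with the date-restricted seed approach mentioned in the comments above, here is a rough sketch of how such a seed URL might be put together. It assumes Twitter's `since:` and `until:` search operators and the `https://twitter.com/search?q=` URL pattern; the function name `build_search_seed` and the hashtag are made up for illustration. Per the thread, this approach did not resolve the capture problem by itself.

```python
from urllib.parse import urlencode


def build_search_seed(hashtag: str, since: str, until: str) -> str:
    """Return a Twitter search URL for one hashtag within a date window.

    `since` and `until` are YYYY-MM-DD strings. This helper is hypothetical
    and only shows how the query string is assembled and percent-encoded.
    """
    # Assumed search syntax: "#tag since:YYYY-MM-DD until:YYYY-MM-DD"
    query = f"#{hashtag} since:{since} until:{until}"
    # urlencode escapes '#', ':' and spaces so the URL is safe to paste as a seed
    return "https://twitter.com/search?" + urlencode({"q": query})


seed = build_search_seed("examplehashtag", "2020-01-01", "2020-01-08")
print(seed)
```

The point of encoding the query with `urlencode` is that a raw `#` in a URL would otherwise be read as a fragment marker and the hashtag would never reach the search.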
