Avatar

Russell White

  • Total activity 25
  • Last activity
  • Member since
  • Following 0 users
  • Followed by 0 users
  • Votes 10
  • Subscriptions 10

Activity overview

Latest activity by Russell White
  • Avatar

    Russell White commented,

    I've uploaded this Python reporting script to a Github repository along with documentation. There's the script file (seedstats.py) and a config file (seedstats_config.py). To run the script you'll ...

  • Avatar

    Russell White commented,

    Okay, great! In its present form, the script takes a string as input and gets a total data count (all crawls, all time) for any seed URL matching the string, which could be specific (e.g., 'https:/...

  • Avatar

    Russell White commented,

    I realize this is not quite what you're asking for, but I wrote a command line script in Python recently that does this by retrieving seed and crawl data from Archive-It's Partner API. This might w...

  • Avatar

    Russell White created a post,

    Seed-level WARCs and data de-duplication

    My institution is excited about the new procedure of writing WARC files per seed rather than per crawl as described here. One thing we're wondering about is how seed-level WARCs work with data-de-d...

  • Avatar

    Russell White commented,

    What are the optimal/allowable dimensions for customized institutional logos?