ArcGIS is a geographic information system maintained by the Environmental Systems Research Institute (ESRI). Because the ArcGIS platform relies on a large amount of custom JavaScript for display, it can be challenging to capture every piece required for replay in Wayback. ArcGIS pages vary considerably, so we strongly recommend running a test crawl and reviewing the results before proceeding. Unfortunately, we are not able to devote engineering resources to individual issues with ArcGIS at this time.
With these known capture and replay constraints in mind, here are the recommendations for crawling ArcGIS with Archive-It:
- Expand the scope of your seed to include arcgis.com. ArcGIS often hosts data on multiple servers and hosts, which makes a narrower fix hard to identify and implement.
- Crawl with Brozzler.
- Run a test crawl. Because expanding scope can increase the amount of data captured, we recommend running test crawls before committing data directly to your account.
- Use Wayback QA on the archived pages.
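To get a rough sense of what the scope expansion above would cover, it can help to check a list of URLs from a page before crawling. The following is a minimal sketch, not Archive-It's actual scoping logic: the function name `in_expanded_scope`, the seed host `example.org`, and the sample URLs are all hypothetical, and it assumes that "include arcgis.com" means the domain itself plus any of its subdomains.

```python
from urllib.parse import urlsplit


def in_expanded_scope(url, seed_host, extra_domains=("arcgis.com",)):
    """Return True if the URL's host is the seed host, one of the extra
    domains, or a subdomain of any of them.

    Hypothetical helper for previewing scope; Archive-It applies its own
    scoping rules, which this does not reproduce.
    """
    host = (urlsplit(url).hostname or "").lower()
    allowed = (seed_host.lower(),) + tuple(d.lower() for d in extra_domains)
    # Match the exact domain or a dot-separated subdomain, so that
    # "notarcgis.com" is not mistaken for "arcgis.com".
    return any(host == d or host.endswith("." + d) for d in allowed)


if __name__ == "__main__":
    # Illustrative URLs only; substitute the requests your own page makes.
    candidates = [
        "https://www.example.org/maps/viewer",
        "https://services.arcgis.com/abc123/rest/services",
        "https://notarcgis.com/tiles/1/2/3.png",
    ]
    for url in candidates:
        print(url, in_expanded_scope(url, "example.org"))
```

A check like this only previews host-level scope. It does not replace the test crawl, which remains the way to see what is actually captured and how much data the expansion adds.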
You may be able to get a more complete capture of ArcGIS content using manual web crawling software like Conifer. If you choose this approach, please keep in mind that, because of the issues replaying the custom JavaScript mentioned above, we cannot guarantee Wayback replay of uploaded WARCs that include ArcGIS.