Overview
While we continuously investigate and implement capture improvements, some websites are not created in a way that is "archive-friendly" and can be difficult to capture or replay in their entirety. These difficulties affect all web crawlers, not just ours. When selecting seed URLs and reviewing your archived content, please keep these limitations in mind.
About
Dynamic web content is content that changes based on the behavior and preferences of the user, and it is a known web archiving challenge. While many sites with dynamic web content can be archived without issue, some types can be difficult to capture or replay. This is particularly true of anything highly dependent on human interaction (for example, a click needed to activate something) or on JavaScript (for example, a drop-down menu that appears when you mouse over a word).
Known issues
Here are some examples of dynamic web content that can be challenging to archive:
- Images or text size that adjust dynamically to browser size
- Maps that zoom in and out
- Downloadable files
- Media that requires clicking a “play” button
- Navigation menus
- JavaScript-based pagination
- 3D virtual tours
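As a rough illustration of how some of the items above might be spotted before a crawl, the following Python sketch counts a few markers of dynamic content in a page's HTML. It is not an Archive-It or Brozzler feature: the names `DynamicContentScanner`, `scan_html`, `DOWNLOAD_EXTENSIONS`, and `INTERACTION_ATTRS`, as well as the choice of markers and the sample markup, are all assumptions made for this example. A nonzero count only suggests that a page may deserve a closer look during quality assurance.

```python
from html.parser import HTMLParser

# Hypothetical heuristics chosen for illustration only; they are not
# an official list of what crawlers can or cannot capture.
DOWNLOAD_EXTENSIONS = (".pdf", ".zip", ".docx", ".xlsx", ".csv")
INTERACTION_ATTRS = {"onclick", "onmouseover"}


class DynamicContentScanner(HTMLParser):
    """Counts simple markers of dynamic content in an HTML document."""

    def __init__(self):
        super().__init__()
        self.findings = {
            "scripts": 0,
            "interaction_handlers": 0,
            "media": 0,
            "downloads": 0,
        }

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names for us.
        attr_map = dict(attrs)
        if tag == "script":
            self.findings["scripts"] += 1
        if tag in ("video", "audio"):
            self.findings["media"] += 1
        # Inline handlers hint at content that needs a click or hover.
        if INTERACTION_ATTRS.intersection(attr_map):
            self.findings["interaction_handlers"] += 1
        # Attribute values can be None, so fall back to an empty string.
        href = attr_map.get("href") or ""
        if tag == "a" and href.lower().endswith(DOWNLOAD_EXTENSIONS):
            self.findings["downloads"] += 1


def scan_html(html):
    """Return a dict of marker counts for the given HTML string."""
    scanner = DynamicContentScanner()
    scanner.feed(html)
    scanner.close()
    return scanner.findings


# Made-up sample markup, not taken from any real site.
SAMPLE = """
<html><body>
<script src="menu.js"></script>
<nav><a href="#" onmouseover="showMenu()">Collections</a></nav>
<video src="tour.mp4"></video>
<a href="/files/report.PDF">Annual report</a>
<a href="/about.html">About</a>
</body></html>
"""

print(scan_html(SAMPLE))
```

The sketch uses only the standard library so it can run anywhere. It inspects static markup only, so it will miss behavior that scripts attach after the page loads, which is one reason a browser-based review remains necessary.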
Troubleshooting
1. Review our Quality Assurance overview
While each situation is different and can sometimes need special attention, it is helpful to employ these general recommendations for troubleshooting as a first step.
2. Crawl using Brozzler
If you haven’t yet, it’s also a good idea to try crawling dynamic content using Brozzler (rather than Archive-It’s "Standard" crawling technology). This is because, unlike Standard, Brozzler records interactions between servers and web browsers as they occur, more closely resembling how a human user would experience the web.
3. Try these specific troubleshooting steps:
Dynamic content | Troubleshooting
Images or text size that adjust dynamically to browser size |
Maps that zoom in and out |
Downloadable files |
Media that requires clicking a “play” button |
Navigation menus |
JavaScript-based pagination |
Outcome
At present, because of general web archiving limitations, we will not be able to capture or replay some dynamic content in its entirety. If you need help troubleshooting, please submit a support ticket and we will investigate.
Related content
How to use the Wayback QA tool