What are all these other hosts listed in my crawl's Hosts report?

Updated February 25, 2026 22:46

A website is often assembled from elements hosted in several different locations. Archive-It's crawlers collect all of a page's embedded elements (images, video players, stylesheets, analytics, etc.), even when those elements are served from a host other than the seed's. The crawler may also discover and collect a few documents from a host before determining that the rest of that host is out of scope. If any hosts in the report look particularly odd, contact us and we will investigate whether they present a problem.
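To spot the hosts worth a second look, it can help to separate the hosts that belong to your seeds from everything else. The following is only a rough sketch, not an Archive-It tool: the function name `split_hosts` and the sample host names are hypothetical, and it assumes you have copied the host names from the report into a plain list. Treating subdomains as part of the seed is an assumption made for this example and is not a statement of Archive-It's scoping rules.

```python
from urllib.parse import urlsplit


def split_hosts(seed_urls, report_hosts):
    """Separate report hosts that match a seed's host from all other hosts.

    Hypothetical helper for illustration; not part of any Archive-It API.
    """
    # Extract the host name from each seed URL, skipping URLs without one.
    seed_hosts = {
        host for host in (urlsplit(url).hostname for url in seed_urls) if host
    }

    seed_side, other = [], []
    for host in report_hosts:
        name = host.strip().lower()
        # Assumption: a subdomain of a seed host counts as the seed's own host.
        if any(name == s or name.endswith("." + s) for s in seed_hosts):
            seed_side.append(host)
        else:
            other.append(host)
    return seed_side, other


# Sample data, invented for this example.
seeds = ["https://example.org/"]
hosts = ["example.org", "www.example.org", "fonts.googleapis.com", "www.youtube.com"]

seed_side, other = split_hosts(seeds, hosts)
print("Seed hosts:", seed_side)
print("Other hosts:", other)
```

Hosts in the second list are usually sources of embedded content such as fonts, scripts, or video players. The ones that do not obviously relate to anything on your seed pages are the ones worth asking about.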
