On this page:
- When to use Brozzler
- How to use Brozzler
When to use Brozzler
In most cases, we recommend using Brozzler only after you've tried a crawl using the Standard crawling technology (Heritrix/Umbra). If it seems like dynamic elements were not captured in your Standard crawl or you're seeing a number of error pages when viewing results in Wayback, running a test crawl using Brozzler is a good next step.
Brozzler is also recommended when crawling:
Differences between Brozzler and Standard crawls:
Please keep in mind that Brozzler and Standard crawls use different capture mechanisms so there may be differences in the amount of data each crawler can capture from the same Seed. When using a new crawling technology, please run a test crawl first. Please keep in mind the following:
- Brozzler is not yet configured for PDF-only crawls.
- Brozzler is globally available for One-Time or Test crawls. If you would like to use it for Recurring crawls please submit a support ticket to have the feature enabled.
How to use Brozzler
Running Brozzler Crawls
You will see the option to choose between Brozzler and our “Standard” (Heritrix/Umbra) crawling technology in a new field within the “Run Crawl” dialog called “Crawling Technology”.
Run a Brozzler test crawl before deciding to use Brozzler for a production crawl.
Reviewing Brozzler Crawls
Brozzler crawls return the same post-crawl reports as Standard crawls and can be differentiated from Standard crawls by the Brozzler icon. It will be listed next to the Crawl ID in the Crawl Report list and on individual crawl reports for all Brozzler crawls.
The crawling technology used in a crawl (Brozzler or Standard) is also indicated on the Overview tab of each crawl report.
Enabling Brozzler For Recurring Crawls
Brozzler can be enabled as an option for recurring crawls. If you would like to use Brozzler on recurring crawls please request it by submitting a support ticket.