WordPress is a very popular website publishing and hosting platform that supports many of the sites our partners wish to archive, from simple blogs to highly sophisticated subdomains of larger websites. In general, our crawler can reliably archive material hosted and served by WordPress without any special scope modifications; you may crawl and archive these sites as you would any other typical seed in your collections.
As with all such sites, however, we strongly recommend reviewing the results of your crawls to ensure that no special limitations, such as robots exclusions, prevent your WordPress site from archiving fully. When reviewing your Hosts report in particular, you may notice many URLs from your WordPress site that contain directories like /wp-admin or /wp-login and that were either blocked by a robots exclusion or deemed "out of scope" and therefore not archived. This is completely normal and appropriate, as those URLs refer to areas reserved for administrators of the targeted website, rather than any publicly visible front-end material.
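If you export a list of unarchived URLs for review, a small script can help separate the expected administrative URLs from those that may deserve a closer look. The following is only a minimal sketch in Python, not part of our crawler or web application: the robots.txt content, the example.org URLs, and the helper names (build_parser, is_admin_url, classify) are all hypothetical and chosen for illustration. It uses Python's standard urllib.robotparser module, whose rule matching may differ in detail from the crawler's.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, for illustration only.
SAMPLE_ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /private/
"""

# Path fragments for the WordPress administrative areas named above.
ADMIN_MARKERS = ("/wp-admin", "/wp-login")


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a robots.txt parser from text instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def is_admin_url(url: str) -> bool:
    """Return True if the URL path contains a WordPress admin marker."""
    path = urlsplit(url).path
    return any(marker in path for marker in ADMIN_MARKERS)


def classify(url: str, parser: RobotFileParser, user_agent: str = "*") -> str:
    """Label a URL as 'expected', 'review', or 'ok'.

    'expected': an administrative URL; not archiving it is normal.
    'review':   blocked by robots.txt but not an administrative URL.
    'ok':       neither administrative nor blocked by robots.txt.
    """
    if is_admin_url(url):
        return "expected"
    if not parser.can_fetch(user_agent, url):
        return "review"
    return "ok"


if __name__ == "__main__":
    parser = build_parser(SAMPLE_ROBOTS_TXT)
    for url in (
        "https://example.org/2020/01/hello-world/",
        "https://example.org/wp-admin/options.php",
        "https://example.org/wp-login.php",
        "https://example.org/private/report.pdf",
    ):
        print(classify(url, parser), url)
```

In this sketch, only URLs labeled "review" would call for follow-up, since they are blocked by a robots exclusion but do not belong to an administrative area.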
For specific guidance on archiving password-protected WordPress sites, see our guide: How to archive password protected sites.