Google sites - advice for getting a good capture?

Comments

1 comment

  • Official comment
    Mary Haberle

    Hi Dana,

    Thanks for sharing your question. It sounds like some of the stylesheets and/or page templates are out of scope. To confirm, check the Out of Scope column for the host sites.google.com in your crawl report’s Hosts tab. It lists all the documents that the crawler found but determined to be out of scope.

    In the past we’ve noticed that it’s necessary to add a seed-level expand scope rule that includes URLs containing “sites.google.com” to ensure that the look and feel elements of the site are captured. Adding the rule at the seed level is important because the same expand scope rule at the collection level would cause every seed in every crawl to look for Google content.
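
To illustrate the difference between the two levels, here is a minimal sketch of how a "contains" expand scope rule behaves. It is a toy model, not the crawler's actual logic: the function `in_scope`, the parameters `seed_rules` and `collection_rules`, the assumption that default scope means "URL starts with the seed URL", and all of the URLs are hypothetical.

```python
def in_scope(url, seed_url, seed_rules=(), collection_rules=()):
    """Return True if this toy crawler would capture `url` for the given seed."""
    # Assumed default scope (a simplification): the URL sits under the seed URL.
    if url.startswith(seed_url):
        return True
    # Expand scope: any matching "contains" rule brings the URL into scope.
    # Seed-level rules are passed only for the seed they are attached to;
    # collection-level rules are passed for every seed.
    return any(text in url for text in (*seed_rules, *collection_rules))


google_seed = "https://sites.google.com/site/example/"  # hypothetical seed
other_seed = "https://www.example.org/"                 # hypothetical seed
asset = "https://sites.google.com/_/shared/theme.css"    # hypothetical stylesheet
rule = ("sites.google.com",)

results = {
    "no rule, Google seed": in_scope(asset, google_seed),
    "seed-level rule, Google seed": in_scope(asset, google_seed, seed_rules=rule),
    "seed-level rule, other seed": in_scope(asset, other_seed),
    "collection-level rule, other seed": in_scope(asset, other_seed, collection_rules=rule),
}
for label, captured in results.items():
    print(f"{label}: {captured}")
```

In this sketch the stylesheet is captured for the Google Sites seed only once the rule is attached to that seed, and an unrelated seed starts capturing Google content only when the rule is applied to the whole collection.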

    If you don’t notice any improvement or are having trouble reading the hosts report, please feel welcome to submit a support ticket and we’ll take a closer look for you.

    Thanks again,

    Mary
