Does seed redirect have influence on the scoping
When the crawler redirect my seed URL to the same URL without the final slash does this impact the scoping for this seed?
-
Official comment
Seed redirects can happen for a number of reasons, but most commonly happen when the URL on the live website is updated. And it's usually a good thing the crawler can follow them!
When the URL redirects to a version without the final 'trailing slash', this does have the potential to open up the scope of the crawls with the Standard crawling technology. It can sometimes collect some documents from the final directory before the slash, but the data collected is often within reason.If you see this kind of redirect happen and believe it's collected way more than it should have, adding limits to your future crawls can help!
And if that data is a very unreasonable amount, submit a support ticket with the details. Be sure to include a link to the crawl report where you see this happening.Comment actions -
Funnily enough, I just had a seed do the same thing - prompting the same question. I hope someone has some insight for us, Evelyne!
In our case, the seed had a data limit, and so it wasn't possible for it to pull in a ton of out-of-scope content. The docs it gathered seemed typical.
-
Thank you Tanya for the answer. Is there a rule here that makes it possible to understand fully how the scoping will behave or is it more randomly?
I am trying to understand the scoping but I guess that when we want to include the whole content in it is better to remove the final 'trailing slash' from the seed.
Please sign in to leave a comment.
Comments
3 comments