Podcast Seeds and Scoping
I am interested in capturing MyFellowKansans, a podcast site following Governor Laura Kelly of Kansas. I want to crawl each episode as an individual seed under one collection, but my URLs are not accepted. Has anyone successfully crawled podcast sites? Thank you.
-
It looks like the MP3 files are linked directly in the HTML, which is good:
<a href="https://cpa.ds.npr.org/kcur/audio/2018/12/121218_MFKLive_Roundtable.mp3" title="" class="jp-play"></a>
But note that they are hosted on a different hostname, cpa.ds.npr.org, rather than www.kcur.org. I wonder if it would help to configure your crawl to follow links to that other hostname?
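If it helps to check which other hostnames the audio lives on before adjusting scope, here is a minimal sketch in Python (standard library only). It is not part of any crawler's tooling; the names `Mp3LinkCollector` and `off_host_mp3_hosts` are made up for illustration, and it assumes the MP3s appear as plain `<a href="...">` links like the one quoted above.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class Mp3LinkCollector(HTMLParser):
    """Collects href values of <a> tags whose URL path ends in .mp3."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if href and urlparse(href).path.lower().endswith(".mp3"):
            self.links.append(href)


def off_host_mp3_hosts(html, seed_host):
    """Return the sorted hostnames of MP3 links that differ from seed_host."""
    collector = Mp3LinkCollector()
    collector.feed(html)
    hosts = {urlparse(link).hostname for link in collector.links}
    return sorted(h for h in hosts if h and h != seed_host)


# The anchor tag quoted in this thread, checked against the seed hostname.
sample = (
    '<a href="https://cpa.ds.npr.org/kcur/audio/2018/12/'
    '121218_MFKLive_Roundtable.mp3" title="" class="jp-play"></a>'
)
print(off_host_mp3_hosts(sample, "www.kcur.org"))  # ['cpa.ds.npr.org']
```

Any hostname this prints is a candidate to add to the crawl's scope; how that is configured depends on the crawling service you are using.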