Has anyone done any kind of integration with a WordPress site in terms of capturing WordPress categories and tags as Archive-It metadata?
Or does anyone have any suggestions for how we might automate the collection of metadata from a WP site?
To give a concrete example, we are archiving our main site, https://www.internetsociety.org/. On that site, we frequently publish blog posts that already have structured metadata: the author name, category, sometimes tags, etc.
In my ideal world, all of that metadata would somehow be captured so that when I go to the Collection page for the archive - https://archive-it.org/collections/10101 - I could see the categories, authors, etc. on the left side and be able to go to archive search results for those terms.
Now I realize it's hard for a crawler to FIND some of these items in the source of a page, but I wondered if there are specific <link> elements or other elements that we could include to help the Archive-It crawler.
Alternatively, we have an RSS feed - https://www.internetsociety.org/feed/ - that also includes structured metadata.
Could any of this be used to help us automate the creation of metadata in our Archive-It collection?
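To make the RSS idea concrete, here is a minimal Python sketch of the kind of automation I have in mind. It assumes the feed follows the usual WordPress RSS 2.0 layout, with the author in `<dc:creator>` and both categories and tags in `<category>` elements. The sample feed, the function names, and the CSV column names are all made up for illustration; I don't know whether Archive-It can import a file like this or what columns it would expect.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Dublin Core namespace used for <dc:creator> in RSS feeds.
DC_NS = "{http://purl.org/dc/elements/1.1/}"

# Hypothetical sample feed, standing in for the real /feed/ output.
SAMPLE_FEED = """<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>Example blog</title>
    <item>
      <title>Example post</title>
      <link>https://example.org/blog/post-1/</link>
      <dc:creator><![CDATA[Jane Doe]]></dc:creator>
      <category><![CDATA[Internet Governance]]></category>
      <category><![CDATA[IPv6]]></category>
    </item>
  </channel>
</rss>
"""


def extract_post_metadata(rss_xml):
    """Return one dict per <item> with its URL, title, author, and subjects."""
    root = ET.fromstring(rss_xml)
    rows = []
    for item in root.iter("item"):
        rows.append({
            "url": (item.findtext("link") or "").strip(),
            "title": (item.findtext("title") or "").strip(),
            "creator": (item.findtext(DC_NS + "creator") or "").strip(),
            # Categories and tags are not distinguished in the feed.
            "subjects": [c.text.strip() for c in item.findall("category") if c.text],
        })
    return rows


def to_csv(rows):
    """Serialize rows to CSV text; the column names are placeholders."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["url", "title", "creator", "subject"])
    for r in rows:
        writer.writerow([r["url"], r["title"], r["creator"], "; ".join(r["subjects"])])
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(extract_post_metadata(SAMPLE_FEED)))
```

In practice the XML would come from fetching the feed rather than from a string, and the feed only lists recent posts, so something like this would have to run on a schedule to cover every post.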