Fix sitemap generation
Also look into adding a sitemap index file; I think the sitemap generator can be configured to do this.
Thoughts:
- Confirmed last time I did this that leaving URLs out of the sitemap (for example, not listing every record) does not prevent Google from indexing all the records. This change, however, will include every item and collection in the sitemap, so we'll need to test it on prod.
- There is a limit on the number of URLs in a single sitemap file: 50,000, per the sitemaps.org protocol.
- Should probably submit the sitemap to Google Search Console.
- Should set up the rake task to run periodically.
- Should add the URL of the sitemap index file to public/robots.txt to help search engines find the sitemap. This must be the complete (absolute) URL of the sitemap index. For example:
  Sitemap: http://dlg.usg.edu/sitemap.xml.gz
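
To make the 50,000-URL limit and the index file concrete, here is a rough plain-Ruby sketch of the idea. It is not the sitemap generator's own API (which we would configure rather than reimplement). The method names, the sitemapN.xml file names, and the example.org host are all hypothetical and only illustrate how URLs get split across files and tied together by an index.

```ruby
# Sketch only: all names below are hypothetical, not part of any gem.
MAX_URLS_PER_SITEMAP = 50_000 # per-file limit from the sitemap protocol

XML_ESCAPES = {
  '&' => '&amp;', '<' => '&lt;', '>' => '&gt;', '"' => '&quot;', "'" => '&apos;'
}.freeze

# Escape characters that are not allowed raw inside XML text.
def xml_escape(text)
  text.gsub(/[&<>"']/, XML_ESCAPES)
end

# Build one <urlset> document for a batch of page URLs.
def sitemap_xml(urls)
  entries = urls.map { |u| "  <url><loc>#{xml_escape(u)}</loc></url>" }
  [
    '<?xml version="1.0" encoding="UTF-8"?>',
    '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    *entries,
    '</urlset>'
  ].join("\n")
end

# Build the <sitemapindex> document that points at each child sitemap.
def sitemap_index_xml(sitemap_urls)
  entries = sitemap_urls.map { |u| "  <sitemap><loc>#{xml_escape(u)}</loc></sitemap>" }
  [
    '<?xml version="1.0" encoding="UTF-8"?>',
    '<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    *entries,
    '</sitemapindex>'
  ].join("\n")
end

# Split the URLs into files of at most max_per_file entries and return a
# hash of { file name => XML }, with the index stored under 'sitemap.xml'.
def build_sitemaps(urls, host:, max_per_file: MAX_URLS_PER_SITEMAP)
  files = {}
  urls.each_slice(max_per_file).with_index(1) do |batch, number|
    files["sitemap#{number}.xml"] = sitemap_xml(batch)
  end
  child_urls = files.keys.map { |name| "#{host}/#{name}" }
  files['sitemap.xml'] = sitemap_index_xml(child_urls)
  files
end

# Usage with a tiny limit so the split is easy to see.
example_urls = (1..5).map { |i| "http://example.org/record/#{i}" }
example_files = build_sitemaps(example_urls, host: 'http://example.org', max_per_file: 2)
puts example_files.keys.join(', ')
```

With this layout, robots.txt only ever needs the one index URL, no matter how many child sitemap files the record count ends up requiring.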
Other resources:
Edited by Sean Purcell