Here’s some Python code to create a sitemap in the format specified by sitemaps.org and read by search engines. Download the file sitemapmaker.txt and change the extension from .txt to .py.
Change the url variable in the script before running it, or else you’ll point search engines to my website rather than yours. Also, edit the extensions_to_keep variable if you want to index any file types besides HTML and PDF.
Copy the file sitemapmaker.py to the directory on your computer where you have your files. Run the script and direct its output to a file: sitemapmaker.py > sitemap.xml. See sitemaps.org for instructions on how to let search engines know about your sitemap.
This code assumes all the files to index in your sitemap are in one directory, the directory you run the script from. It also assumes the timestamps on your computer match those on your web server. Optional fields are left out of the sitemap.
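For readers who want to see the general idea without downloading anything, here is a minimal sketch of how a script along these lines might work. It is not the contents of sitemapmaker.py. The variable names url and extensions_to_keep come from the description above, but the example.com address, the build_sitemap function, and the choice to write a lastmod date from the local file timestamp are assumptions made for illustration. The other optional fields, changefreq and priority, are omitted.

```python
import os
import time
from urllib.parse import quote
from xml.sax.saxutils import escape

# Placeholder values (assumptions): replace with your own site and file types.
url = "https://www.example.com/"
extensions_to_keep = (".html", ".pdf")


def build_sitemap(directory=".", base_url=url, extensions=extensions_to_keep):
    """Return sitemap XML for matching files directly inside `directory`.

    Hypothetical helper, not taken from the original script.
    """
    extensions = tuple(ext.lower() for ext in extensions)
    lines = [
        '<?xml version="1.0" encoding="UTF-8"?>',
        '<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">',
    ]
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        # Only one directory is scanned; subdirectories are skipped.
        if not os.path.isfile(path):
            continue
        if not name.lower().endswith(extensions):
            continue
        # The local modification time stands in for the server's timestamp.
        modified = time.strftime("%Y-%m-%d", time.gmtime(os.path.getmtime(path)))
        lines.append("  <url>")
        # Percent-encode the file name, then escape XML special characters.
        lines.append("    <loc>" + escape(base_url + quote(name)) + "</loc>")
        lines.append("    <lastmod>" + modified + "</lastmod>")
        lines.append("  </url>")
    lines.append("</urlset>")
    return "\n".join(lines)


if __name__ == "__main__":
    # Print to standard output so the result can be redirected to sitemap.xml.
    print(build_sitemap())
```

Because the sketch prints to standard output, it can be redirected to sitemap.xml the same way as described above. If your files live in several directories, a walk over subdirectories would be needed instead of a single directory listing.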