Skip to content

Crawl any Web page and generate XML sitemap compatible with Google's indexing robots.

License

Notifications You must be signed in to change notification settings

wiejakp/python-sitemap-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python Sitemap Generator

  • Version: 0.4.2
  • Update: 2022/12/26

Python Site Map Generator uses python multi-threaded approach to read all links accessible through the Web site and generate proper sitemap for SEO purposes. Script was meant to use threading technology to allow easy and very fast approach while generating sitemaps for your Web pages. The script will run under Linux operating system which supports Python 3 language.

Use with caution, if you set thread count too high, it can cause your web server to bug out and cause some links to throw an error, or your IP will be blocked due to firewall threashold.

REQUIREMENTS

  • Python 3
  • sudo apt-get install python-beautifulsoup
  • sudo apt-get install python-pip
  • sudo apt-get install python3-pip
  • pip3 install setuptools
  • pip3 install bs4
  • pip3 install lxml
  • pip3 install var_dump

USAGE:

  • Set up the 'InitialURL' variable to point to Web site which you want to generate sitemap for.
  • Set script to executable: sudo chmod +x python-sitemap-generator.py.
  • Run script: python3 python-sitemap-generator.py.

Python Sitemap Generator

About

Crawl any Web page and generate XML sitemap compatible with Google's indexing robots.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages