Skip to content

robpotter89/weblib

 
 

Repository files navigation

Weblib

image

image

image

Weblib provides tools to solve typical tasks in web scraping:

  • processing HTML
  • handling text encodings
  • controling repeating and parallel tasks
  • parsing RSS/ATOM feeds
  • preparing data for HTTP requests
  • working with DOM tree
  • working with text and numeral data
  • list of common user agents
  • cross-platform file locking
  • operations with files and directories

Installation

Run:

pip install -U weblib

Documentation

Docs are incomplete. Most docs are auto-generated from modules/methods docstrings. Check out docs here http://weblib.readthedocs.org/en/latest/

About

Tools for web-scraping

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Python 99.5%
  • Makefile 0.5%