Skip to content

sevas/csxj-crawler

Repository files navigation

Travis_

What is this ?

This software crawls articles published on the frontpage of various online news outlets. For every article, it extracts its title, category, content, links and links to embedded medias. The extracted data is stored in a plaintext database, as a series of JSON files.

3rd party Dependencies

License

This project is licensed under the MIT open-source license. See LICENSE.txt for details.

Notes

This project was tested with python 2.6 and python 2.7.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Packages

No packages published