Galina-Blokh/scrapping_kaggle_itc

Scrapes information about Kaggle competitions from kaggle.com and inserts it into a database.

What is this repository for?

This is a Kaggle parsing project for the ITC data mining task.

How do I get set up?

You need to install ChromeDriver. Please see https://sites.google.com/a/chromium.org/chromedriver/home
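
Before running the scripts, it can help to confirm that ChromeDriver is actually reachable. The snippet below is a minimal sketch, not part of this repository: it assumes the executable is named chromedriver and is expected on your PATH, which may not match how the scripts locate it. The function name find_chromedriver is hypothetical.

```python
import shutil


def find_chromedriver(name="chromedriver"):
    """Return the full path of the executable if it is on PATH, else None.

    Assumption: the driver is looked up on PATH under this name.
    """
    return shutil.which(name)


if __name__ == "__main__":
    path = find_chromedriver()
    if path is None:
        print("chromedriver was not found on PATH; install it first")
    else:
        print(f"chromedriver found at {path}")
```

If the check reports that nothing was found, install ChromeDriver from the link above and make sure its directory is on your PATH.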

To save time, you can start from step 2 and use the links from the file test_links.txt. To do this, run ./run.sh 2. For more information about run.sh, run the script without parameters.

Project contains 3 main scripts (steps):

  1. get_links.py - collects links to Kaggle competitions
  2. main.py - collects data about the competitions
  3. insert_to_db.py - inserts the data into the database

For more information, run any of the scripts with the -h option.
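
As an illustration of how the three steps chain together, here is a small sketch of the "start from step N" idea. The script names come from the list above. The ordering logic is an assumption about what run.sh does, since run.sh itself is not shown here, and the helper scripts_from is hypothetical.

```python
# Script names are taken from the step list in this README.
STEPS = {1: "get_links.py", 2: "main.py", 3: "insert_to_db.py"}


def scripts_from(start_step):
    """Return the scripts to run, in order, beginning at start_step.

    Assumption: starting from step N means running step N and every
    later step, which is one plausible reading of "./run.sh 2".
    """
    if start_step not in STEPS:
        raise ValueError(f"step must be one of {sorted(STEPS)}, got {start_step!r}")
    return [STEPS[n] for n in sorted(STEPS) if n >= start_step]


if __name__ == "__main__":
    # Show which scripts would run when starting from step 2.
    print(scripts_from(2))
```

Under this reading, starting from step 2 skips link collection and relies on test_links.txt for the input links.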

Contribution guidelines

Who do I talk to?