Skip to content

sasha00123/KruzhokCrawlerStage1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Kruzhok Crawler

Implemeted

  • Fetching organizations
  • Checking for website availability
  • Fetching metadata (keywords, description, title)
  • Fetching social accounts (VK, Instagram, Facebook, Twitter) and number of followers

Open problems

  • Checking website ownership (ML model? Expert rules?)

Analyzing data

Collected data stored in results.csv

Running locally

Tested on python 3.8.6, pip 20.2.4

pip install -r requirements.txt
python main.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages