Skip to content

kota999/git-trend

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

git-trend

Crawl & Scraping scripts for Github - Trending, and insert postgres.

Installation

pip install -r requirements.txt
git clone https://github.com/kota999/pgpipelines
cd pgpipelines && python setup.py install && cd ..
git clone https://github.com/kota999/scrapy-dynamodb
cd scrapy-dynamodb && python setup.py install && cd ..

Usage

scrapy crawl github_trend_crawler -o <output_jl_path> [-a timescale=<keyword>]
# timescale default: all
#    keyword:
#            all: crawl for daily and weekly, monthly trending
#            daily: crawl for daily trending
#            weekly: crawl for weekly trending
#            monthly: crawl for monthly trending
#            ! optional !
#               keyword combination is ","
#               ex) timescale=daily,weekly: crawl for daily and weekly trending

Dockernize

# copy customized github-trend to ./dockernize/main/
cd ./dockernize
# if want to set crawling conf, edit ./dockernize/scrape
# Docker Build
cd basei && docker build -t basei . && cd ..
cd main && docker build -t github-crawler . && cd ..
# Docker Run
docker run -it github-crawler

Crawl Language

  • All Language
  • UnKnown Language
  • C
  • C#
  • C++
  • CUDA
  • DockerFile
  • Dart
  • Elixir
  • Erlang
  • Go
  • GraphQL
  • Haskell
  • Java
  • Javascript
  • Julia
  • Jupyter-notebook
  • llvm
  • markdown
  • numpy
  • Objective-C
  • Objective-C++
  • PHP
  • PowerShell
  • Python
  • R
  • Ruby
  • Rust
  • Scala
  • Shell
  • Swift
  • Vim Script

Configure

About

Crawler of github-trending

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published