Skip to content
This repository has been archived by the owner on Dec 28, 2020. It is now read-only.

A simple example of using storytracker and the PastPages API to conduct a link analysis

Notifications You must be signed in to change notification settings

palewire/drudge-domain-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Drudge domain analysis

A simple example of using storytracker and the PastPages API to conduct a link analysis

Getting started

Create a virtualenv and activate it.

$ virtualenv drudge-domain-analysis
$ cd drudge-domain-analysis
$ . bin/activate

Clone the repository and jump into it.

$ git clone https://github.com/pastpages/drudge-domain-analysis.git repo
$ cd repo

Install the requirements.

$ pip install -r requirements.txt

Running the analysis

Download the archived screenshots from PastPages.

$ python download.py

Extract the hyperlinks from each one.

$ python extract.py

Analyze the hyperlinks and spit out the results.

$ python analyze.py

About

A simple example of using storytracker and the PastPages API to conduct a link analysis

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages