Skip to content

nareto/pagerank

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PageRank

For a University exam on numerical methods for Markov chains, I had to read and make an oral report on two articles on PageRank algorithms, on the inner-outer iteration method and fast sampling method.

This repository contains my python implementation of some of those algorithms, which I made to make some quick tests. Unfortunately I was not able to get the sampling methods to work - the sampling function is not working as expected (so the fastsample.py file is to be considered broken). Also the Gauss-Seidel method (gs.py) is really slow: I used scipy.sparse.csr_matrix to store the matrixes, and it works very well for methods that require many matrix x vector products, but very poorly with the Gauss-Seidel method that needs to make lots of accesses to single matrix elements.

I tested the programs with the web graphs from http://snap.stanford.edu/data/index.html and the much smaller Harvard500 graph (from which you have to manually remove the first lines)

Example:

python readandsavegraph.py wiki-Vote.txt wiki float64
python pow.py wiki
python innerouter.py wiki
python plotresidues.py wiki

About

Some algorithms for computing PageRank

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages