Probabilistic Ranking of Researchers

A framework for performing probabilistic ranking on bibliographic data under ambiguity.

To run:

$ python -m src.main

To run a test:

$ python -m test.<test_name>

To run all tests:

$ nosetests test

The format of the training files is:

(ref)<>(author)_(index)<>(coauthor1):(coauthor2):(coauthorI:)*(coauthorN)<>(venue)<>(name)<>(title)<>

where

ref is the reference id
author is the author id
index is an index within the author's references
coauthorI is the i-th coauthor
venue is where it was published (radicals only)
name is the author name used in this reference
title is the title of the paper (radicals only)
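
As an illustration of the format above, here is a minimal parsing sketch. It is not taken from this repository: the names `Reference` and `parse_reference` and the sample line are hypothetical, and the code assumes Python 3. It also assumes that the second field is split at the first underscore, so a field with no underscore yields an empty index.

```python
from typing import List, NamedTuple


class Reference(NamedTuple):
    """One record of a training or test file (hypothetical container)."""
    ref: str
    author: str
    index: str
    coauthors: List[str]
    venue: str
    name: str
    title: str


def parse_reference(line: str) -> Reference:
    """Split one line on '<>' and unpack the six documented fields."""
    fields = line.rstrip("\n").split("<>")
    # The documented format ends with '<>', which leaves a trailing empty field.
    if len(fields) == 7 and fields[-1] == "":
        fields.pop()
    if len(fields) != 6:
        raise ValueError("expected 6 fields, got %d: %r" % (len(fields), line))
    ref, author_index, coauthors, venue, name, title = fields
    # Assumption: author and index are separated by the first underscore.
    author, _, index = author_index.partition("_")
    # Coauthors are separated by ':'; an empty field means no coauthors.
    coauthor_list = [c for c in coauthors.split(":") if c]
    return Reference(ref, author, index, coauthor_list, venue, name, title)


# Made-up sample line, for illustration only.
sample = "12<>3_0<>j smith:a lee<>conf data min<>m jones<>rank research ambigu<>"
record = parse_reference(sample)
print(record.author, record.index, record.coauthors)
```

A named tuple keeps the fields in the same order as the file format, so a record can still be unpacked positionally.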

If possible, group the references into blocks that share the same first initial and last name, with an empty line separating consecutive blocks.
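
The block layout can be read back with a few lines of Python. This is only a sketch under the assumption that any line containing nothing but whitespace separates two blocks; `read_blocks` is a hypothetical helper, not a function from this repository.

```python
from typing import Iterable, List


def read_blocks(lines: Iterable[str]) -> List[List[str]]:
    """Group consecutive non-empty lines into blocks split by empty lines."""
    blocks: List[List[str]] = []
    current: List[str] = []
    for line in lines:
        line = line.rstrip("\n")
        if line.strip():
            current.append(line)
        elif current:
            # An empty line closes the block collected so far.
            blocks.append(current)
            current = []
    if current:
        # The last block may not be followed by an empty line.
        blocks.append(current)
    return blocks


# Made-up lines standing in for references; real lines follow the format above.
example = ["ref a1\n", "ref a2\n", "\n", "ref b1\n"]
print(read_blocks(example))
```

Because the function accepts any iterable of strings, an open file object can be passed directly, for example `read_blocks(open(path))` with a path of your choosing.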

The test file has the same format, but the author and index fields may contain any characters, or even be empty.

*** PACKAGES REQUIRED:

python-Levenshtein (https://pypi.python.org/pypi/python-Levenshtein/)
munkres (https://pypi.python.org/pypi/munkres/)
scikit-learn (https://pypi.python.org/pypi/scikit-learn/0.14.1)
numpy (https://pypi.python.org/pypi/numpy)
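
To check that the packages listed above can be imported before running the project, a small script along these lines may help. It is a sketch, not part of the repository: `missing_packages` is a hypothetical helper, and it assumes Python 3.4 or later for `importlib.util.find_spec`. The import names (`Levenshtein`, `munkres`, `sklearn`, `numpy`) are the module names these distributions normally install.

```python
import importlib.util

# Maps each distribution name to the module name it provides.
REQUIRED = {
    "python-Levenshtein": "Levenshtein",
    "munkres": "munkres",
    "scikit-learn": "sklearn",
    "numpy": "numpy",
}


def missing_packages(required=REQUIRED):
    """Return the distribution names whose module cannot be found."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All required packages are importable.")
```

The check only confirms that each module can be located; it does not verify versions.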

lucianamaroun/probabilistic-ranking
