Paperget

Ever tired of this?

Find some scientific article on DBLP.
Click "electronic edition" and be forced to navigate some publisher's site when all you want is the pdf.
Finally find the pdf, saved under a crazy publisher-generated filename.
Be forced to go back to DBLP to download the matching bibtex file.
Change the pdf/bib filenames to something reasonable.

Paperget automates all of this.

You give paperget fragments of a title, authors, etc., and it figures out how to download the pdf and bib file and saves both with reasonable filenames matching the bibtex key.
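
As a rough illustration of the first step, the fragments can be joined into a single DBLP search query. This is only a sketch: it assumes DBLP's public publication-search endpoint with its `q`, `format` and `h` parameters, and `build_dblp_query` is a hypothetical helper, not a function taken from paperget.py, which may talk to DBLP differently.

```python
from urllib.parse import urlencode

# Assumption: DBLP's public publication-search endpoint.
# paperget itself may query DBLP in another way.
DBLP_SEARCH = "https://dblp.org/search/publ/api"


def build_dblp_query(fragments, max_hits=5):
    """Join free-form fragments (title words, author names) into one search URL."""
    params = {"q": " ".join(fragments), "format": "json", "h": max_hits}
    return DBLP_SEARCH + "?" + urlencode(params)


# Hypothetical usage with the fragments from the example invocation.
url = build_dblp_query(["reingold", "connectivity", "log", "space"])
print(url)
```

Only the URL is built here; fetching it and picking a hit is left out to keep the sketch free of network access.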

Example

$ ./paperget.py reingold connectivity log space
        authors: Omer Reingold
             ee: http://doi.acm.org/10.1145/1391289.1391291
          title: Undirected connectivity in log-space.
          venue: J. ACM
           year: 2008
Is this the paper you are looking for? [Yn]
Use the bibtex key 'reingold2008'? [Yn]
Following http://doi.acm.org/10.1145/1391289.1391291
Found acceptable destination: http://dl.acm.org/citation.cfm?doid=1391289.1391291
Using module ACM to download pdf...
Successfully downloaded ~/papers/reingold2008.pdf
No suitable bib module for: http://dl.acm.org/citation.cfm?doid=1391289.1391291
Falling back to DBLP bibtex information.
Successfully downloaded ~/papers/reingold2008.bib
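
The module selection and the DBLP fallback seen in this output can be pictured as a lookup keyed by the destination's hostname. This is a minimal sketch under assumptions: the registries `PDF_MODULES` and `BIB_MODULES` and the helper `pick_module` are hypothetical names, and the real code in modules/ may be organised quite differently.

```python
from urllib.parse import urlparse

# Hypothetical registries mapping a hostname to a module name.
PDF_MODULES = {"dl.acm.org": "ACM"}
BIB_MODULES = {}  # no publisher-specific bib module in this sketch


def pick_module(url, registry, fallback=None):
    """Return the module registered for the URL's host, or the fallback."""
    host = urlparse(url).hostname
    return registry.get(host, fallback)


dest = "http://dl.acm.org/citation.cfm?doid=1391289.1391291"
pdf_module = pick_module(dest, PDF_MODULES)                   # a pdf module exists
bib_module = pick_module(dest, BIB_MODULES, fallback="DBLP")  # falls back to DBLP
print(pdf_module, bib_module)
```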

The path ~/papers/ and the bibtex key schema are hardcoded but can easily be changed. TODO: Make this configurable.
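
For orientation, here is a sketch of how a key like 'reingold2008' and the two target paths could be derived. The schema (first author's last name in lowercase, followed by the year) is inferred from the example and is an assumption; `bibtex_key` and `target_paths` are hypothetical helpers, not names from paperget.py.

```python
import os

PAPER_DIR = "~/papers/"  # the directory shown in the example


def bibtex_key(authors, year):
    """Assumed schema: first author's last name, lowercased, plus the year."""
    last_name = authors[0].split()[-1].lower()
    return f"{last_name}{year}"


def target_paths(key, paper_dir=PAPER_DIR):
    """Return the (pdf, bib) paths that share the bibtex key as basename."""
    base = os.path.join(os.path.expanduser(paper_dir), key)
    return base + ".pdf", base + ".bib"


key = bibtex_key(["Omer Reingold"], 2008)
pdf_path, bib_path = target_paths(key)
print(key, pdf_path, bib_path)
```

Passing a different `paper_dir`, or swapping in another key function, is one way the hardcoded values could become configurable.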

What it doesn't do

Paperget does not break paywalls. Downloading pdfs from a paywalled site only works if you have access, for example via your academic institution. That said, if paperget hits a paywall, it will search Google for the paper's title. Often the paper turns out to be available for free on the author's website.

Paperget does not support batched downloads.
