tirami/udadisi-google-scholar

udadisi-twitter

The Twitter miner part of the Udadisi project. The miner is configured with a set of Twitter user ids and subscribes to their streams. When a Tweet is recorded, the miner processes it with some simple NLP to extract the interesting words and saves the processed Tweet to a database. The Tweets can then be retrieved in bulk by the Udadisi Aggregator.
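
As a rough illustration of the "simple NLP" step, the sketch below filters common stopwords out of a piece of text and keeps the remaining words. It is not the miner's actual code: the function name extract_terms and the small hard-coded STOPWORDS set are assumptions made for this example (the real miner relies on the NLTK stopwords corpus, described under Installation).

```python
import re

# Hypothetical stand-in for the NLTK English stopwords corpus.
STOPWORDS = {"the", "a", "an", "is", "of", "and", "to", "in", "on", "for", "with", "at"}


def extract_terms(text):
    """Return the lowercase words in text that are not stopwords."""
    # Split on anything that is not a letter or apostrophe.
    words = re.findall(r"[a-z']+", text.lower())
    return [word for word in words if word not in STOPWORDS]
```

Calling extract_terms on the text of a Tweet would give the list of terms that a miner of this kind might store alongside it.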

Installation

The miner is built with Python 2.7, the Flask framework, and several other libraries. Assuming you have an up-to-date Python 2.7 environment, the simplest way to install all the dependencies is to use pip with the requirements file in the repository:

pip install -r requirements.txt

Next, install the NLTK stopwords corpus using the NLTK downloader. The NLTK documentation explains how to install corpora.
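
One way to fetch the stopwords corpus from Python is to call NLTK's downloader directly. This is a minimal sketch, assuming NLTK is already installed from requirements.txt and the machine has network access; the wrapper name install_stopwords is hypothetical.

```python
def install_stopwords():
    """Download the NLTK stopwords corpus using the NLTK downloader."""
    # Imported lazily so this module can be loaded before NLTK is installed.
    import nltk

    return nltk.download("stopwords")
```

Run install_stopwords() once after installing the dependencies; the corpus is stored in NLTK's data directory and does not need to be downloaded again.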

The miner is then started by running runserver.py with Python:

python runserver.py

Usage

The miner is designed to be managed via the admin interface of the Udadisi engine. It exposes the following API endpoints.

GET /categories returns an HTML table listing all the categories currently available.

POST /categories takes a JSON object in the request body. Content-Type should be application/json. The object should contain a single key named 'id' with an integer value. This creates a new category with that id.

GET /categories/<category_id> returns a form for viewing and editing the settings for the category.

POST /categories/<category_id> processes the form and updates the settings for the category. When the settings are saved, the miner begins mining that category.

DELETE /categories/<category_id> stops the miner from mining that category and deletes the category.
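
The endpoints above can be exercised from a small client. The sketch below builds the create and delete requests with the Python 3 standard library; the miner itself targets Python 2.7, so this is client-side code only. The base URL http://localhost:5000 (Flask's default development address) and both function names are assumptions for illustration, not part of the miner.

```python
import json
import urllib.request

# Assumed address of a locally running miner; adjust to your deployment.
BASE_URL = "http://localhost:5000"


def create_category_request(category_id, base_url=BASE_URL):
    """Build a POST /categories request with a JSON body of {'id': category_id}."""
    body = json.dumps({"id": category_id}).encode("utf-8")
    return urllib.request.Request(
        base_url + "/categories",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def delete_category_request(category_id, base_url=BASE_URL):
    """Build a DELETE /categories/<category_id> request."""
    url = "{}/categories/{}".format(base_url, category_id)
    return urllib.request.Request(url, method="DELETE")
```

To send a request, pass it to urllib.request.urlopen, for example urllib.request.urlopen(create_category_request(1)). Building the request separately from sending it makes it easy to inspect the method, URL, and body first.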

About

A Udadisi miner for terms in Google Scholar abstracts.
