Skip to content

nyimbi/memex

 
 

Repository files navigation

This repository contains satelite projects of ache, a focused crawler.

To install it, you need:

  1. install git sudo apt-get install git

  2. Install Java, like the OpenJDK such as java-7-openjdk sudo apt-get install openjdk-7-jdk

  3. Make sure python 2.7 is installed

  4. virtualenv sudo pip2 install virtualenv

  5. install fabrick pip2 install fabric

  6. install numpy and scipy sudo apt-get install pyton-numpy python-scipy

  7. install elasticsearch wget https://download.elastic.co/elasticsearch/elasticsearch/elasticsearch-1.5.2.deb sudo dpkg -i elasticsearch-1.5.2.deb rm elasticsearch-1.5.2.deb

  8. Start elasticsearch, for now: sudo /etc/init.d/elasticsearch start

  9. On this directory (seed_crawler), type: fab setup

    It will take some time to proceed, download everything you need, check that things are properly installed, and stop.

  10. Run the program: fab runvis

  11. Open a web browser and connect to the vis server at the following url: http://localhost:8084/seedcrawler

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages

  • HTML 84.3%
  • Classic ASP 9.7%
  • Java 2.2%
  • JavaScript 1.4%
  • PHP 1.1%
  • Python 1.0%
  • Other 0.3%