Navigation Menu

Skip to content

chrismattmann/parser-indexer-py

 
 

Repository files navigation

Parser-Indexer

Requirements

  1. Solr
  2. Parser Server

1. Setting up solr

Download Solr

mkdir workspace && cd workspace
wget http://archive.apache.org/dist/lucene/solr/6.1.0/solr-6.1.0.tgz
tar xvzf solr-6.1.0.tgz
cd solr-6.1.0

Start and Create a Core

PORT=8983
bin/solr start -p $PORT
bin/solr create_core -c docs -d $YOUR_PATH/conf/solr/docs -p $PORT

2. Parser Server

Refer to README of parser-server in sub directory.

Examples :

Checkout docs folder.

  • To parse and index jounrals : docs/parser-index-journals.md

About

Python tools for parsing documents and building the inverted index with enriched metadata

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • XSLT 35.4%
  • JavaScript 33.0%
  • Python 21.7%
  • CSS 5.3%
  • HTML 4.1%
  • Shell 0.5%