Skip to content

Vangelys/CoreNLP-Semafor-Pipeline

Repository files navigation

Internship project : pipeline between Stanford Core NLP and Semafor

1st may / 20 july - Guilherme RAZET, LORIA, Nancy

Setup

  1. Clone the repository ( link);

  2. Open a terminal in CoreNLP-Semafor-Pipeline;

  3. Run this command : ./bin/install.sh [language] with language = [a=arabic] [c=chinese] [e=english] [f=french] [g=german] [s=spanish];

  4. Check environnements variables in bin/config.sh;

The automatic report generation will work only if you have a LaTeX compiler (as TexLive) on your computer.

The pipeline is ready !

Use

  1. Place your document in the folder data (this document must be a plain text file);

  2. Open a terminal in CoreNLP-Semafor-Pipeline;

  3. Run this command :./bin/runSemafor.sh [input] [output] [number of threads] [language], with :

    1. [input] = name of your document (exemple : test.txt);

    2. [output] = path and name of your output (exemple : data/test.out.xml). CAUTION : this document must be in .xml and it must not exist;

    3. [number of threads] = number of threads used in the process, usually at least 2;

    4. [language] = language of the document, with [a=arabic] [c=chinese] [e=english] [f=french] [g=german] [s=spanish]. CAUTION : the language package must be download before the process, usually during the install.

This process need 2MiB of free RAM, else it will not work.

Documentation

Documentation of Stanford Core NLP : click here.

Documentation of Semafor : click here.

Documentation of PyLateX : click here.

About

Internship project : pipeline between Semafor and Stanford Core NLP

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published