Phrasal: A statistical machine translation system

Phrasal is licensed under the GPL. For details, please see the file LICENSE.txt in the root directory of this software package.

Copyright (c) 2007-2015 The Board of Trustees of The Leland Stanford Junior University. All Rights Reserved.

Installation

Both Ant and Gradle build scripts are provided. We strongly recommend Gradle, which fetches the dependencies and builds a jar for you.

The Phrasal decoder requires Stanford CoreNLP. We recommend that you clone and build the latest version of its repository from GitHub.

Some advanced Phrasal features have external dependencies. If you build Phrasal with Gradle, these dependencies will be retrieved for you.

Linux

  1. Set the CORENLP_HOME environment variable to the root of the CoreNLP repository, which must already be built.

  2. gradle build

  3. (Optional) Build Eclipse project files by typing: gradle eclipse.

  4. (Optional, requires JDK) Build the KenLM loader: gradle compileKenLM.

  5. (Optional) Create classpath and startup scripts for Phrasal: gradle startupScripts.
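The Linux steps above can be sketched as a small shell script. This is a minimal sketch, not part of the Phrasal distribution: the function names check_phrasal_prereqs and build_phrasal are hypothetical, and the script assumes it is run from the root of the Phrasal checkout. Only the Gradle tasks named in the steps above are used.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with Phrasal): verify the prerequisites
# from step 1 before invoking Gradle.
check_phrasal_prereqs() {
    if [ -z "${CORENLP_HOME:-}" ]; then
        echo "CORENLP_HOME is not set" >&2
        return 1
    fi
    if [ ! -d "$CORENLP_HOME" ]; then
        echo "CORENLP_HOME is not a directory: $CORENLP_HOME" >&2
        return 1
    fi
    if ! command -v gradle >/dev/null 2>&1; then
        echo "gradle not found on PATH" >&2
        return 1
    fi
    return 0
}

# Hypothetical wrapper: run step 2 and the optional step 5.
# Assumes the current directory is the Phrasal repository root.
build_phrasal() {
    check_phrasal_prereqs || return 1
    gradle build || return 1
    gradle startupScripts
}
```

To use it, export CORENLP_HOME with the path to your own CoreNLP checkout, source the script, and call build_phrasal. The prerequisite check only confirms that the directory exists; it does not verify that CoreNLP was actually built.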

Windows

Installation of Gradle is required. Follow the Linux instructions above. Then be sure to run gradle startupScripts to generate a .bat file for running Phrasal.

Stanford NLP cluster

We used to commit some third-party dependencies to the repository and fetch the rest with a bash script that called wget. That script failed whenever the URLs for archived versions of libraries changed (this was especially true for jetty). Now we use Gradle/Maven to fetch the dependencies. Once you've fetched them, you can compile with ant as before.

  1. Download and unpack Gradle into your home directory.

  2. Add Gradle to PATH: export PATH="$PATH":/u/username/NLP-HOME/gradle/bin

  3. Set reference to JavaNLP: export CORENLP_HOME=/u/username/NLP-HOME/javanlp/projects/core

  4. Download external dependencies: gradle copyDeps

  5. ant all

  6. Update CLASSPATH: export CLASSPATH=$CLASSPATH:/u/username/NLP-HOME/phrasal/classes
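The environment settings in steps 2, 3, and 6 can be collected into one shell function. This is a sketch under stated assumptions: setup_phrasal_env is a hypothetical name, and its argument is assumed to be a directory laid out as in the steps above, with gradle/, javanlp/, and phrasal/ beneath it. Unlike the literal export in step 6, it avoids a leading colon when CLASSPATH is initially empty, since an empty classpath entry is treated as the current directory.

```shell
#!/bin/sh
# Hypothetical helper: export the variables from steps 2, 3, and 6.
# $1 is assumed to contain gradle/, javanlp/, and phrasal/.
setup_phrasal_env() {
    if [ -z "${1:-}" ]; then
        echo "usage: setup_phrasal_env NLP_HOME_DIR" >&2
        return 1
    fi
    nlp_home=$1

    # Step 2: put Gradle on PATH.
    export PATH="$PATH:$nlp_home/gradle/bin"

    # Step 3: point CORENLP_HOME at JavaNLP core.
    export CORENLP_HOME="$nlp_home/javanlp/projects/core"

    # Step 6: append the Phrasal classes directory, adding the ':'
    # separator only when CLASSPATH already has a value.
    export CLASSPATH="${CLASSPATH:+$CLASSPATH:}$nlp_home/phrasal/classes"
}
```

For example, after sourcing the function, run setup_phrasal_env /u/username/NLP-HOME, then gradle copyDeps and ant all from the Phrasal directory. Setting CLASSPATH before the build, rather than after it as in step 6, is a simplification in this sketch.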

Contributors

  • Daniel Cer (original author)
  • Michel Galley
  • Spence Green
  • John Bauer
  • Chris Manning

Documentation / User Guide

See the user guide for complete installation and configuration instructions. The guide also contains a tutorial for building an MT system from raw text.

Support

We have three mailing lists for Phrasal, all of which are shared with other JavaNLP tools (with the exception of the parser).

Each address is at @lists.stanford.edu:

java-nlp-user -- This is the best list to post to in order to ask questions, make announcements, or for discussion among JavaNLP users. You have to subscribe to be able to use it. Join the list via this webpage or by emailing java-nlp-user-join@lists.stanford.edu. (Leave the subject and message body empty.) You can also look at the list archives.

java-nlp-announce -- This list is used only to announce new versions of Stanford JavaNLP tools, so it is very low volume (expect 1-3 messages a year). Join the list via this webpage or by emailing java-nlp-announce-join@lists.stanford.edu. (Leave the subject and message body empty.)

java-nlp-support -- This list goes only to the software maintainers. It's a good address for licensing questions, etc. For general use and support questions, please join and use java-nlp-user. You cannot join java-nlp-support, but you can mail questions to java-nlp-support@lists.stanford.edu.
