These are the homework assignments for CSCI 544: Applied Natural Language Processing, Spring 2016.
Homework Description:
Homework 1:
- Computation of all the anagrams of a word.
- Conversion of UTF-16 to UTF-8.
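As a rough illustration of the two Homework 1 tasks, here is a minimal Python sketch. It is not the submitted solution: the function names `anagrams` and `utf16_to_utf8` are hypothetical, and the conversion leans on Python's built-in codecs, whereas the assignment may have required doing the bit manipulation by hand.

```python
from itertools import permutations


def anagrams(word):
    """Return every distinct rearrangement of the letters in word, sorted."""
    # A set removes duplicates that arise when the word has repeated letters.
    return sorted({"".join(p) for p in permutations(word)})


def utf16_to_utf8(data):
    """Decode UTF-16 bytes and re-encode the text as UTF-8.

    Assumption: the input starts with a byte-order mark (BOM); without one,
    Python's "utf-16" codec falls back to the platform's native byte order.
    """
    return data.decode("utf-16").encode("utf-8")
```

For example, `anagrams("abc")` yields the six orderings of the three letters. Note that the number of anagrams grows factorially with word length, so this brute-force approach only suits short words.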
Homework 2:
Developing a Naive Bayes classifier to classify hotel reviews.
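The general idea can be sketched as a multinomial Naive Bayes classifier with add-one smoothing. This is an assumption about the approach, not the code in this repository: the labels, the function names `train_nb` and `classify_nb`, and the choice to skip unseen words are all illustrative.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """docs: list of (tokens, label) pairs. Returns the counts needed to classify."""
    class_counts = Counter()            # number of documents per label
    word_counts = defaultdict(Counter)  # word_counts[label][word]
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def classify_nb(tokens, model):
    """Return the label with the highest log posterior for the given tokens."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, float("-inf")
    for label in class_counts:
        score = math.log(class_counts[label] / total_docs)  # log prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for tok in tokens:
            if tok in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][tok] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label
```

Working in log space avoids numeric underflow when many small probabilities are multiplied together.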
Homework 3:
Developing a Hidden Markov Model part-of-speech tagger for Catalan.
The training data is already provided tokenized and tagged. The test data will be provided tokenized, and our tagger will add the tags.
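A bigram HMM tagger of this kind is typically trained by counting transitions and emissions, then decoded with the Viterbi algorithm. The sketch below assumes add-one smoothing and a hypothetical input format (lists of `(word, tag)` pairs); the actual data format, tagset, and smoothing used in the homework are not stated here.

```python
import math
from collections import Counter, defaultdict

START = "<s>"  # hypothetical start-of-sentence state


def train_hmm(tagged_sentences):
    """tagged_sentences: list of sentences, each a list of (word, tag) pairs."""
    trans = defaultdict(Counter)  # trans[prev_tag][tag]
    emit = defaultdict(Counter)   # emit[tag][word]
    vocab = set()
    for sentence in tagged_sentences:
        prev = START
        for word, tag in sentence:
            trans[prev][tag] += 1
            emit[tag][word] += 1
            vocab.add(word)
            prev = tag
    return trans, emit, vocab


def _log_prob(counter, key, n_outcomes):
    """Add-one smoothed log probability of key under the given counts."""
    return math.log((counter[key] + 1) / (sum(counter.values()) + n_outcomes))


def viterbi(words, model):
    """Return the most likely tag sequence for words under the trained model."""
    trans, emit, vocab = model
    tags = list(emit)
    if not words:
        return []
    n_words = len(vocab) + 1  # one extra outcome reserved for unknown words
    # best[i][tag] = (log prob of the best path ending in tag at i, previous tag)
    best = [{}]
    for tag in tags:
        score = _log_prob(trans[START], tag, len(tags))
        score += _log_prob(emit[tag], words[0], n_words)
        best[0][tag] = (score, None)
    for i in range(1, len(words)):
        best.append({})
        for tag in tags:
            score, prev = max(
                (best[i - 1][p][0] + _log_prob(trans[p], tag, len(tags)), p)
                for p in tags
            )
            best[i][tag] = (score + _log_prob(emit[tag], words[i], n_words), prev)
    # Follow the back-pointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for i in range(len(words) - 1, 0, -1):
        tag = best[i][tag][1]
        path.append(tag)
    return path[::-1]
```

Decoding costs O(n·T²) for a sentence of n words and T tags, which is what makes Viterbi practical compared with enumerating all Tⁿ tag sequences.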
Homework 4:
We implemented a program that calculates the BLEU evaluation metric, as defined in Papineni, Roukos, Ward and Zhu (2002), "Bleu: a Method for Automatic Evaluation of Machine Translation", ACL 2002.
We ran the program on sets of candidate and reference translations, with data in four languages: German, Greek, Portuguese and English.
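For reference, corpus-level BLEU combines clipped (modified) n-gram precisions for n = 1 to 4 with a brevity penalty. The following is a minimal sketch under the assumption that inputs are already tokenized; the function names and the input layout are hypothetical and do not describe the program in this repository.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu(candidates, references, max_n=4):
    """candidates: list of token lists.
    references: for each candidate, a list of reference token lists."""
    log_precision = 0.0
    for n in range(1, max_n + 1):
        clipped, total = 0, 0
        for cand, refs in zip(candidates, references):
            cand_counts = ngrams(cand, n)
            max_ref = Counter()
            for ref in refs:
                max_ref |= ngrams(ref, n)  # Counter union keeps the max count
            # Each candidate n-gram is credited at most its max reference count.
            clipped += sum(min(c, max_ref[g]) for g, c in cand_counts.items())
            total += sum(cand_counts.values())
        if clipped == 0:
            return 0.0  # any zero precision makes the geometric mean zero
        log_precision += math.log(clipped / total) / max_n
    c = sum(len(cand) for cand in candidates)
    # Effective reference length: the closest reference length per candidate,
    # with ties broken in favour of the shorter reference.
    r = sum(
        min((abs(len(ref) - len(cand)), len(ref)) for ref in refs)[1]
        for cand, refs in zip(candidates, references)
    )
    bp = 1.0 if c > r else math.exp(1 - r / c)
    return bp * math.exp(log_precision)
```

Clipping is what stops a degenerate candidate such as "the the the the the the the" from scoring a perfect unigram precision: against the references in the paper's example it gets 2/7 instead of 7/7.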