Skip to content

jcblender/word-type-annotation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 

Repository files navigation

POS Tagging, Smoothing and Unknown Words

Introduce

A POS tagging project uses HMM and Viterbi written in Python by 190000096. This project investigates whether the use of Witten-Bell Smoothing and UNK tags will improve the tagging accuracy in English and other languages.

Content

This project includes three .py files: 1.smooth.py : This smoothing program investigates whether the use of Witten-Bell Smoothing will improve the accuracy. 2.tagging.py: This tagging program investigates whether the use of UNK tags will improve the accuracy in English. It may takes about half an hour to output. 3.tagging_for_portuguese.py : This tagging program investigates whether the use of UNK tags will improve the accuracy in Portuguese language.

Requirement

1.Python 3.8 2.NLTK Package 3.Texttable package

How To Start

$ python3 code/smooth.py
$ python3 code/tagging.py
$ python3 code/tagging_for_portuguese.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages