GitHub - HalinGG/Natural-Language-Processing: Algorithms for modeling natural language.

"# Natural-Language-Processing"

Natural Language Processing(NLP): These files contain an implementation of the baum welch algorithm. The algorithm is used to recognize patterns in sentences such as grammar by tagging a sequence of words with a sequence of hidden Markova model states. The formal problem is usually referred to as parts of speech tagging. This algorithm also has many other uses in recognizing time series such as predicting stock values, speech recognition, genetics and even body frame recognition.

The algorithm is trained inside nlp_training.py where it is feed a .dat file containing the brown corpus and a training file with any English text. The algorithm will recognize the patterns in the training file and use these label words with it's states these states can then be statistically compared against words labeled with English grammar symbols. The brown_words.dat file contains a corpus that is labeled with correct English grammar symbols. However, in practice one might often see the labels left as raw states.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.cache/v/cache		.cache/v/cache
__pycache__		__pycache__
README.md		README.md
asop.txt		asop.txt
hmm2.dat		hmm2.dat
nlp.py		nlp.py
nlp_main.py		nlp_main.py
nlp_plot.py		nlp_plot.py
nlp_training.py		nlp_training.py
test_nlp.py		test_nlp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.cache/v/cache

.cache/v/cache

pycache

pycache

README.md

README.md

asop.txt

asop.txt

hmm2.dat

hmm2.dat

nlp.py

nlp.py

nlp_main.py

nlp_main.py

nlp_plot.py

nlp_plot.py

nlp_training.py

nlp_training.py

test_nlp.py

test_nlp.py

Repository files navigation

About

Releases

Packages

Languages

HalinGG/Natural-Language-Processing

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages