Skip to content

deyim/NLT_PROJECT

Repository files navigation

NLT Final Project

1 Project Object

2 Procedure

A. Combine IC and TFIDF to extract significant words from abstract

3 How to run

You can modify freqs and corpus from source folder.

Normalizing needs precalculation for information content and TFIDF

To calculate information content python CalculateInformationContent.py To calculate TFIDF python CalculateTFIDF.py

each results will be stroed in source/ic.txt and source/tfidf.txt

Normalize and extract words

python Start.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages