Simple Python indexer that analyzes a set of HTML documents and returns the direct and indirect indexes

bcbogdan/html_indexer

html-indexer

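A simple Python indexer that analyzes a set of HTML documents and returns two indexes: the direct index (the terms found in each document) and the indirect, or inverted, index (the documents in which each term appears).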
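To make the two index types concrete, here is a minimal sketch using only the standard library. It is not the code from html_indexer.py: the names `TextExtractor`, `tokenize`, and `build_indexes` are hypothetical, the tokenizer is a plain regular expression, and the NLTK stopword and WordNet steps that the install section hints at are left out.

```python
from collections import Counter, defaultdict
from html.parser import HTMLParser
import re


class TextExtractor(HTMLParser):
    """Collects text nodes of an HTML document, skipping script/style content."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chunks.append(data)


def tokenize(html):
    """Return lowercase alphanumeric tokens from the visible text (hypothetical helper)."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return re.findall(r"[a-z0-9]+", " ".join(parser.chunks).lower())


def build_indexes(documents):
    """Build both indexes from a {document name: html string} mapping."""
    # Direct index: document -> {term: occurrences in that document}
    direct = {name: Counter(tokenize(html)) for name, html in documents.items()}
    # Inverted index: term -> {document: occurrences in that document}
    inverted = defaultdict(dict)
    for name, counts in direct.items():
        for word, count in counts.items():
            inverted[word][name] = count
    return direct, dict(inverted)


docs = {
    "a.html": "<html><body><h1>Red fox</h1><p>The fox runs.</p></body></html>",
    "b.html": "<html><body><p>The red car.</p></body></html>",
}
direct, inverted = build_indexes(docs)
```

The direct index answers "which terms does this document contain?", while the inverted index answers "which documents contain this term?". Building the second from the first keeps the two consistent.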

install

run pip install -r requirements.txt
run nltk.py
run python -m nltk.downloader wordnet stopwords

run

python html_indexer.py input_folder special_files_folder result_folder
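The command takes three positional arguments. As a rough illustration only, the sketch below shows one way a script could read them with `argparse`. Whether html_indexer.py actually uses `argparse` is an assumption, the function name `parse_args` is hypothetical, and the README does not say what `special_files_folder` contains, so the sketch treats it as an opaque path.

```python
import argparse
from pathlib import Path


def parse_args(argv=None):
    """Parse the three positional folders named in the run command (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="html_indexer.py")
    parser.add_argument("input_folder", type=Path, help="folder with the HTML documents")
    parser.add_argument("special_files_folder", type=Path, help="purpose not stated in the README")
    parser.add_argument("result_folder", type=Path, help="folder where the indexes are written")
    return parser.parse_args(argv)


# Example invocation with placeholder folder names
args = parse_args(["pages", "special", "out"])
print(args.input_folder, args.special_files_folder, args.result_folder)
```

Passing `argv` explicitly keeps the parser easy to exercise without touching `sys.argv`.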

