Automatic Keyword Extraction with Streamlit

A simple Streamlit app that displays a keyword extraction report for a corpus.

Usage

In your command line, run make run-app to pull and run the latest Docker image for the app.
Once in the app, submit a corpus for processing by dropping the zip file containing your dataset into the file uploader.
Select the extraction model to use. Currently supported models are TfIdf, TextRank and EmbedRank. You can also select how many keywords to extract from each document.
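
To make the upload and extraction steps concrete, here is a minimal standard-library sketch of reading a zipped corpus and ranking terms by TF-IDF. It is an illustration only, not the app's implementation: the function names (read_corpus, tokenize, tfidf_keywords, make_demo_zip), the assumption that the archive holds UTF-8 .txt files, and the plain tf * log(N/df) weighting are all hypothetical choices made for this example.

```python
import io
import math
import re
import zipfile
from collections import Counter


def read_corpus(zip_bytes):
    """Return {filename: text} for every .txt member of an in-memory zip archive."""
    corpus = {}
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for name in archive.namelist():
            if name.endswith(".txt"):
                # Assumption: documents are UTF-8 encoded text files.
                corpus[name] = archive.read(name).decode("utf-8")
    return corpus


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def tfidf_keywords(corpus, top_k=3):
    """Rank each document's terms by TF-IDF and keep the top_k of them."""
    tokens = {name: tokenize(text) for name, text in corpus.items()}
    n_docs = len(tokens)
    # Document frequency: number of documents containing each term.
    doc_freq = Counter(term for words in tokens.values() for term in set(words))
    keywords = {}
    for name, words in tokens.items():
        counts = Counter(words)
        scores = {
            term: (count / len(words)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        }
        # Sort by descending score, breaking ties alphabetically.
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        keywords[name] = ranked[:top_k]
    return keywords


def make_demo_zip():
    """Build a tiny two-document archive in memory for demonstration."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("a.txt", "the cat chased the cat toy")
        archive.writestr("b.txt", "the dog chased the ball")
    return buffer.getvalue()
```

Calling tfidf_keywords(read_corpus(make_demo_zip()), top_k=2) ranks the terms unique to each document first, because words shared by every document get an inverse document frequency of zero.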

In the main section of the page, you'll see a summary of the extraction. You can also select a larger, full-fledged report that shows the keywords in context.
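
As a rough idea of what showing keywords in context can mean, the sketch below returns a short snippet around each occurrence of a keyword. The function name keyword_in_context, the window parameter, and the bracket highlighting are assumptions made for this example; the app's report may be built differently.

```python
import re


def keyword_in_context(text, keyword, window=20):
    """Return one snippet per whole-word match, with `window` characters on each side."""
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    snippets = []
    for match in pattern.finditer(text):
        start = max(0, match.start() - window)
        end = min(len(text), match.end() + window)
        left = text[start:match.start()]
        right = text[match.end():end]
        # Mark the keyword with brackets so it stands out in plain text.
        snippets.append(f"{left}[{match.group(0)}]{right}")
    return snippets
```

Matching is case-insensitive and limited to whole words, so a keyword such as cat does not match inside category.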

Loading the EmbedRank model may take some time on the first run, as the Universal Sentence Encoder needs to be fetched from TensorFlow Hub. It should be cached for subsequent executions.
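
How the app caches the model is not stated here. Purely to illustrate the load-once, reuse-afterwards pattern, the sketch below memoizes a slow loader with functools.lru_cache. The load_encoder function and its placeholder return value are hypothetical stand-ins, not the app's code or the TensorFlow Hub API.

```python
from functools import lru_cache

load_count = 0  # counts how many times the slow load actually runs


@lru_cache(maxsize=None)
def load_encoder(model_name):
    """Hypothetical stand-in for an expensive model download."""
    global load_count
    load_count += 1
    return {"name": model_name}  # placeholder for the real model object


# The second call with the same argument is served from the cache.
first = load_encoder("universal-sentence-encoder")
second = load_encoder("universal-sentence-encoder")
```

Both calls return the same object, and the loader body runs only once.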

For development

Make sure you have Python 3.7 installed on your local machine.
Install pipenv via pip install pipenv.
Run make setup.

TODO

Allow the user to select model parameters from the sidebar.
Add a default corpus.
Add a filter for documents in the full report.
Add full support for EmbedRank++.

ndilsou/keyword-extractor
