h2researchgroup/embeddings

Word embeddings with JSTOR data

Word2Vec, hierarchical clustering, and visualization

This code takes raw JSTOR OCR text and expert-generated dictionaries as input, trains word embeddings on the text, uses the embeddings to expand the dictionaries, computes cosine similarities between documents and dictionaries, and visualizes overall trends.
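
To make the dictionary-expansion and doc/dict similarity steps concrete, here is a minimal sketch using only the Python standard library. It is not the repository's implementation: the toy `EMBEDDINGS` table, the helper names (`cosine`, `mean_vector`, `expand_dictionary`, `doc_dict_similarity`), and the choice to represent a document or dictionary by the mean of its word vectors are all assumptions made for illustration. In practice the vectors would come from a trained Word2Vec model rather than a hand-written table.

```python
import math

# Hypothetical 3-dimensional embeddings; real vectors would come from a
# Word2Vec model trained on the JSTOR text.
EMBEDDINGS = {
    "culture": [0.9, 0.1, 0.0],
    "norms": [0.8, 0.2, 0.1],
    "values": [0.85, 0.15, 0.05],
    "market": [0.1, 0.9, 0.0],
    "price": [0.0, 0.95, 0.1],
    "population": [0.1, 0.1, 0.9],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean_vector(words, embeddings):
    """Average the vectors of all in-vocabulary words."""
    vectors = [embeddings[w] for w in words if w in embeddings]
    if not vectors:
        raise ValueError("no in-vocabulary words to average")
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def expand_dictionary(seed_words, embeddings, top_n=1):
    """Append the top_n vocabulary words closest to the seed centroid."""
    centroid = mean_vector(seed_words, embeddings)
    candidates = [w for w in embeddings if w not in seed_words]
    ranked = sorted(
        candidates,
        key=lambda w: cosine(embeddings[w], centroid),
        reverse=True,
    )
    return list(seed_words) + ranked[:top_n]


def doc_dict_similarity(doc_tokens, dict_words, embeddings):
    """Cosine similarity between a document's and a dictionary's mean vectors."""
    return cosine(
        mean_vector(doc_tokens, embeddings),
        mean_vector(dict_words, embeddings),
    )


# Expand a two-word seed dictionary, then score a toy document against it.
cultural = expand_dictionary(["culture", "norms"], EMBEDDINGS, top_n=1)
doc = "market price values".split()
score = doc_dict_similarity(doc, cultural, EMBEDDINGS)
print(cultural, round(score, 3))
```

Averaging word vectors is one simple way to compare a document with a dictionary; weighting words (for example by TF-IDF) is a common alternative when long documents dilute the signal.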

Guide to Codebase

Word2Vec:

Dictionary Method:

Validation and trend visualization:

Clustering:
