GR 5067 Natural Language Processing

This repo contains assignments for the class GR 5067: Natural Language Processing offered by the Columbia University Quantitative Methods in Social Science department. The class and its assignments aim to "provide a detailed tour on how to access, clean, “munge” and organize data, both big and small." (taken from the course syllabus, which the instructor would prefer not to be forked).

Course assignments focused on:

HW1 - Familiarising students with Python syntax
HW2 - Use a Google search crawler (instructor provided) to generate a corpus of text files
HW3 - Simple word search, and model based sentiment analysis
HW4 - Streaming twitter classifier

The course final project was a free-choice natural language processing project, and a class presentation. I chose to run an LDA model on the Book of Psalms. A full report is available in this repo.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
HW1		HW1
HW2		HW2
HW3		HW3
HW4		HW4
Project		Project
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HW1

HW1

HW2

HW2

HW3

HW3

HW4

HW4

Project

Project

.gitattributes

.gitattributes

.gitignore

.gitignore

README.md

README.md

Repository files navigation

GR 5067 Natural Language Processing

About

Releases

Packages

Languages

timothyLeeXQ/GR-5067-Natural-Language-Processing

Folders and files

Latest commit

History

Repository files navigation

GR 5067 Natural Language Processing

About

Resources

Stars

Watchers

Forks

Languages