Semi-supervised Recursive Autoencoders for Chinese

Twitter data from the first 2008 Presidential debate. The data set contains 3,238 tweets.

The data is in the ./data folder:

rawdata.tsv - the whole data set
test.tsv - a small subset for testing purposes
trainset - the training data
testset - the test data

Prerequisite

The system is built on Python 2.7.

Other packages you need to install before running the system:

numpy
scikit-learn
matplotlib
nltk
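
Before running the scripts, it can help to confirm that the four packages import cleanly. The following is a minimal sketch and not part of the repository; the helper name `missing_packages` is hypothetical. Note that scikit-learn is imported under the module name `sklearn`.

```python
import importlib

# Import names of the packages listed above (scikit-learn imports as "sklearn").
REQUIRED = ["numpy", "sklearn", "matplotlib", "nltk"]


def missing_packages(names):
    """Return the subset of `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing packages: " + ", ".join(missing))
    else:
        print("All required packages are installed.")
```

Any package reported as missing needs to be installed before the commands in the next section will run.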

How to use

Download the code from GitHub.
Run one of the following, depending on what you want to do:
- for cross-validation on the training data, run:
crossvalidation.py data/model.json
- for analysis, run:
analysis.py data/model.json
- to train the model and predict on the test data, run:
main.py data/model.json

Model configuration

Parameters of the model can be changed in ./data/model.json:

d : dimension of the word vectors
cat : number of categories in the classification problem
alpha : weighting between the supervised (classification) error and the unsupervised (reconstruction) error
lambdaW : regularisation weight on the word vector reconstruction matrices
lambdaCat : regularisation weight on the category (classification) parameters
lambdaL : regularisation weight on the word embeddings
iter : maximum number of iterations of the minFunc solver
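
As an illustration of how these parameters fit together, the sketch below parses a configuration and combines the two error terms. It is not taken from the repository: the numeric values are placeholders, the helper names `load_config` and `combined_error` are hypothetical, and the weighting `alpha * reconstruction + (1 - alpha) * classification` is an assumed convention that the code here may or may not share.

```python
import json

# Keys documented in the parameter list above.
EXPECTED_KEYS = ("d", "cat", "alpha", "lambdaW", "lambdaCat", "lambdaL", "iter")


def load_config(text):
    """Parse a JSON string and check that every documented key is present."""
    config = json.loads(text)
    missing = [key for key in EXPECTED_KEYS if key not in config]
    if missing:
        raise KeyError("missing parameters: " + ", ".join(missing))
    if not 0.0 <= config["alpha"] <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return config


def combined_error(alpha, reconstruction_error, classification_error):
    """Weighted sum of the two error terms (assumed convention, see above)."""
    return alpha * reconstruction_error + (1.0 - alpha) * classification_error


# Placeholder values for illustration only, not the repository's defaults.
EXAMPLE = """
{"d": 50, "cat": 2, "alpha": 0.2,
 "lambdaW": 1e-05, "lambdaCat": 1e-07, "lambdaL": 1e-07, "iter": 70}
"""

config = load_config(EXAMPLE)
error = combined_error(config["alpha"], reconstruction_error=1.0,
                       classification_error=0.5)
```

Under this assumed convention, an `alpha` close to 1 emphasises reconstruction and an `alpha` close to 0 emphasises classification.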

