GitHub - We-can-apply-GPU/aMeLiDoSu-HW3

Usage : cat Holmes_Training_Data/*.TXT | ./preprocessing.sh > training.txt

ref : https://code.google.com/p/pygoogle

Data preprocessing

./preprocessing <training data 路徑> <training data filename or -a（路徑下所有檔案）

會產生一個train_pro.txt，可以給word2vec使用

Feature extract

output為一個dic,傳入vocab,可以回傳他的feature。import extract之後，extract.extract(dim ,training data, trained)，如果已經執行過一次，產生過.bin檔，trained那邊可直接傳入true，word2vec就不用在跑一次了。For example:extract.extract(100, 'train_pro.txt', 'true')

###Issue need to solve :

目前 outputLayer是以softMax做nonlinearity，但今天她代表的是機率，跟我們從word2vec得到的R^m vector是不同的東西 (看docs上說word2vec是以hierarchical softMax實作，但今天他有正有負(log(prob)??)) 所以...是要照目前的方式，然後再加深層數之類的做deep RNN，還是我們要將所有的word做classification(dim炸裂囧) 不過...可能是需要做regularization
3D matrix manipulication?? -> tensordot (solved?) 張量內積那邊numpy的docs寫的還蠻清楚的(也沒有啦例子很懂懂XDD)
nesterov grad不能用吧XD 目前是用rmsprop做為update method (做實驗的好材料？) 可以試試其他方式
Regularization? 不要讓output炸開而是能壓在一定的range以下?(lasagne.regularization有包好~~但我覺得好像也應該是要跟word2vec做match?)
X_train ,Y_train 的給法 e.g. Given wordSeq {w_1,w_2,....w_n} Assume 5-gram : X_train : {w_1,w_2,...w_5} Y_train : {w_2,w_3,...w_6} , another method ?

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
Holmes_Training_Data		Holmes_Training_Data
network		network
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py
extract.py		extract.py
google.py		google.py
iodata.py		iodata.py
predict.py		predict.py
preprocessing.py		preprocessing.py
preprocessing.sh		preprocessing.sh
sentence.py		sentence.py
sentence2.py		sentence2.py
settings.py		settings.py
thesis.py		thesis.py
train.py		train.py
train_pro.txt.bin		train_pro.txt.bin

We-can-apply-GPU/aMeLiDoSu-HW3

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages