Semantic Textual Similarity task 2012/2013/2014
Requirements:
- numpy
- sklearn (scikit-learn)
- nltk (may require X11 under OS X)
- Google n-gram word counts and Takelab LSA models under the directory _data
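A quick way to check that the Python dependencies listed above are importable (a minimal sketch; the package names are the import names as listed):

```python
# Minimal sketch: report which of the required packages cannot be found.
import importlib.util

def missing_deps(names=("numpy", "sklearn", "nltk")):
    """Return the packages from `names` that are not importable."""
    return [n for n in names if importlib.util.find_spec(n) is None]

for name in missing_deps():
    print("missing required package:", name)
```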
- Make sure you have lib/python in your PYTHONPATH; e.g., in Bash:
    $ export PYTHONPATH=$PYTHONPATH:~/Projects/SemTextSim/github/STS13/lib/python
- Make sure you have the Google n-gram word counts and Takelab LSA models under the directory _data
- Add the 2014 trial data files under a new directory, data/STS2014-trial
- Create lib/python/sts/sts14.py defining the directories, ids, and filenames for the STS14 trial data
- Create lib/python/ntnu/sts14.py defining the directories and filenames of the features for the STS14 trial data
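The two modules above might look roughly like the following sketch. Only the names trial_input_fnames and test_dir are attested (by the make_feats call used in the next step); trial_dir, the data set ids, and the filename pattern are placeholders:

```python
# Hypothetical sketch of lib/python/sts/sts14.py; the data set ids and
# the input filename pattern are placeholders, not the real values.
from os.path import join

trial_dir = join("data", "STS2014-trial")
trial_ids = ["headlines", "images"]  # placeholder data set ids

# one sentence-pair input file per data set
trial_input_fnames = dict(
    (data_id, join(trial_dir, "STS.input.%s.txt" % data_id))
    for data_id in trial_ids
)

# Hypothetical sketch of lib/python/ntnu/sts14.py; only the name
# test_dir is attested, the path is a guess from the output layout.
test_dir = join("out", "STS2014-trial")
```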
- Create the Takelab features by adding a call to ntnu/make-takelab-feat.py:
    make_feats(sts.sts14.trial_input_fnames, ntnu.sts14.test_dir, with_lsa)
  Temporarily comment out the calls to make_feats for the other STS datasets.
- Change to the directory ./ntnu and run ./make-takelab-feat.py
  The new features appear as .txt files under out/STS2014-trial/, with one subdirectory per data set and one file per feature.
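Downstream scripts then read these per-feature files back in. A hedged sketch of such a loader, assuming the out/<data set>/<feature>.txt layout and one numeric value per line, aligned with the sentence pairs (the function name is hypothetical):

```python
# Sketch: load every feature file in a directory into a dict mapping
# feature name -> list of values, assuming one numeric value per line.
import os

def load_feats(feat_dir):
    feats = {}
    for fname in sorted(os.listdir(feat_dir)):
        if not fname.endswith(".txt"):
            continue
        with open(os.path.join(feat_dir, fname)) as f:
            feats[fname[:-4]] = [float(line) for line in f]
    return feats
```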
Suppose we have a new feature called "my_feat" that we want to try on the MSRpar dataset from STS12.
- Add the feature values for the training and test data as the files
  out/STS2012-train/MSRpar/my_feat.txt and out/STS2012-test/MSRpar/my_feat.txt
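Writing those files might look like the following sketch, assuming the same format as the Takelab feature files (one numeric value per line, in the same order as the sentence pairs); the helper name is hypothetical:

```python
# Hypothetical helper for writing one feature file, assuming the format
# is one numeric value per line, in sentence-pair order.
import os

def write_feat(values, fname):
    os.makedirs(os.path.dirname(fname), exist_ok=True)
    with open(fname, "w") as f:
        for v in values:
            f.write("%f\n" % v)

# e.g. write_feat(train_values, "out/STS2012-train/MSRpar/my_feat.txt")
#      write_feat(test_values, "out/STS2012-test/MSRpar/my_feat.txt")
```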
- Run the script ntnu/my_feat.py, and check the comments in the script for details.
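In outline, such a script fits a model on the training feature against the gold scores and evaluates predictions on the test set with Pearson correlation, the official STS metric. A self-contained sketch with toy stand-in data (the real script would read the feature and gold-standard files instead):

```python
# Hypothetical sketch of evaluating a single feature: fit a least-squares
# line on the training data, predict on the test data, score with Pearson.
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / sqrt(vx * vy)

def fit_linear(xs, ys):
    """Least-squares slope and intercept for a single feature."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return slope, my - slope * mx

# toy stand-ins for my_feat.txt values and gold similarity scores
train_feat, train_gold = [0.1, 0.4, 0.8], [1.0, 2.5, 4.5]
slope, intercept = fit_linear(train_feat, train_gold)

test_feat, test_gold = [0.2, 0.6], [1.5, 3.5]
pred = [slope * x + intercept for x in test_feat]
print("Pearson:", round(pearson(pred, test_gold), 3))
```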