GitHub - morten-pedersen/Wordnet_NLTK: Research on Norwegian contents of WordNet using NLTK

pip install nltk for initial download of Wordnet nltk.download('wordnet') nltk.download('omw')

help(wn) for help.

GUI browser: nltk.app.wordnet()

synsets(): lists synsets of a word, useful for when the synset is not known. Example:

wn.synsets('dog')

synset(): a set of synonyms that share a common meaning. A synset in NLTK is the "ID" for each word, and used to access most other methods.

lemmas(): Lists words of the same meaning, with their synset and lemma names. Can be called from synset.

lemma_names(): lists lemma names of a synset. Can have relations between them, using derivationally_related_forms(), pertainyms() and antonyms().

definition(): lists definition of a synset. Called directly from synset

name(): lists name of synset. Called from synset.

examples(): lists examples of a synset. Called from synset.

hypernyms(): definitions above word / superclass, hypernym_paths()

hyponyms(): definitions below word / subclass

hypernym_paths(): lists the paths of the synset to its highest hypernym. Callable from synset.

root_hypernyms(): lists the highest hypernym of the synset. Callable from synset.

hypernym_distances(): lists distance between hypernyms from a synset. Callable from synset.

lowest_common_hypernyms(): lists the lowest common hypernym between two synsets. Example:

wn.synset('tree.n.01').lowest_common_hypernyms(wn.synset('forest.n.01'))

min_depth(): returns a number of how specific a synset is, meaning how deep the synset is in the hyponym tree

meronym: lists items that are components to this word, part_meronyms(), substance_meronyms(), member_meronyms()

holonym: reverse to meronym: part_holonyms(), substance_holonyms(), member_holonyms()

entailments(): walking entails stepping

antonyms(): lists word(s) with opposite meanings. Has to be called from lemmas, and specified which item in index to list antonyms from, for example:

walk = wn.synset('walk.v.01')

walk.lemmas()[0].antonyms()

pertainyms(): lists words pertaining to the word (belonging to). Needs to be called in same way as antonyms()

derivationally_related_forms(): lists derivationally related forms. Needs to be called in same way as antonyms()

path_similarity(): score between 0 and 1 on shortest path between concepts in hypernym hierarchy. Needs to be called using the synset of the words you want to compare:

tree = wn.synset('tree.n.01')

forest = wn.synset('forest.n.01')

tree.path_similarity(forest)

or

wn.synset('tree.n.01').path_similarity(wn.synset('forest.n.01'))

dir(): shows lexical relations and other methods on a synset. Example:

dir(wn.synset('tree.n.01'))

We are interested in the Norwegian lemmas of the English word, which we can access by using either of the following:

wn.lemmas(word, lang = 'nob')

wn.synset('dog.n.01').lemma_names('nob')

or access them from Norwegian:

wn.synsets('hund', lang = 'nob')

wn.lemmas('hund', lang = 'nob')

Språkbanken has a word net consisting of 50,000 synsets (in both bokmål and nynorsk) at Norsk Ordvev: https://www.nb.no/sprakbanken/show?serial=oai%3Anb.no%3Asbr-27&lang=nb (updated Feb 2016)

Resources:

http://compling.hss.ntu.edu.sg/omw/

https://www.nb.no/forskning/sprakbanken

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
.gitattributes		.gitattributes
.gitignore		.gitignore
Dictionary.txt		Dictionary.txt
NLTK.png		NLTK.png
NLTK.xml		NLTK.xml
README.md		README.md
SentimentLookupTable.csv		SentimentLookupTable.csv
WordNet.xlsx		WordNet.xlsx
dictionary.py		dictionary.py
getter.py		getter.py
nob_to_en.py		nob_to_en.py
read_write.py		read_write.py
testfile.txt		testfile.txt
testing.py		testing.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.gitattributes

.gitattributes

.gitignore

.gitignore

Dictionary.txt

Dictionary.txt

NLTK.png

NLTK.png

NLTK.xml

NLTK.xml

README.md

README.md

SentimentLookupTable.csv

SentimentLookupTable.csv

WordNet.xlsx

WordNet.xlsx

dictionary.py

dictionary.py

getter.py

getter.py

nob_to_en.py

nob_to_en.py

read_write.py

read_write.py

testfile.txt

testfile.txt

testing.py

testing.py

Repository files navigation

About

Releases

Packages

Contributors 2

Languages

morten-pedersen/Wordnet_NLTK

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages