Python WordEmbedding.train Examples

Programming Language: Python

Namespace/Package Name: embeddings

Class/Type: WordEmbedding

Method/Function: train

Examples at hotexamples.com: 4

Python WordEmbedding.train - 4 examples found. These are the top rated real world Python examples of embeddings.WordEmbedding.train extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

WordEmbedding(11)

train(4)

save(3)

Projection(2)

n_words(2)

set_elmo(2)

set_bert(1)

set_glove(1)

Example #1

Show file

File: embeddings_train.py Project: maeda/polysemybot

def retrain():
    ds = process(PreProcessing('./data/starwars.txt'))

    word_embedding = WordEmbedding(source='./embedding/FT/fasttext_cbow_300d.bin')

    word_embedding.train(ds.pairs)
    word_embedding.save('./embedding/starwars', 'starwars.bin')

Example #2

Show file

File: embeddings_train.py Project: maeda/polysemybot

def train():
    ds = process(PreProcessing(open('./data/starwars.txt', 'r')))

    word_embedding = WordEmbedding(source=ds.pairs)

    word_embedding.train(ds.pairs)

    word_embedding.save(target_folder='./embedding/starwars', filename='starwars.bin')

Example #3

Show file

File: tests.py Project: maeda/polysemybot

    def test_load_from_file(self):
        embeddings_path = os.path.join(settings.BASE_DIR, 'embeddings',
                                       uuid.uuid4().hex)
        filename = str(self.__class__.dataset.idx) + ".bin"

        word_embedding = WordEmbedding(source=self.__class__.dataset.pairs)
        word_embedding.train()
        word_embedding.save(embeddings_path, filename)

        model = WordEmbedding(source=os.path.join(embeddings_path, filename))
        print(model._embedding.wv.similarity('batendo', 'porta'))

Example #4

Show file

File: tests.py Project: maeda/polysemybot

 def test_should_generate_training_pairs(self):
     pre_processing = PreProcessing(sentences)
     dataset = ds.process(pre_processing)
     word_embedding = WordEmbedding(freeze=False, source=dataset.pairs)
     word_embedding.train()
     self.assertEqual(len(dataset.training_pairs(2, word_embedding)), 2)