Example no. 1
0
# Build, train, save, and sanity-check a FastText model (gensim wrapper).
model = FT_gensim(size=32)

# Scan the corpus once to populate the vocabulary.
model.build_vocab(corpus_file=corpus_file)

# Train over the same corpus file for 15 epochs.
model.epochs = 15
model.train(
    corpus_file=corpus_file,
    epochs=model.epochs,
    total_examples=model.corpus_count,
    total_words=model.corpus_total_words,
)
print(model)

# Persist the trained model via gensim's native fastText serialization.
model.save(save_file, separately=[])

# Basic checks: confirm a few expected tokens landed in the vocabulary.
for token in ("job", "salary", "learn"):
    print(token in model.wv.vocab)

# Print the vector representation of one in-vocabulary word.
print(model["job"])

# Pairwise cosine-similarity probes against a few reference words.
for other in ("salary", "learn", "the"):
    print(model.similarity("job", other))
Example no. 2
0
###############################################################################
#
# Similarity operations work the same way as in word2vec. **Out-of-vocabulary
# words can also be queried, provided at least one of their character n-grams
# appeared in the training data.**
#

# Check whether each probe word (plural, then singular) is in-vocabulary.
for probe in ("nights", "night"):
    print(probe in model.wv.vocab)

###############################################################################
#
print(model.similarity("night", "nights"))

###############################################################################
#
# Syntactically similar words generally score high in fastText models, since a
# large share of their component character n-grams coincide. As a result,
# fastText tends to do better than Word2Vec on syntactic tasks. A detailed
# comparison is provided `here <Word2Vec_FastText_Comparison.ipynb>`_.
#


###############################################################################
#
# Other similarity operations
# ^^^^^^^^^^^^^^^^^^^^^^^^^^^
#
# The example training corpus is a toy corpus; results are proof-of-concept
# only and are not expected to be good.
print(model.most_similar("nights"))
# In[ ]:

from gensim.models.fasttext import FastText

# Train a skip-gram FastText model on the prepared corpus.
ft_model = FastText(
    train_data,
    size=embedding_size,
    window=window_size,
    min_count=min_word,
    sample=down_sampling,
    sg=1,     # 1 = skip-gram (0 would be CBOW)
    iter=10,  # number of training epochs (gensim 3.x parameter name)
)

# In[ ]:

# For each probe word, keep the five nearest neighbours by cosine similarity
# (most_similar returns (word, score) pairs; the scores are discarded).
semantically_similar_words = {
    query: [neighbour for neighbour, _score in ft_model.wv.most_similar([query], topn=5)]
    for query in
    ['kitchen', 'death', 'king', 'queen', 'strong', 'weak', 'woman', 'man']
}

for word, neighbours in semantically_similar_words.items():
    print(word + ":" + str(neighbours))

# In[ ]:

# Cosine similarity between two character names.
ft_model.similarity("annabeth", "percy")

# In[ ]:

# Export the learned word vectors in word2vec text format.
ft_model.wv.save_word2vec_format('FTvectors')