# Initialize the fastText model (32-dimensional vectors).
model = FT_gensim(size=32)

# Build the vocabulary from the corpus file.
model.build_vocab(corpus_file=corpus_file)

# Train the model; corpus_count / corpus_total_words were collected
# during the build_vocab scan above.
model.epochs = 15
model.train(
    corpus_file=corpus_file,
    epochs=model.epochs,
    total_examples=model.corpus_count,
    total_words=model.corpus_total_words,
)
print(model)

# Save the model trained via Gensim's fastText implementation.
# separately=[] forces all large arrays into the single save file.
model.save(save_file, separately=[])

# Run some basic sanity checks on the learned vocabulary.
print("job" in model.wv.vocab)
print("salary" in model.wv.vocab)
print("learn" in model.wv.vocab)

# Print a vector representation.
# model["job"] is deprecated; model.wv["job"] is the supported accessor
# and returns the same vector.
print(model.wv["job"])

# Test pairwise similarity.
# model.similarity is a deprecated alias of model.wv.similarity; use the
# wv accessor directly (same values, no DeprecationWarning).
print(model.wv.similarity("job", "salary"))
print(model.wv.similarity("job", "learn"))
print(model.wv.similarity("job", "the"))
###############################################################################
# Similarity operations work the same way as word2vec.  **Out-of-vocabulary
# words can also be used, provided they have at least one character ngram
# present in the training data.**
#
print("nights" in model.wv.vocab)

###############################################################################
#
print("night" in model.wv.vocab)

###############################################################################
#
print(model.similarity("night", "nights"))

###############################################################################
# Syntactically similar words generally have high similarity in fastText
# models, since a large number of the component char-ngrams will be the same.
# As a result, fastText generally does better at syntactic tasks than
# Word2Vec.  A detailed comparison is provided
# `here <Word2Vec_FastText_Comparison.ipynb>`_.
#

###############################################################################
# Other similarity operations
# ^^^^^^^^^^^^^^^^^^^^^^^^^^^
#
# The example training corpus is a toy corpus; results are not expected to be
# good, for proof-of-concept only.
print(model.most_similar("nights"))
# In[ ]:
from gensim.models.fasttext import FastText

# Train a skip-gram (sg=1) fastText model on the tokenized corpus for
# 10 iterations.  Hyperparameters (embedding_size, window_size, min_word,
# down_sampling) are defined earlier in the file.
ft_model = FastText(
    train_data,
    size=embedding_size,
    window=window_size,
    min_count=min_word,
    sample=down_sampling,
    sg=1,
    iter=10,
)

# In[ ]:
# Top-5 nearest neighbours for a few probe words.
semantically_similar_words = {
    word: [item[0] for item in ft_model.wv.most_similar([word], topn=5)]
    for word in ['kitchen', 'death', 'king', 'queen', 'strong', 'weak', 'woman', 'man']
}
for k, v in semantically_similar_words.items():
    print(k + ":" + str(v))

# In[ ]:
# ft_model.similarity is a deprecated alias that delegates to
# ft_model.wv.similarity and returns the same value; use the wv accessor
# directly, consistent with the wv.most_similar call above.
ft_model.wv.similarity("annabeth", "percy")

# In[ ]:
# Export the word vectors in word2vec text format.
ft_model.wv.save_word2vec_format('FTvectors')