Python KeyedVectors.similar_by_vector Examples

Programming language: Python

Namespace/package name: gensim.models

Class/type: KeyedVectors

Method/function: similar_by_vector

Examples at hotexamples.com: 2

Python KeyedVectors.similar_by_vector: 2 examples found. These are the top-rated real-world Python examples of gensim.models.KeyedVectors.similar_by_vector, extracted from open source projects.

Frequently used methods

add(30)

load_word2vec_format(30)

load(17)

KeyedVectors(15)

save_word2vec_format(15)

most_similar(14)

save(12)

index2word(10)

vocab(9)

add_vectors(8)

vectors(6)

syn0(6)

init_sims(5)

get_vector(4)

vector_size(3)

cosine_similarities(3)

similar_by_vector(2)

distances(2)

similarity(2)

vectors_norm(1)

similar_by_word(1)

index2entity(1)

add_vector(1)

nbow(1)

n_similarity(1)

evaluate_word_analogies(1)

load_fasttext_format(1)

evaluate_word_pairs(1)

load_word2v1ec_format(1)

Example #1

File: ex9.py Project: mat-hek/pjn

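from gensim.models import KeyedVectors

def rem_add(x, rem, add, wv: KeyedVectors):
    # Analogy arithmetic: vec(x) - vec(rem) + vec(add). `parse` is a project helper that is not shown here.
    y = wv[parse(x)] - wv[parse(rem)] + wv[parse(add)]
    return wv.similar_by_vector(y, topn=5)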
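
To make the arithmetic in `rem_add` concrete, here is a minimal pure-Python sketch of the idea behind a nearest-by-vector lookup: rank every stored word by cosine similarity to a query vector. It is not gensim's implementation. The `toy_vectors` table and the `similar_by_vector_sketch` helper are hypothetical names made up for illustration, and the 2-dimensional values are hand-picked so that the analogy works out.

```python
import math

# Hypothetical toy embedding table (hand-picked 2-d values, not real model output).
toy_vectors = {
    "king": [1.0, 1.0],
    "queen": [1.0, -1.0],
    "man": [0.0, 1.0],
    "woman": [0.0, -1.0],
}


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def similar_by_vector_sketch(vectors, query, topn=5):
    """Return up to `topn` (word, similarity) pairs, most similar first."""
    scored = [(word, cosine(vec, query)) for word, vec in vectors.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]


# Analogy arithmetic in the style of rem_add: king - man + woman.
query = [
    k - m + w
    for k, m, w in zip(toy_vectors["king"], toy_vectors["man"], toy_vectors["woman"])
]
results = similar_by_vector_sketch(toy_vectors, query, topn=2)
print(results)
```

With these toy values the query vector coincides with the stored vector for "queen", so that word ranks first. The sketch does not filter the input words out of the ranking, so a word used to build the query can itself appear in the results.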

Example #2

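import os
import pickle
import re
from logging import debug  # assumption: the original file's imports are not shown; `debug` is taken to be logging.debug

from gensim.models import KeyedVectors


class VectorSpaceModel(object):
    """Base class for models that represent words as vectors.

    For now, this really is just a wrapper around the Gensim KeyedVectors / Word2Vec class.
    """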

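    def __init__(self, name=None):
        self.name = name
        # Note: the no-argument constructor only works in old gensim releases;
        # newer ones require KeyedVectors(vector_size=...).
        self.m = KeyedVectors()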

    @classmethod
    def load(cls, filename, modelname=None, **kwargs):
        if filename.endswith('.pkl'):
            model = cls.load_pickle(filename, modelname=modelname, **kwargs)
        else:
            model = cls.load_w2v(filename, modelname=modelname, **kwargs)
        return model

    @classmethod
    def load_pickle(cls, filename, **kwargs):
        debug("Loading pickled model from file {:}".format(filename))
        # pickle.load expects an open binary file object, not a path string
        with open(filename, 'rb') as f:
            model = pickle.load(f)
        return model

    @classmethod
    def load_w2v(cls, filename, modelname=None, **kwargs):
        """Load the model from disk."""
        debug("Loading word2vec model from file {:}".format(filename))
        if filename.endswith(".bin"):
            m = KeyedVectors.load_word2vec_format(filename, binary=True)
        else:
            m = KeyedVectors.load_word2vec_format(filename)
        model = cls()
        model.m = m
        if modelname is None:
            modelname = os.path.basename(filename)
            # Strip only a trailing ".bin" extension; the dot is escaped so it matches literally
            modelname = re.sub(r'\.bin$', '', modelname)
        model.name = modelname
        return model

    def save_pickle(self, filename):
        debug("Saving model {:} to pickle file {:}".format(self.name, filename))
        # pickle.dump expects an open binary file object, not a path string
        with open(filename, 'wb') as f:
            pickle.dump(self, f)

    def __getitem__(self, word):
        return self.m[word]

    def most_similar(self, query, k=5):
        """Return the most similar words to the query. `query` can be either a string or a
        vector. If it is a string, then its vector will be looked up in the current VSM.
        """
        if isinstance(query, str):
            results = self.m.most_similar(query, topn=k)
        else:
            results = self.m.similar_by_vector(query, topn=k)
        return results

    def __repr__(self):
        # `syn0` is the pre-4.0 name of the vector matrix; gensim >= 4.0 exposes it as `vectors`
        return "<VectorSpaceModel {:} with {:,} vectors>".format(repr(self.name), self.m.syn0.shape[0])
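
The `load_pickle` and `save_pickle` methods rely on the `pickle` module, whose `dump` and `load` functions take an open binary file object rather than a path string. Below is a minimal sketch of that round trip. It uses a plain dictionary as a hypothetical stand-in for the model so that it runs without gensim; the `save_pickle` and `load_pickle` functions here are free-standing illustrations, not the class methods above.

```python
import os
import pickle
import tempfile


def save_pickle(obj, filename):
    # pickle.dump needs a file object opened for binary writing.
    with open(filename, "wb") as handle:
        pickle.dump(obj, handle)


def load_pickle(filename):
    # pickle.load needs a file object opened for binary reading.
    with open(filename, "rb") as handle:
        return pickle.load(handle)


# Hypothetical stand-in payload for a model (not a real gensim object).
original = {"name": "demo", "vectors": {"cat": [0.1, 0.2]}}

with tempfile.TemporaryDirectory() as tmpdir:
    path = os.path.join(tmpdir, "model.pkl")
    save_pickle(original, path)
    restored = load_pickle(path)

print(restored["name"], restored["vectors"])
```

Only unpickle files from sources you trust, since loading a pickle can execute arbitrary code.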