Example #1
0
def create_dataset(tweets, window, datafile="mapped_tweets.npy", export=True):
    """Build a (contexts, neighbors) training set from mapped tweets.

    Parameters
    ----------
    tweets : dict or None
        Mapping of tweets to feed ``Word2Vec.create_dataset``. When None,
        it is restored from ``datafile`` (a dict saved with ``np.save``).
    window : int
        Context window size forwarded to ``Word2Vec.create_dataset``.
    datafile : str
        Path of the saved tweet mapping, used only when ``tweets`` is None.
    export : bool
        When True, dump the arrays to ``./data/npcontexts.dat`` and
        ``./data/npneighbors.dat``.
    """
    if tweets is None:
        try:
            # allow_pickle=True is required to restore a pickled dict:
            # NumPy >= 1.16.3 refuses object arrays by default, which would
            # raise ValueError here and bypass the handler below.
            tweets = np.load(datafile, allow_pickle=True).item()
        except FileNotFoundError:
            print("cannot find " + datafile)
            # Same exit code as before; avoids the site-module exit() helper,
            # which is not guaranteed outside interactive sessions.
            raise SystemExit(1)
    contexts, neighbors = Word2Vec.create_dataset(tweets, window)
    if export:
        print("saving train set to file")
        contexts = np.array(contexts)
        neighbors = np.array(neighbors)
        contexts.tofile('./data/npcontexts.dat')
        neighbors.tofile('./data/npneighbors.dat')
Example #2
0
def create_trainset(window, export=True):
    """Create a (contexts, neighbors) training set from mapped_comments.json.

    Reads the mapped comments, flattens them into a single list of
    sentences, drops empty ones, and builds the skip-gram pairs via
    ``Word2Vec.create_dataset``.

    Parameters
    ----------
    window : int
        Context window size forwarded to ``Word2Vec.create_dataset``.
    export : bool
        When True, dump the arrays to ``npcontexts.dat`` and
        ``npneighbors.dat`` in the current directory.
    """
    # Explicit encoding: JSON is UTF-8 by spec; the platform default may differ.
    with open("mapped_comments.json", encoding="utf-8") as f:
        comments = json.load(f)

    sentences = []
    total = len(comments)
    # enumerate replaces the zip(comments, range(len(comments))) anti-idiom.
    for index, key in enumerate(comments):
        progress(index, total, "combining sentences")
        sentences.extend(comments[key])

    # Drop empty sentences (comprehension instead of filter(lambda x: x, ...)).
    sentences = [s for s in sentences if s]
    print("finished")
    sentences = np.array(sentences)
    contexts, neighbors = Word2Vec.create_dataset(sentences, window)
    if export:
        npc = np.array(contexts)
        npn = np.array(neighbors)
        npc.tofile('npcontexts.dat')
        npn.tofile('npneighbors.dat')