def q_umls_d_wiki():
    """Generate TF-IDF nearest-neighbor candidate CUIs for UMLS mentions.

    Pipeline:
      1. Load mention docs for the train/test/dev splits and strip English
         stopwords from each mention's text.
      2. Load MRCONSO and build a stopword-stripped English alias string
         per CUI (CUIs without English aliases are skipped).
      3. Fit a char_wb (1-5)-gram TF-IDF vectorizer on the CUI alias
         strings and project the mention texts into the same space.
      4. Find the 64 nearest CUIs per mention by cosine distance.

    Side effects:
      - writes ``ns_balltree.pkl``: pickled ``(neighbor_indices, cuis,
        mention_ids)`` tuple;
      - writes ``mm_tfidf_candidates.json``: one JSON object per line with
        ``mention_id`` and its ``tfidf_candidates`` CUI list;
      - prints progress to stdout.
    """
    # Merge all three splits into one mention-id -> cleaned-text map.
    # Update order (train, test, dev) matters only if ids collide across
    # splits; it matches the original behavior (dev wins last).
    mentions = {}
    for split in ("train", "test", "dev"):
        split_docs = get_mention_docs(split)
        mentions.update(
            {mid: ' '.join(set(doc["text"].split()) - en_stops)
             for mid, doc in split_docs.items()}
        )

    # One stopword-stripped English alias string per CUI.
    mrconso = get_mrconso()
    aliases = {cui: " ".join(set(rec["alias"]["ENG"]) - en_stops)
               for cui, rec in mrconso.items() if "ENG" in rec["alias"]}

    # Sorted orderings make matrix row indices map back to ids
    # deterministically across runs.
    mention_ids = sorted(mentions)
    cuis = sorted(aliases)

    vectorizer = TfidfVectorizer(analyzer="char_wb", ngram_range=(1, 5),
                                 max_features=100000)
    print(vectorizer)
    X_cui = vectorizer.fit_transform([aliases[cid] for cid in cuis])
    X_mention = vectorizer.transform([mentions[mid] for mid in mention_ids])
    print(X_cui.shape, X_mention.shape)

    # Index CUI vectors, then query each mention for its 64 nearest CUIs.
    # (Renamed from `nbrs`, which the original reused for both the model
    # and the per-mention candidate list.)
    nn_index = NN(n_neighbors=64, algorithm='auto', metric='cosine',
                  leaf_size=64, n_jobs=10)
    print("fitting nn...")
    nn_index.fit(X_cui)
    print("finding nbrs...")
    ns = nn_index.kneighbors(X_mention, return_distance=False)

    with open('ns_balltree.pkl', 'wb') as fout:
        pickle.dump((ns, cuis, mention_ids), fout)

    # JSONL output: one candidate record per mention, rows of `ns` are
    # aligned with `mention_ids` by construction.
    with open('mm_tfidf_candidates.json', 'w') as fout:
        for row, mention_id in enumerate(mention_ids):
            candidates = [cuis[idx] for idx in ns[row]]
            fout.write(json.dumps({"mention_id": mention_id,
                                   "tfidf_candidates": candidates}))
            fout.write('\n')