# Load the dataset that was obtained from the API and pickled beforehand.
# NOTE(review): pickle.load can execute arbitrary code on untrusted data --
# acceptable only because this file was produced locally by our own pipeline.
with open('dataset.pkl', 'rb') as handle:
    dataset = pickle.load(handle)

# Extract all questions from the dataset, tokenized into sentences.
sentences = []
for record in dataset:
    sentences += nltk.sent_tokenize(record['question1'])
    sentences += nltk.sent_tokenize(record['question2'])

# Tokenize into words (lower-cased), count frequencies, keep the top 10000.
word_dist = nltk.FreqDist()
for sentence in sentences:
    word_dist.update(word.lower() for word in nltk.word_tokenize(sentence))
word_dist = word_dist.most_common(10000)

# Obtain the GloVe vectors for the 10000 words from the web service,
# requesting them in batches of 100 words per call (100 batches total).
# BUG FIX: the original loop was `range(10, 100)`, whose slices start at
# [1000:1100] -- it silently skipped the 1000 MOST FREQUENT words.
# Starting at 0 covers all 100 batches of 100 words.
embeddings_list = []
for batch in range(100):
    batch_words = [pair[0] for pair in word_dist[batch * 100:batch * 100 + 100]]
    embeddings_list += client.w2v(batch_words)

# Build {word: glove_vector} and pickle it for later reuse.
embeddings_index = {}
for entry in embeddings_list:
    embeddings_index[entry['word']] = entry['vec']
with open('embeddings_index1.pkl', 'wb') as handle:
    pickle.dump(embeddings_index, handle, protocol=pickle.HIGHEST_PROTOCOL)
# Each record exposes fields such as "summary" and "rating" that can be
# used to find the sentiment.  Preview the first 100 items.
for item in val[:100]:
    print("Summary ==> ", item["summary"], "\t\tRating ==> ", item["rating"])

# A summary may be one or more sentences of text.  We need to break these
# into words, then convert each word to its vector form.  The web service
# provides a function that accepts a list of words and returns the
# corresponding vectors.  Below we take the first item returned by the
# previous call and convert its summary into a sequence of vectors.
text = val[0]["summary"]
print("The input text is: ", text)

# Split the text into sentences (it may contain more than one) using
# NLTK's sent_tokenize, then flatten the sentences into a single word list.
sentences = sent_tokenize(text)
all_words = []
for sentence in sentences:
    all_words.extend(word_tokenize(sentence))

# all_words now contains every word of the text as one flat list;
# fetch the corresponding vectors from the web service.
vals = client.w2v(all_words)
# BUG FIX: the original wrote `for val in vals`, clobbering the dataset
# variable `val` used above -- any later code reading `val` would get a
# word/vector dict instead of the dataset.  Use a distinct loop name.
for word_vec in vals:
    print(word_vec["word"], word_vec["vec"])

# now you can continue further by vectoring the class label and creating
# the required dataset
# your code ......