# Load tokenized train/val data and the vocabulary from the on-disk cache when
# every cached artifact is present; otherwise tokenize from scratch and
# (optionally) write the cache for next time.
_cache_files = (TRAIN_TOKENS, TRAIN_TEXTS, VAL_TOKENS, VAL_TEXTS,
                TRAIN_LABELS, VAL_LABELS, IDX_TO_TOKEN)
# Fix: the original checked only three of the seven cached files, so a
# partially written cache made the np.load calls below crash. Require all.
if USE_CACHE and all(Path(p).exists() for p in _cache_files):
    token_train = np.load(TRAIN_TOKENS)
    texts_train = np.load(TRAIN_TEXTS)
    token_val = np.load(VAL_TOKENS)
    texts_val = np.load(VAL_TEXTS)
    train_labels = np.load(TRAIN_LABELS)
    val_labels = np.load(VAL_LABELS)
    # Context manager: the original leaked this file handle.
    with Path(IDX_TO_TOKEN).open('rb') as fh:
        idx_to_token = pickle.load(fh)
    vocab = Vocabulary(idx_to_token)
else:
    texts_train, token_train, train_labels = get_all_tokenized(train_data, 1)
    texts_val, token_val, val_labels = get_all_tokenized(val_data, 1)
    vocab = Vocabulary.from_text(token_train)
    if USE_CACHE:
        np.save(str(TRAIN_TOKENS), token_train)
        np.save(str(TRAIN_TEXTS), texts_train)
        np.save(str(VAL_TOKENS), token_val)
        np.save(str(VAL_TEXTS), texts_val)
        np.save(str(TRAIN_LABELS), train_labels)
        np.save(str(VAL_LABELS), val_labels)
        # `with` guarantees the pickle is flushed and closed even on error;
        # the original left the handle open, risking a truncated cache file.
        with open(IDX_TO_TOKEN, 'wb') as fh:
            pickle.dump(vocab._idx_to_token, fh)

# Load genre names: one genre per line, surrounding whitespace stripped.
with Path(GENRES_TYPES_FILE).open("r") as fh:
    GENRES = [line.strip() for line in fh]
n_genres = len(GENRES)