## Runtime flags.
stochastic = False
verbose = 1

## Tokenize text, change to matrix.
# Read the English corpus once.  The original read the identical file a
# second time to build the output-side text list; both tokenizers are fit
# on the same lines, so a single read is sufficient and the fitted state
# is unchanged.
text = []
with open("data/TED2013.raw.en") as f:
    for line in f:
        text.append(line)

# Source-side tokenizer.
# NOTE(review): `input` shadows the builtin of the same name; the name is
# kept because code elsewhere in the file may reference it.
input = Tokenizer(n_words)
input.fit_on_texts(text)
seq = input.texts_to_sequences(text, n_sentence, n_maxlen)
n_words_x = input.nb_words  # actual vocabulary size after fitting

# Target-side tokenizer, fit on the same English text (monolingual /
# auto-encoding setup -- TODO confirm this is intended rather than a
# placeholder for a second language file).
output = Tokenizer(n_words)
output.fit_on_texts(text)
targets = output.texts_to_sequences(text, n_sentence, n_maxlen)
n_words_y = output.nb_words

# Shift targets left by one position (each input aligns with the *next*
# sequence); the final element ends up duplicated in the last two slots,
# as in the original.
targets[:-1] = targets[1:]
## Decoder dimensions.
n_d = 1000  ## number of hidden nodes in decoder
n_y = dim_word

## Runtime flags.
stochastic = False
verbose = 1

## Tokenize text, change to matrix.
text = []
with open("data/TED2013.raw.en") as f:
    for line in f:
        text.append(line)

# Source-side tokenizer.
# NOTE(review): `input` shadows the builtin of the same name; the name is
# kept because code elsewhere in the file may reference it.
# NOTE(review): `n_words_x` is read here before being reassigned below --
# it must be defined earlier in the file.
input = Tokenizer(n_words_x)
input.fit_on_texts(text)
seq = input.texts_to_sequences(text, n_sentence, n_maxlen)
n_words_x = input.nb_words  # actual vocabulary size after fitting

# Auto-encoding setup: the output side reuses the input tokenizer and the
# same sequences.  (A superseded second-tokenizer section that had been
# left behind as commented-out code was removed; behavior is unchanged.)
output = input
targets = seq  # NOTE(review): aliases `seq` -- mutating one mutates the other
n_words_y = output.nb_words
## Runtime flags.
stochastic = False
verbose = 1

## Tokenize text, change to matrix.
text = []
with open("data/TED2013.raw.en") as f:
    for line in f:
        text.append(line)

# Source-side tokenizer.
# NOTE(review): `input` shadows the builtin of the same name; the name is
# kept because code elsewhere in the file may reference it.
# NOTE(review): `n_words_x` is read here before being reassigned below --
# it must be defined earlier in the file.
input = Tokenizer(n_words_x)
input.fit_on_texts(text)
seq = input.texts_to_sequences(text, n_sentence, n_maxlen)
n_words_x = input.nb_words  # actual vocabulary size after fitting

# Auto-encoding setup: the output side reuses the input tokenizer and the
# same sequences.  (A superseded second-tokenizer section that had been
# left behind as commented-out code was removed; behavior is unchanged.)
output = input
targets = seq  # NOTE(review): aliases `seq` -- mutating one mutates the other
n_words_y = output.nb_words