Python Preprocessor.investigate_whitelist Examples

Programming language: Python

Namespace / package name: preprocess

Class / Type: Preprocessor

Method / Function: investigate_whitelist

Examples on hotexamples.com: 1

Python Preprocessor.investigate_whitelist - 1 example found. This is the top-rated real-world example of preprocess.Preprocessor.investigate_whitelist in Python, extracted from open source projects.

Frequently Used Methods

Preprocessor(30)

add(4)

execute(3)

load(3)

import_video(3)

get_vocabulary(2)

get_states(2)

get_standard_form(2)

get_representer(2)

gen_data_vec(2)

setNextPitchCorner(2)

count_lines(1)

bgsub(1)

load_data(1)

_line_cleanup(1)

lda(1)

investigate_whitelist(1)

index_list_to_word_list(1)

apply(1)

basic_preprocess(1)

get_values_all(1)

get_training_data(1)

get_train_test_data_tag(1)

get_testing_data(1)

get_target_names(1)

build_vocab(1)

convert_text_to_index(1)

build_vocabulary_and_categories(1)

get_feature_names(1)

get_data(1)

get_all_text(1)

_clean_data(1)

getSentences(1)

generateTrainData(1)

convert_index_to_text(1)

gaussian(1)

format_to_nn(1)

format_to_lines(1)

fit_on_corpus(1)

get_all_tag_idx(1)

Example #1

File: Vectorize.py Project: IKKO-Ohta/Text2Feature

import os

from preprocess import Preprocessor  # package name as listed on this page
# Parser and Vectorizer come from the same project; their import lines are
# not part of this excerpt. text_path, auto_text_path, tree_path, index_path,
# kytea_path and eda_path are likewise defined outside this excerpt.

thesaurus_path = '../corpus/thesaurus/thesaurus.txt'
IDF_path = '../auto/IDF.index'
tfidf_DB_path = '../auto/TFIDF_vectors_DB'

base_name = os.path.basename(text_path)  # A.text
root = os.path.splitext(base_name)[0]    # A

#-------------------------------------------------------------------------------------
# main
#-------------------------------------------------------------------------------------
# If no thesaurus path is passed, no replacement is performed.
PREPROCESSOR = Preprocessor(thesaurus_path)
print('前処理を行います')  # "Running preprocessing"
PREPROCESSOR.load_text([text_path])
whitelist = PREPROCESSOR.investigate_whitelist(thesaurus_path)
print('保存します')  # "Saving"
PREPROCESSOR.save(auto_text_path)

PARSER = Parser()
print('かかり受け解析を行います..')  # "Running dependency parsing.."
PARSER.t2f([auto_text_path + '/' + root + '.text'],
           kytea_model=kytea_path, eda_model=eda_path)
print('結果を保存します')  # "Saving the results"
PARSER.save(tree_path)  # save the dependency-parsed output to files

print("Indexを読み込みます...")  # "Loading the index..."
VECTORIZER = Vectorizer(index_path, t=1, list=whitelist)  # load the index
print('Treeを読み込みます')  # "Loading the tree"
vectors = VECTORIZER.get_vector([tree_path + '/' + root + '.eda'], filter=3)  # generate the vectors
print(vectors)
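The example derives every intermediate file name from the input file's base name: the root `A` of `A.text` is reused for the preprocessed `.text` file and for the parsed `.eda` file. A minimal sketch of that naming step, using only the standard library, is shown here. The helper name `derive_artifact_paths` and the directory values are hypothetical and were chosen for illustration; they are not part of Text2Feature.

```python
import os


def derive_artifact_paths(text_path, auto_text_path, tree_path):
    """Return (root, preprocessed_text_file, tree_file) for one input file.

    Hypothetical helper: it mirrors the string concatenation used in the
    example above, not an API of the Text2Feature project.
    """
    base_name = os.path.basename(text_path)  # e.g. 'A.text'
    root = os.path.splitext(base_name)[0]    # e.g. 'A'
    text_file = auto_text_path + '/' + root + '.text'
    tree_file = tree_path + '/' + root + '.eda'
    return root, text_file, tree_file


# Illustrative directories only; the real values live elsewhere in Vectorize.py.
root, text_file, tree_file = derive_artifact_paths(
    '../corpus/text/A.text', '../auto/text', '../auto/tree')
print(root)
print(text_file)
print(tree_file)
```

Keeping the naming in one place makes it harder for the path passed to the parser and the path passed to the vectorizer to drift apart.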