Python Index.add_index Examples

Programming language: Python

Namespace/package name: index

Class/type: Index

Method/function: add_index

Examples at hotexamples.com: 1

Python Index.add_index: 1 example found. It is the top-rated real-world Python example of index.Index.add_index, extracted from an open source project.

Example #1

File: MakeIndex.py Project: IKKO-Ohta/Text2Feature

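import glob
import os

# PREPROCESSOR, Parser, Index, Vectorizer and the *_path variables are
# defined elsewhere in the project and are not shown in this excerpt.
print('Running preprocessing')
PREPROCESSOR.load_text(sorted(glob.glob(text_folder_path + '/*')))
whitelist = PREPROCESSOR.investigate_whitelist(thesaurus_path)
print('Saving')
PREPROCESSOR.save(auto_text_path)
PARSER = Parser()
print('Running dependency parsing..')
PARSER.t2f(sorted(glob.glob(auto_text_path + '/*')),
           kytea_model=kytea_path,
           eda_model=eda_path)  # dependency-parse the files under auto_text_path
print('Saving the results')
PARSER.save(tree_path)  # save the dependency-parsed trees to files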
# Enable unigram, bigram, dependency-bigram and dependency-trigram features;
# the Index is built by reading these features from the trees.
INDEX = Index(unigram=1, dep_trigram=1, bigram=1, dep_bigram=1)
print('Loading the trees')
# Build the index from the files under the tree_path folder.
INDEX.add_index(sorted(glob.glob(tree_path + '/*')))
print('Saving the INDEX...')
INDEX.save(index_path)  # save the index to index_path
print(index_path)
print("Loading the Index...")
VECTORIZER = Vectorizer(index_path, t=1, list=whitelist)  # load the Index; the threshold t is 1
print('Loading the trees')
vectors = VECTORIZER.get_vector(sorted(glob.glob(tree_path + '/*')),
                                filter=3)  # generate the vectors
print(vectors)
print('Saving the vectors')
filename_list = sorted(glob.glob(tree_path + '/*'))
vector_path_list = []
for filename in filename_list:
    base_name = os.path.basename(filename)  # e.g. A.text
    root = os.path.splitext(base_name)[0]  # e.g. A
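
The excerpt calls Index.add_index without showing what it does internally. As a rough illustration only, the sketch below shows one way an n-gram feature index could assign integer ids to features. ToyIndex, its constructor flags, and the "a_b" bigram naming are all hypothetical and are not taken from the Text2Feature project. Unlike the original, which receives file paths, this version takes already tokenized documents so that it runs on its own.

```python
from collections import Counter
from typing import Dict, Iterable, List


class ToyIndex:
    """Hypothetical stand-in for a feature index (NOT the Text2Feature Index class)."""

    def __init__(self, unigram: bool = True, bigram: bool = False) -> None:
        self.unigram = unigram
        self.bigram = bigram
        self.feature_ids: Dict[str, int] = {}  # feature string -> integer id
        self.counts: Counter = Counter()       # feature string -> frequency

    def _features(self, tokens: List[str]) -> List[str]:
        """Return the enabled features for one tokenized document."""
        feats: List[str] = []
        if self.unigram:
            feats.extend(tokens)
        if self.bigram:
            # Adjacent token pairs, joined with "_" (an arbitrary choice).
            feats.extend(f"{a}_{b}" for a, b in zip(tokens, tokens[1:]))
        return feats

    def add_index(self, documents: Iterable[List[str]]) -> None:
        """Register every feature seen in the documents, in first-seen order."""
        for tokens in documents:
            for feat in self._features(tokens):
                if feat not in self.feature_ids:
                    self.feature_ids[feat] = len(self.feature_ids)
                self.counts[feat] += 1


index = ToyIndex(unigram=True, bigram=True)
index.add_index([["the", "cat", "sat"], ["the", "dog", "sat"]])
print(len(index.feature_ids))  # number of distinct features registered
```

Assigning ids in first-seen order keeps the mapping deterministic for a fixed input order, which may be one reason the excerpt sorts the glob results before indexing.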