コード例 #1
0
ファイル: crawler.py プロジェクト: thekiminlee/Web-Crawler
 def process(self, text, doc_id):
     """Tokenize *text* with TextProcessor and return the updated index.

     A TextProcessor is constructed around the current index (``self.index``)
     and the document's id, run over the text, and the index it produces is
     handed back to the caller.
     """
     tp = TextProcessor(text, self.index, doc_id)
     tp.process_text()
     return tp.get_index()
コード例 #2
0
ファイル: prepare_data.py プロジェクト: ShT3ch/luigi_workshop
    def run(self):
        """Tokenize the ``name`` column of every input CSV and write results.

        For each input target, the ``name`` column is processed with
        TextProcessor (Russian language) and the resulting tokens are joined
        back into a single space-separated string before the frame is written
        to the matching output target.
        """
        logger.info('Creating text processor')
        text_processor = TextProcessor()

        # Hoist the target dicts out of the loop: the originals re-called
        # self.input()/self.output() on every iteration.
        inputs = self.input()
        outputs = self.output()

        for key, target in inputs.items():
            logger.info('Reading %s file: "%s"', key, target.path)
            df = pd.read_csv(target.path)

            logger.info('Its %s lines', df.shape[0])
            logger.info('Start processing %s...', key)

            # Tokenize and re-join in a single pass over the column.
            # []-indexing instead of df.name attribute assignment: attribute
            # writes are fragile when the column name collides with DataFrame
            # attributes.
            df['name'] = df['name'].map(
                lambda x: ' '.join(text_processor.process_text(x, lang='ru'))
            )

            out_path = outputs[key].path
            logger.info('Processing of %s succeed, writing it to "%s"', key, out_path)

            df.to_csv(out_path)
コード例 #3
0
# Script entry point (visible portion): trains a NaiveBayes classifier on the
# .txt files under data/1/training, then begins processing the testing set.
# NOTE(review): the testing loop is truncated in this view — whatever consumes
# `sentences` there lives past the end of this excerpt.
if __name__ == "__main__":
    # Depth-limited pretty-printer, presumably for inspecting nested results
    # later in the script — TODO confirm against the un-shown remainder.
    pp = pprint.PrettyPrinter(indent=4, depth=2)

    # Initialize classifier
    classifier = NaiveBayes()

    # Train
    for f in find("data/1/training"):
        # `find` presumably yields file-path strings — TODO confirm; each path
        # is stripped of surrounding whitespace before the extension check.
        f = f.strip()
        # Only plain-text documents participate in training.
        if not f.endswith(".txt"):
            continue

        with open(f) as doc:
            text = doc.read()

        # Variable name suggests sentence segmentation/tokenization — confirm
        # against nlp.process_text's actual contract.
        sentences = nlp.process_text(text)

        # Label is derived from the file path: any path mentioning "movie" is
        # labeled "movie", everything else is treated as "play".
        label = "movie" if "movie" in f else "play"

        classifier.train(sentences, label=label)

    # Test
    for f in find("data/1/testing"):
        f = f.strip()
        if not f.endswith(".txt"):
            continue

        with open(f) as doc:
            text = doc.read()

        # Same preprocessing as the training loop; classification of these
        # sentences presumably follows beyond this excerpt.
        sentences = nlp.process_text(text)