Python Preprocessor.preprocess_sentence 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: utils.preprocessor

클래스/타입: Preprocessor

메소드/함수: preprocess_sentence

hotexamples.com에서의 예제들: 2

Python Preprocessor.preprocess_sentence - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 utils.preprocessor.Preprocessor.preprocess_sentence에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

Preprocessor(16)

extract_sentences(2)

preprocess_sentence(2)

aggregateData(1)

build_sequence(1)

fit(1)

get_data(1)

label_transform(1)

np_process_obses(1)

remove_stop_words_from_sentencelist(1)

stem_word(1)

stop_word_eliminate(1)

to_lower_case(1)

예제 #1

파일 보기

파일: weights_handler.py 프로젝트: animeshramesh/T-Sum

    def generate_STM(self):    

        preprocessor = Preprocessor()
        for sentence in self.__sentenceList:
            preprocessed_words = preprocessor.preprocess_sentence(sentence)
            sentence_weight = []
            for feature in self.tot_weight_dict().keys():
                if feature in preprocessed_words:
                    sentence_weight.append(self.__tot_weight_dict[feature])
                else:
                    sentence_weight.append(0)
                
            self.__sentenceWeight_dict[sentence] = sentence_weight

예제 #2

파일 보기

파일: testing.py 프로젝트: animeshramesh/T-Sum

from sets.size import Size
from sets.intersections import Intersections
from sets.scorer import Scorer
from graphs.node_ranker import NodeRanker
from sets.distributed_ranks import RankDistributor



input_path = '/home/animesh/T-Sum/Data sets/Inception/'
files = [f for f in os.listdir(input_path) if os.path.isfile(input_path + f)]
prep = Preprocessor()
sentence_list = prep.extract_sentences(files, input_path)
preprocessed_words_in_each_sentence = []

for s in sentence_list:
    preprocessed_words_in_each_sentence.append(prep.preprocess_sentence(s)) 

size = Size()
intersections = Intersections()
scorer = Scorer()
ranker = NodeRanker()
rank_counter_in_0_to_1 = RankDistributor()

size_of_sets = size.calculate_size_of_set(preprocessed_words_in_each_sentence)
number_of_intersections_of_each_sentence = intersections.count_itersections_of_each_set(preprocessed_words_in_each_sentence)
scores = scorer.score_sentences(number_of_intersections_of_each_sentence, size_of_sets)

normalised_scores = scorer.normalise_score(scores)
distributed_ranks = rank_counter_in_0_to_1.distribute_ranks(normalised_scores)
print distributed_ranks