def accept(self, consumer_input: PipedInput):
    """Lemmatize the input text and keep only recognized-language tokens.

    Lowercases the text, runs it through ``self.mystem.lemmatize``
    (presumably a pymystem3 Mystem instance — TODO confirm), strips each
    resulting token, and keeps only tokens that pass one of the
    Russian / Belarusian / English checks.

    Returns a new ``PipedInput`` whose text is the surviving tokens
    joined by single spaces; meta and other fields are untouched.
    """
    tokens = self.mystem.lemmatize(consumer_input.get_text().lower())
    # Comprehension instead of the original append loop (ruff PERF401):
    # strip first, then apply the language filter to the stripped token.
    kept = [
        stripped
        for stripped in (token.strip() for token in tokens)
        if is_russian(stripped) or is_belarusian(stripped) or is_english(stripped)
    ]
    return consumer_input.new(text=" ".join(kept))
def accept(self, consumer_input: PipedInput):
    """Tokenize the lowercased text and stem every recognized token.

    Russian/Belarusian tokens are stemmed with ``self.russian_stemmer``,
    English tokens with ``self.english_stemmer``; tokens matching
    neither language check are dropped.  Returns a new ``PipedInput``
    with the stemmed tokens joined by single spaces.
    """
    lowered = consumer_input.get_text().lower()
    stems = []
    for word in word_tokenize(lowered):
        word = word.strip()
        # Two independent checks on purpose: a token that somehow
        # passes both produces two stems, matching the original logic.
        if is_russian(word) or is_belarusian(word):
            stems.append(self.russian_stemmer.stem(word))
        if is_english(word):
            stems.append(self.english_stemmer.stem(word))
    return consumer_input.new(text=" ".join(stems))
def accept(self, consumer_input: PipedInput):
    """Deserialize the raw (JSON string) meta field into a Python object."""
    parsed_meta = json.loads(consumer_input.get_meta())
    return consumer_input.new(meta=parsed_meta)
def accept(self, consumer_input: PipedInput):
    """Prepend the meta title to the document text.

    Produces ``"<title> . <text>"`` as the new text and re-serializes
    the meta mapping back to a JSON string.
    """
    title = consumer_input.get_meta()["title"]
    combined = title + " . " + consumer_input.get_text()
    serialized_meta = json.dumps(consumer_input.get_meta())
    return consumer_input.new(text=combined, meta=serialized_meta)
def accept(self, consumer_input: PipedInput):
    """Strip stopwords from both the document text and the meta title.

    Shallow-copies the meta mapping so the incoming input's meta is not
    mutated, runs ``self.filter_stopwords`` over its ``title`` and over
    the main text, and returns a new ``PipedInput`` carrying both.
    """
    filtered_meta = copy(consumer_input.get_meta())
    filtered_meta["title"] = self.filter_stopwords(filtered_meta["title"])
    filtered_text = self.filter_stopwords(consumer_input.get_text())
    return consumer_input.new(text=filtered_text, meta=filtered_meta)
def accept(self, consumer_input: PipedInput):
    """Lemmatize both the document text and the meta title.

    Shallow-copies the meta mapping so the incoming input's meta stays
    untouched, applies ``self.lemmatize`` to its ``title`` and to the
    main text, and returns a new ``PipedInput`` carrying both.
    """
    lemmatized_meta = copy(consumer_input.get_meta())
    lemmatized_meta["title"] = self.lemmatize(lemmatized_meta["title"])
    lemmatized_text = self.lemmatize(consumer_input.get_text())
    return consumer_input.new(text=lemmatized_text, meta=lemmatized_meta)
def accept(self, consumer_input: PipedInput):
    """Parse the JSON meta and promote its "url" field to the document id."""
    parsed = json.loads(consumer_input.get_meta())
    # The URL doubles as a unique document identifier downstream —
    # presumably guaranteed present in meta; verify against producers.
    return consumer_input.new(doc_id=parsed["url"], meta=parsed)