Python sent_tokenize示例

编程语言: Python

命名空间/包名称: textblob.tokenizers

方法/功能: sent_tokenize

hotexamples.com的示例: 4

Python sent_tokenize - 已找到4个示例。这些是从开源项目中提取的最受好评的textblob.tokenizers.sent_tokenize现实Python示例。您可以评价示例，以帮助我们提高示例质量。

示例#1

显示文件

文件： blob.py 项目： Arttii/TextBlob

 def _create_sentence_objects(self):
     '''Returns a list of Sentence objects from the raw text.
     '''
     sentence_objects = []
     sentences = sent_tokenize(self.raw)
     char_index = 0  # Keeps track of character index within the blob
     for sent in sentences:
         # Compute the start and end indices of the sentence
         # within the blob
         start_index = self.raw.index(sent, char_index)
         char_index += len(sent)
         end_index = start_index + len(sent)
         # Sentences share the same models as their parent blob
         s = Sentence(sent, start_index=start_index, end_index=end_index,
             tokenizer=self.tokenizer, np_extractor=self.np_extractor,
             pos_tagger=self.pos_tagger, analyzer=self.analyzer,
             parser=self.parser, classifier=self.classifier)
         sentence_objects.append(s)
     return sentence_objects

示例#2

显示文件

文件： blob.py 项目： wdw110/TextBlob

 def _create_sentence_objects(self):
     '''Returns a list of Sentence objects from the raw text.
     '''
     sentence_objects = []
     sentences = sent_tokenize(self.raw)
     char_index = 0  # Keeps track of character index within the blob
     for sent in sentences:
         # Compute the start and end indices of the sentence
         # within the blob
         start_index = self.raw.index(sent, char_index)
         char_index += len(sent)
         end_index = start_index + len(sent)
         # Sentences share the same models as their parent blob
         s = Sentence(sent, start_index=start_index, end_index=end_index,
             tokenizer=self.tokenizer, np_extractor=self.np_extractor,
             pos_tagger=self.pos_tagger, analyzer=self.analyzer,
             parser=self.parser, classifier=self.classifier)
         sentence_objects.append(s)
     return sentence_objects

示例#3

显示文件

 def test_sent_tokenize(self):
     tokens = sent_tokenize(self.text)
     assert_true(is_generator(tokens))  # It's a generator
     assert_equal(list(tokens), self.tokenizer.tokenize(self.text))

示例#4

显示文件

文件： test_tokenizers.py 项目： ANDRESVA/TextBlob

 def test_sent_tokenize(self):
     tokens = sent_tokenize(self.text)
     assert_true(is_generator(tokens))  # It's a generator
     assert_equal(list(tokens), self.tokenizer.tokenize(self.text))