def to_list_of_tokenized_sentences(self, text):
    """Split a text into sentences and tokenize each one.

    Each inner list holds the tokens of one non-empty sentence, in order.

    Parameters
    ----------
    text : str
        Raw message text.

    Returns
    -------
    list of list of str
        One token list per non-empty sentence.
    """
    tokenized_sentences = []
    for sentence in split_message_to_sentences(text):
        # Skip empty sentences so no empty token list is emitted.
        if sentence != "":
            tokenized_sentences.append(self.tokenizer._tokenize(sentence))
    return tokenized_sentences
def to_list_of_tokenized_sentences(self, text):
    """Create a list of token lists from a text, one per sentence.

    Parameters
    ----------
    text : str
        Raw message text.

    Returns
    -------
    list of list of str
        One token list per non-empty sentence.
    """
    # FIX: the pattern was a non-raw string containing invalid escape
    # sequences ("\w", "\?", "\-"), which emit SyntaxWarning /
    # DeprecationWarning on modern CPython and will become errors.
    # The raw string below yields the exact same regex: runs of word
    # characters optionally joined by one of ? - ' " _.
    token_pattern = r"\w+(?:[?\-'\"_]\w+)*"
    sentences_list = split_message_to_sentences(text)
    return [
        nltk.regexp_tokenize(sentence, pattern=token_pattern)
        for sentence in sentences_list
        if sentence != ""  # skip empties so no empty token list is emitted
    ]