Python blankline Examples

Programming Language: Python

Namespace/Package Name: nltk.tokenize

Method/Function: blankline

Examples at hotexamples.com: 4

Python blankline - 4 examples found. These are the top-rated real-world Python examples of nltk.tokenize.blankline extracted from open source projects.
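
The examples below call blankline without showing what it returns. As a rough, standard-library-only sketch, assuming blankline splits text wherever a blank line occurs (the behaviour of NLTK's BlanklineTokenizer, whose current function form is blankline_tokenize), the splitting could look like this. The helper name split_on_blank_lines is hypothetical and is not part of NLTK.

```python
import re

# Assumed pattern: two newlines, optionally surrounded by other whitespace.
# This mirrors the gap pattern used by NLTK's BlanklineTokenizer.
BLANK_LINE_GAP = re.compile(r"\s*\n\s*\n\s*")


def split_on_blank_lines(text):
    """Hypothetical stand-in for tokenize.blankline: return the chunks of
    text that are separated by one or more blank lines."""
    # re.split can produce empty strings at the edges, so drop them.
    return [chunk for chunk in BLANK_LINE_GAP.split(text) if chunk]


paragraphs = split_on_blank_lines("First paragraph.\n\nSecond one,\nstill second.\n \n\nThird.")
print(paragraphs)
```

A single newline inside a paragraph is kept; only runs containing at least two newlines act as separators.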

Example #1

File: analyser.py Project: BjoernKW/Topicalizer

    def processParagraphs(self, corpus):
        from nltk import tokenize

        # split the corpus into paragraphs at blank lines
        return tokenize.blankline(corpus)

Example #2

File: analyser.py Project: Wikiwix/Topicalizer

    def processParagraphs(self, corpus):
        from nltk import tokenize

        # split the corpus into paragraphs at blank lines
        return tokenize.blankline(corpus)

Example #3

File: tag2tab.py Project: DrDub/icsisumm

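import os

from nltk import tokenize

# get_basedir() and tag2tab() are defined elsewhere in tag2tab.py (not shown)

def tabtagged(files='chunked', basedir=None):
    """
    @param files: One or more treebank files to be processed
    @type files: L{string} or L{tuple(string)}
    @param basedir: Directory containing the "treebank" folder;
        defaults to get_basedir()
    @return: iterator over lines in Malt-TAB input format
    """
    # accept a single file name as well as a tuple of names
    if isinstance(files, str):
        files = (files,)

    if not basedir:
        basedir = get_basedir()

    for file in files:
        # use the resolved basedir instead of calling get_basedir() again,
        # so that a caller-supplied basedir is honoured
        path = os.path.join(basedir, "treebank", file)
        with open(path) as f:
            text = f.read()

        for sent in tokenize.blankline(text):
            l = []
            for t in tokenize.whitespace(sent):
                # skip the '[' and ']' bracket tokens
                if t != '[' and t != ']':
                    l.append(tag2tab(t))
            # add a blank line as sentence separator
            l.append('\n')
            yield l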
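
The loop in this example combines three steps: split the file on blank lines, split each block on whitespace, and drop the bracket tokens. A minimal standard-library sketch of that pattern follows. It is an illustration only: re.split and str.split stand in for tokenize.blankline and tokenize.whitespace, and the convert argument is a hypothetical placeholder for tag2tab, whose implementation is not shown on this page.

```python
import re


def bracket_free_tokens(text, convert=lambda token: token):
    """Yield one list of converted tokens per blank-line-separated block.

    `convert` is a hypothetical placeholder for tag2tab; by default it
    returns each token unchanged.
    """
    # Assumed stand-in for tokenize.blankline: split on blank lines.
    for block in re.split(r"\s*\n\s*\n\s*", text):
        if not block:
            continue
        # Assumed stand-in for tokenize.whitespace: split on whitespace,
        # then drop the '[' and ']' bracket tokens.
        tokens = [convert(t) for t in block.split() if t not in ("[", "]")]
        # Same separator the original loop appends after each sentence.
        tokens.append("\n")
        yield tokens


sample = "[ The/DT cat/NN ]\nsat/VBD\n\n[ It/PRP ]\npurred/VBD"
for sentence in bracket_free_tokens(sample):
    print(sentence)
```

Passing the conversion step in as an argument keeps the sketch runnable without the project's own helpers.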

Example #4

File: tag2tab.py Project: steven-cutting/icsisumm

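import os

from nltk import tokenize

# get_basedir() and tag2tab() are defined elsewhere in tag2tab.py (not shown)

def tabtagged(files='chunked', basedir=None):
    """
    @param files: One or more treebank files to be processed
    @type files: L{string} or L{tuple(string)}
    @param basedir: Directory containing the "treebank" folder;
        defaults to get_basedir()
    @return: iterator over lines in Malt-TAB input format
    """
    # accept a single file name as well as a tuple of names
    if isinstance(files, str):
        files = (files,)

    if not basedir:
        basedir = get_basedir()

    for file in files:
        # use the resolved basedir instead of calling get_basedir() again,
        # so that a caller-supplied basedir is honoured
        path = os.path.join(basedir, "treebank", file)
        with open(path) as f:
            text = f.read()

        for sent in tokenize.blankline(text):
            l = []
            for t in tokenize.whitespace(sent):
                # skip the '[' and ']' bracket tokens
                if t != '[' and t != ']':
                    l.append(tag2tab(t))
            # add a blank line as sentence separator
            l.append('\n')
            yield l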