Python SimilaritySet.add Examples

Programming Language: Python

Namespace/Package Name: similarity_set

Class/Type: SimilaritySet

Method/Function: add

Examples at hotexamples.com: 2

Python SimilaritySet.add - 2 examples found. These are the top rated real world Python examples of similarity_set.SimilaritySet.add extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

add(2)

set_callback(2)

update(1)

Frequently Used Methods

add (2)

set_callback (2)

update (1)

Example #1

Show file

File: extract_authors.py Project: QScience/pdfparser_code

def rm_existing(authors):
    ''' removes duplicate authors '''
    res = SimilaritySet(cutoff=CUTOFF)
    for a in authors:
        for b in authors:
            if b in a and string_similarity(b, a)<0.90:
                a = a.replace(b, '')
        res.add(a.strip())

    return res

Example #2

Show file

File: extract_authors.py Project: QScience/pdfparser_code

def filter_authors(tags):
    ''' reads the xml author tags, 
        filters duplicates and stopwords in the text ''' 
    res = SimilaritySet(cutoff=CUTOFF)
    res.set_callback(replace)
    for author in tags:
        if "confidence" in author.attrib:
            # split authors on special characters
            author_text = map(lambda x: x.strip(), re.split(tokenize_regex, turn_unicode(author.text)))
            #author_text = turn_unicode(author.text).strip()
        
        #for a in [author_text]:
        for a in author_text:
            if not len(a.split(" ")) > 6:
                res.add(a)

    res = rm_existing(res)
    res = rm_stopwords(stopwords, res)
    return res