Python UniversalDetector Examples

Programming Language: Python

Namespace/Package Name: lib.chardet.universaldetector

Examples at hotexamples.com: 3

Python UniversalDetector - 3 examples found. These are the top rated real world Python examples of lib.chardet.universaldetector.UniversalDetector extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

UniversalDetector(1)

close(1)

feed(1)

Example #1

Show file

File: __init__.py Project: ktisha/ebook-service

def get_text(html):
    data,page = html
    ud = UniversalDetector()
    ud.feed(data)
    ud.close()
    encoding = ud.result['encoding']
    data = unicode(data, encoding)
    return data

Example #2

Show file

File: LibRu_tests.py Project: ktisha/ebook-service

def get_authors_title_test():
    import urllib
    l = 'http://lib.ru/TXT/ruscience.txt'
    page = urllib.urlopen(l+'_Ascii.txt')
    text = page.read(2048)
    ud = UniversalDetector()
    ud.feed(text)
    ud.close()
    encoding = ud.result['encoding']
    text = unicode(text, encoding)
    authors, title = Retriever.get_authors_and_title(text)
    assert len(authors) == 1
    assert authors[0] == u'Дмитрий Толмацкий'
    assert title == u'Российская наука на пути из реанимации в морг'
#    print 'authors', ",".join( [author.encode('utf8') for author in authors ] )
#    print 'title',title
    pass

Example #3

Show file

 def detectFileEncode(self, filePath):
     detector = UniversalDetector()
     with open(filePath, 'r') as fp:
         for line in fp.readlines():
             detector.feed(line)
             if detector.done: break
         detector.close()
     print detector.result
     return detector.result['encoding']