Python TokenSet.MatchLocationSet 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: gravity.tae.tokenizer

클래스/타입: TokenSet

메소드/함수: MatchLocationSet

hotexamples.com에서의 예제들: 1

Python TokenSet.MatchLocationSet - 1개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 gravity.tae.tokenizer.TokenSet.MatchLocationSet에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

InInterval(4)

EqualByPositionTokens(2)

IntersectedTokens(2)

AND(1)

MatchIntersectedSet(1)

MatchLocationSet(1)

NOT(1)

NotMatchSet(1)

OR(1)

예제 #1

파일 보기

def calc_types_distribution_for_completly_wrongly_recognized_entities(ner_id, lang = 'nl', model = None):
    class NotMatchLocationSet(TokenSet.MatchSet):
        def __init__(self, tokens):
            super(self.__class__, self).__init__(tokens, False)
        
        def match_tokens(self, token1, token2):
            return token1[1] >= 0 and token2[1] >= 0 and (token1[1] != token2[1] or token1[2] != token2[2])  
    
    a = load_all_recognized_tokens(ner_id, lang, model)
    r = load_all_matched_tokens(ner_id, lang, model)
    nm = TokenSet.NotMatchSet(TokenSet.MatchLocationSet(r))
    s = TokenSet(TokenSet(a).tokens(nm))

    misc = s.tokens(Token.NE_MISC)
    loc  = s.tokens(Token.NE_LOC)
    per  = s.tokens(Token.NE_PER)
    org  = s.tokens(Token.NE_ORG)
    print "======== %s Recognized entities type distribution :" % ner_id
    print "LOCATIONS    : %4d  %3d" % (len(loc), (len(loc)*100)/len(s)) 
    print "PERSONS      : %4d  %3d" % (len(per), (len(per)*100)/len(s)) 
    print "ORGANIZATION : %4d  %3d" % (len(org), (len(org)*100)/len(s)) 
    print "MISC         : %4d  %3d" % (len(misc),(len(misc)*100)/len(s)) 
    print "============================="
    print "AMOUNT       : %4d  100" % len(s)