Example #1
0
    def _crawl_tree(self, parse_node):

        """
        Recursively crawl the parse tree rooted at *parse_node*.

        Returns a ``(node, terms)`` tuple where ``node`` is a dict
        describing this parse node (name, type, source, signals, its own
        unique terms, and recursively-built children) and ``terms`` is the
        uniquified list of ``[term_id, context_id]`` pairs collected from
        this node and all of its descendants.
        """

        # Build our list of term contexts that point to a term. Once done,
        # uniqify the list by looking at the term values.
        #
        # NOTE: There must be a better way to do this, also, so we can
        # create more contexts easily without making a large amount of
        # modifications.
        terms = [
            [TermModel.get_word_id(token), context_id_map['docstring']]
            for token in set(tokenize(parse_node.docstring))
        ]
        terms.extend(
            [TermModel.get_word_id(token), context_id_map['comment']]
            for token in set(tokenize(parse_node.comments))
        )
        terms.extend(
            [TermModel.get_word_id(token), context_id_map['library']]
            for token in parse_node.imports
        )

        if parse_node.type == 'Class':
            # BUG FIX: the original referenced `token` here, which is not
            # bound in this scope (a NameError on Python 3). Index the
            # class's own name under the 'class' context instead.
            # NOTE(review): assumes the intent was to index the class
            # name — confirm against the indexing pipeline.
            terms.append([
                TermModel.get_word_id(parse_node.name),
                context_id_map['class']
            ])

        terms = uniqify_terms(terms)

        # NOTE: We copy the list of terms so that we dont modify the list
        # of terms for this treenode as we navigate upwards. Therefore,
        # unique terms from other nodes wont end up in the unique terms for
        # this node.
        node = dict(
            name=parse_node.name,
            type=parse_node.type,
            source=parse_node.source,
            signals=parse_node.signals,
            terms=terms[:],
            children=[],
        )

        # Recurse into children, accumulating their terms into this
        # subtree's running list (node['terms'] stays node-local).
        for child in parse_node.children:
            child_node, child_terms = self._crawl_tree(child)
            node['children'].append(child_node)
            terms.extend(child_terms)

        return node, uniqify_terms(terms)
Example #2
0
def translate(query):

    """
    Returns a list of pairs (token-type-id, token-id) using the lexicon
    provided through the TermModel class. Takes in a non-sanitized query
    string.

    Uses process and match functionality of this module.
    """

    # Convert query string into a list of term types and their sanitized
    # values.
    sanitized = process(query)
    # Lazy %-style args: the message is only formatted if DEBUG is enabled.
    logging.debug('Tokenized query "%s" into %s', query, sanitized)

    # Converts term types and terms into their corresponding integer values
    # from the database and lexicon.
    #
    # BUG FIX: the original returned map(...), which is a lazy iterator on
    # Python 3 and contradicts the documented "list of pairs" contract; a
    # list comprehension always yields a list.
    return [
        (context_id_map[term_type], TermModel.get_word_id(term))
        for term_type, term in sanitized
    ]