Python uri_to_label Examples

Programming language: Python

Namespace/package name: conceptnet5.nodes

Method/function: uri_to_label

Examples at hotexamples.com: 2

Python uri_to_label: 2 examples found. These are the top-rated real-world Python examples of conceptnet5.nodes.uri_to_label, extracted from open source projects.

Example #1

File: transforms.py Project: vpolimenov/conceptnet5

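from conceptnet5.nodes import uri_to_label
from wordfreq import word_frequency


def choose_small_vocabulary(big_frame, concepts_filename, language):
    """
    Choose the vocabulary of the small frame by eliminating the terms that:
     - contain more than one word
     - are not in ConceptNet
     - are not frequent
    """
    # Read the known concepts, closing the file once it has been consumed
    with open(concepts_filename) as concepts_file:
        concepts = set(line.strip() for line in concepts_file)
    vocab = []
    for term in big_frame.index:
        # Multi-word terms are joined with underscores, so skip them
        if '_' not in term and term in concepts:
            try:
                frequency = word_frequency(uri_to_label(term),
                                           language,
                                           wordlist='large')
            except LookupError:
                # No 'large' wordlist for this language: use 'combined'
                frequency = word_frequency(uri_to_label(term),
                                           language,
                                           wordlist='combined')
            vocab.append((term, frequency))
    # Keep the 50,000 most frequent terms, most frequent first
    small_vocab = [
        term for term, frequency in sorted(
            vocab, key=lambda x: x[1], reverse=True)[:50000]
    ]
    return small_vocab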
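
The filtering and ranking in Example #1 can be tried without ConceptNet or wordfreq installed. The following is only a rough sketch under stated assumptions: uri_to_label_sketch is a hypothetical stand-in for conceptnet5.nodes.uri_to_label (it assumes URIs of the form /c/<language>/<term>), and toy_frequency with its made-up numbers stands in for wordfreq's word_frequency. None of these names come from the original project.

```python
def uri_to_label_sketch(uri):
    """Hypothetical stand-in for conceptnet5.nodes.uri_to_label.

    Assumes term URIs shaped like /c/<language>/<term>[/...] and returns
    the term with underscores replaced by spaces.
    """
    parts = uri.strip('/').split('/')
    if len(parts) >= 3 and parts[0] == 'c':
        return parts[2].replace('_', ' ')
    return parts[-1].replace('_', ' ')


# Made-up frequencies; a stand-in for wordfreq.word_frequency.
TOY_FREQUENCIES = {'cat': 1e-4, 'dog': 2e-4, 'aardvark': 1e-7}


def toy_frequency(word):
    """Return the made-up frequency of `word`, or 0.0 if unknown."""
    return TOY_FREQUENCIES.get(word, 0.0)


def choose_vocabulary_sketch(terms, concepts, limit):
    """Keep single-word terms found in `concepts`, most frequent first."""
    scored = [
        (term, toy_frequency(uri_to_label_sketch(term)))
        for term in terms
        if '_' not in term and term in concepts
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return [term for term, _ in scored[:limit]]


terms = ['/c/en/cat', '/c/en/dog', '/c/en/hot_dog',
         '/c/en/aardvark', '/c/en/zebra']
concepts = {'/c/en/cat', '/c/en/dog', '/c/en/hot_dog', '/c/en/aardvark'}

small_vocab = choose_vocabulary_sketch(terms, concepts, limit=2)
print(small_vocab)  # ['/c/en/dog', '/c/en/cat']
```

Here /c/en/hot_dog is dropped because it contains an underscore, /c/en/zebra because it is not in the concept set, and /c/en/aardvark because only the two most frequent terms are kept.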

Example #2

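import pandas as pd
from conceptnet5.nodes import uri_to_label

# standardized_uri and replace_numbers are not shown in this excerpt; they
# are assumed to be defined or imported elsewhere in the same module.


def get_vector(frame, label, language=None):
    """
    Returns the row of a vector-space DataFrame `frame` corresponding
    to the text `label`. If `language` is set, this can take in plain text
    and normalize it to ConceptNet form. Either way, it can also take in
    a label that is already in ConceptNet form.
    """
    if frame.index[0].startswith('/'):  # This frame has URIs in its index
        if not label.startswith('/'):
            # Plain text: convert it to a ConceptNet URI before the lookup
            label = standardized_uri(language, label)
        try:
            return frame.loc[label]
        except KeyError:
            # Return a vector of all NaNs
            return pd.Series(index=frame.columns, dtype=float)
    else:  # This frame is indexed by plain-text labels
        if label.startswith('/'):
            label = uri_to_label(label)
        try:
            return frame.loc[replace_numbers(label)]
        except KeyError:
            # Return a vector of all NaNs
            return pd.Series(index=frame.columns, dtype=float)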
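
The branching in Example #2 (match the form of the key to the form of the index, then fall back to NaNs) can be illustrated with plain dictionaries in place of a DataFrame. This is a simplified sketch, not the project's code: label_to_uri_sketch and uri_to_label_sketch are hypothetical stand-ins for standardized_uri and uri_to_label, the table is assumed to be non-empty, URIs are assumed to look like /c/<language>/<term>, and the replace_numbers step is left out.

```python
import math


def label_to_uri_sketch(language, label):
    """Hypothetical stand-in for standardized_uri: /c/<language>/<term>."""
    term = label.strip().lower().replace(' ', '_')
    return '/c/{}/{}'.format(language, term)


def uri_to_label_sketch(uri):
    """Hypothetical stand-in for uri_to_label.

    Assumes a /c/<language>/<term> URI and returns the term with
    underscores replaced by spaces.
    """
    return uri.strip('/').split('/')[2].replace('_', ' ')


def get_vector_sketch(table, width, key, language=None):
    """Look up `key` in `table`, first converting it to the form (URI or
    plain label) that the table's keys use. Missing keys give all NaNs."""
    keys_are_uris = next(iter(table)).startswith('/')
    if keys_are_uris and not key.startswith('/'):
        key = label_to_uri_sketch(language, key)
    elif not keys_are_uris and key.startswith('/'):
        key = uri_to_label_sketch(key)
    return table.get(key, [math.nan] * width)


uri_table = {'/c/en/cat': [1.0, 0.0], '/c/en/hot_dog': [0.0, 1.0]}
label_table = {'cat': [1.0, 0.0], 'hot dog': [0.0, 1.0]}

# Plain text against a URI-keyed table
print(get_vector_sketch(uri_table, 2, 'Hot dog', language='en'))
# A URI against a label-keyed table
print(get_vector_sketch(label_table, 2, '/c/en/hot_dog'))
# An unknown term yields a vector of NaNs
print(get_vector_sketch(label_table, 2, '/c/en/zebra'))
```

The first two lookups both return [0.0, 1.0], and the third returns two NaN values, mirroring the all-NaN Series that get_vector returns on a KeyError.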