Python unpickle_cds 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: CHILDES.pickled.load_pickled

메소드/함수: unpickle_cds

hotexamples.com에서의 예제들: 2

Python unpickle_cds - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 CHILDES.pickled.load_pickled.unpickle_cds에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: load_corpus_data.py 프로젝트: RobGrimm/prediction_based

def get_word_frequencies():
    """ Return a list of (word, frequency) tuples, sorted by frequency, form most to least frequent. """
    tagged_corpus = unpickle_cds()
    words = []
    for file_ in tagged_corpus:
        for sentence_ in tagged_corpus[file_]:
            sentence_ = [(token, collapse_function_tags(pos_tag)) for token, pos_tag in sentence_]
            words += sentence_
    counted_tokens = Counter(words)
    word_frequencies = sorted(counted_tokens.items(), key=operator.itemgetter(1))
    word_frequencies.reverse()
    return word_frequencies

예제 #2

파일 보기

파일: load_corpus_data.py 프로젝트: RobGrimm/prediction_based

def get_cds_words(collapse_function_words=True):
    """
    Load CDS from disk and return lower-cased tokens, as a list of sentences, where a sentence is a list of
    POS-ttagged tokens. Tokens are in the format 'word-pos_tag'. Optionally replace all closed class / function word
    POS tags with the single tag 'fn'.
    """
    sentences = []
    CDS = unpickle_cds()
    for file_name in CDS:
        for s in CDS[file_name]:
            sentences.append([])
            for w, pos_tag in s:
                w = w.lower()
                if collapse_function_words:
                    sentences[-1].append(w + "-" + collapse_function_tags(pos_tag))
                else:
                    sentences[-1].append(w + "-" + pos_tag)
    return sentences