if not os.path.exists(sourcedir + docid + '.tsv'): continue docs.append(row['volid']) logistic.append(float(row['logistic'])) dates.append(float(row['dateused'])) logistic = np.array(logistic) dates = np.array(dates) numdocs = len(docs) categories = dict() for field in fields: categories[field] = np.zeros(numdocs) wordcounts = filecab.get_wordcounts(sourcedir, '.tsv', docs) for i, doc in enumerate(docs): ctcat = Counter() allcats = 0 for word, count in wordcounts[doc].items(): allcats += count for field in fields: if word in inquirer[field]: ctcat[field] += count for field in fields: categories[field][i] = ctcat[field] / (allcats + 1) logresults = [] dateresults = []
# NOTE(review): this chunk arrived with all newlines/indentation stripped onto a
# single physical line (a syntax error as-is).  Reconstructed block structure
# below; the final `for` loop is truncated at the end of the visible chunk, so
# its remaining body (whatever is done with `colorct` / `outrows`) lives past
# this excerpt -- confirm against the full file.

# Metadata lookups keyed by volume id.
logistic = dict()   # volid -> logistic-regression score
realclass = dict()  # volid -> prestige label
titles = dict()     # volid -> title
dates = dict()      # volid -> year used

with open('../metadata/prestigeset.csv', encoding='utf-8') as f:
    reader = csv.DictReader(f)
    for row in reader:
        logistic[row['volid']] = float(row['logistic'])
        realclass[row['volid']] = row['prestige']
        titles[row['volid']] = row['title']
        dates[row['volid']] = int(row['dateused'])

sourcedir = '../sourcefiles/'
# docid -> {word: count} for every volume in the prestige set.
documents = filecab.get_wordcounts(sourcedir, '.tsv', set(logistic))

outrows = []
for docid, doc in documents.items():
    if docid not in logistic:
        continue
    else:
        # allwords starts at 1 so a later ratio cannot divide by zero.
        allwords = 1
        colorct = 0
        for word, count in doc.items():
            allwords += count
            if word in colors:
                colorct += count
        # (loop body continues past this excerpt)