def get_avg_coherence(df, n_topics):
    """Fit NMF with `n_topics` topics on `df` and return the mean topic coherence.

    Parameters
    ----------
    df : pandas.DataFrame
        Article corpus passed straight through to `nmf_articles`.
    n_topics : int
        Number of topics to factorize the corpus into.

    Returns
    -------
    float
        Mean of `topic_coherence` over all topics.
    """
    # Parenthesized single-argument print is output-identical under Python 2
    # and also valid Python 3, unlike the original bare print statement.
    print('{} Topics Processing...'.format(n_topics))
    # NOTE(review): return contract assumed from the unpacking below —
    # confirm against nmf_articles' definition.
    nmf, X, W, W_percent, labels, topic_words, feature_names, reverse_lookup = nmf_articles(
        df, n_topics=n_topics, n_features=10000, random_state=1, max_df=0.8, min_df=5)
    print('Factorizing Done...')
    # ProgressBar wraps the iterable to show progress while scoring each topic.
    pbar = ProgressBar()
    coherence = [topic_coherence(X, reverse_lookup, words) for words in pbar(topic_words)]
    print('\n')
    return np.mean(coherence)
def get_avg_coherence(df, n_topics): print '{} Topics Processing...'.format(n_topics) nmf, X, W, W_percent, labels, topic_words, feature_names, reverse_lookup = nmf_articles( df, n_topics=n_topics, n_features=10000, random_state=1, max_df=0.8, min_df=5) print 'Factorizing Done...' pbar = ProgressBar() coherence = [] for words in pbar(topic_words): coherence.append(topic_coherence(X, reverse_lookup, words)) print '\n' return np.mean(coherence)
# NOTE(review): this chunk begins mid-function -- 'figsize', 'wc', and 'ax'
# come from an enclosing plotting helper whose 'def' line is outside this view.
fig = plt.figure(figsize=figsize)
ax = fig.add_subplot(111)
# Render the (presumably wordcloud) image with no axis chrome.
ax.imshow(wc)
ax.axis('off')


if __name__ == '__main__':
    # Load the pre-scraped election article dataset.
    df = pd.read_pickle('election_data.pkl')
    # Plot % of articles mentioning candidate across all news sources
    # plot_candidate_percentages(df, ['Clinton', 'Trump', 'Bush'])
    # Factorize the corpus into 90 topics; the unpacked names suggest the
    # model, document-term matrix, topic weights, labels, top words per topic,
    # and vocabulary lookups -- confirm against nmf_articles' definition.
    nmf, X, W, W_percent, labels, topic_words, feature_names, reverse_lookup = nmf_articles(
        df, n_topics=90, n_features=10000, random_state=1, max_df=0.8, min_df=5)
    # (source key, display label, plot color) per news outlet.
    outlets = [('nyt', 'NYT', '#4c72b0'), ('foxnews', 'FOX', '#c44e52'),
               ('npr', 'NPR', '#55a868'), ('guardian', 'GUA', '#8172b2'),
               ('wsj', 'WSJ', '#ccb974')]
    # predominant_source = print_topic_summary(df, labels, outlets, topic_words)
    # Create a dictionary with the topic labels for creating the plots
    topic_labels = get_topic_labels()
    # path = './topic_plots/'
    # for idx in xrange(90):
# Create the matplotlib figure and axis if they weren't passed in if not ax: fig = plt.figure(figsize=figsize) ax = fig.add_subplot(111) ax.imshow(wc) ax.axis('off') if __name__=='__main__': df = pd.read_pickle('election_data.pkl') # Plot % of articles mentioning candidate accross all news sources # plot_candidate_percentages(df, ['Clinton', 'Trump', 'Bush']) nmf, X, W, W_percent, labels, topic_words, feature_names, reverse_lookup = nmf_articles(df, n_topics=90, n_features=10000, random_state=1, max_df=0.8, min_df=5) outlets = [('nyt', 'NYT', '#4c72b0'), ('foxnews', 'FOX', '#c44e52'), ('npr', 'NPR', '#55a868'), ('guardian', 'GUA', '#8172b2'), ('wsj', 'WSJ', '#ccb974')] # predominant_source = print_topic_summary(df, labels, outlets, topic_words) # Create a dictionary with the topic labels for creating the plots topic_labels = get_topic_labels() # path = './topic_plots/' # for idx in xrange(90): # # If the topic is junk, skip making the plot # if topic_labels[idx] == 'junk': # print '\n' # continue # print 'Topic {}: {}'.format(str(idx), topic_labels[idx])