# In[44]: inspect the topics produced by the LDA topic model.
# `topics_coherences` (built earlier) holds (topic, coherence) pairs;
# each topic is a list of (weight, term) tuples.
topics_with_wts = [item[0] for item in topics_coherences]
print('LDA Topics with Weights')
print('=' * 50)
for idx, topic in enumerate(topics_with_wts):
    print(f'Topic #{idx + 1}:')
    # show each term alongside its weight, rounded for readability
    print([(term, round(wt, 3)) for wt, term in topic])
    print()

# In[45]: fit a plain (single-core) LdaModel on the TF-IDF corpus
# and dump its top words per topic.
model = LdaModel(corpus=corpus_tfidf, id2word=dictionary, num_topics=10)
for idx, topic in model.print_topics():
    print(f'Topic: ({idx}) word: {topic}')

# In[46]: same topics again, terms only — weights dropped.
print('LDA Topics without Weights')
print('=' * 50)
for idx, topic in enumerate(topics_with_wts):
    print(f'Topic #{idx + 1}:')
    print([term for wt, term in topic])
    print()

# In[52]: build the interactive pyLDAvis visualization for the trained model.
LDA_viz = pyLDAvis.gensim.prepare(lda_model, corpus_tfidf, dictionary)
# Weight the bag-of-words corpus with TF-IDF and peek at the first document's
# (term_id, tfidf_weight) pairs.
corpus_tfidf = tfidf[bow_corpus]
from pprint import pprint
for doc in corpus_tfidf:
    pprint(doc)
    break

#########################
# LDA with BAG OF WORDS
#########################

# train the LDA model (10 topics, 2 passes over the corpus, 2 worker processes)
lda_model = gensim.models.LdaMulticore(bow_corpus, num_topics=10,
                                       id2word=dictionary,
                                       passes=2, workers=2)

# for each topic - explore the words occurring in that topic and their relative weight
for idx, topic in lda_model.print_topics(-1):
    print('Topic {}, \nWords: {}'.format(idx, topic))

# BUG FIX: gensim LDA models expose no sklearn-style `components_` attribute —
# the original `lda_model.components_` raises AttributeError. get_topics()
# returns the (num_topics, vocab_size) topic-term probability matrix.
word_topic = np.array(lda_model.get_topics())
word_topic = word_topic.transpose()  # -> (vocab_size, num_topics): one row per term

num_topics = 10
num_top_words = 10
# NOTE(review): assumes `vocab` (list of terms aligned with the dictionary ids)
# is defined earlier in the notebook — confirm.
vocab_array = np.asarray(vocab)
#fontsize_base = 70 / np.max(word_topic) # font size for word with largest share in corpus
fontsize_base = 10

# One subplot column per topic; the loop body appears to continue beyond this
# chunk (further plotting statements expected).
for t in range(num_topics):
    plt.subplot(1, num_topics, t + 1)  # plot numbering starts with 1