Python get_article_id_for_interest Examples

Programming language: Python

Namespace/package name: utils

Method/function: get_article_id_for_interest

Examples at hotexamples.com: 5

Python get_article_id_for_interest: 5 examples found. These are the top-rated real-world Python examples of utils.get_article_id_for_interest, extracted from open source projects.

Example #1

File: test_svd.py Project: shilad/macademia

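import math

import utils  # assumed import; the excerpt omits the file's import lines


def make_doc(interest, dictionary):
    # Map the interest to its article; give up if there is no mapping.
    article_id1 = utils.get_article_id_for_interest(interest)
    if not article_id1:
        return None
    doc = []
    ranks = utils.get_article_similarity_ranks(article_id1, 2000).items()
    for (article_id2, rank) in ranks:
        # Keep only articles that the dictionary knows about.
        if article_id2 in dictionary.token2id:
            # Named token_id to avoid shadowing the built-in id().
            token_id = dictionary.token2id[article_id2]
            # Weight decays with rank: 1 / log2(rank + 5).
            score = 1.0 / (math.log(rank + 5) / math.log(2))
            doc.append((token_id, score))
    return doc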
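
To make the weighting in make_doc easier to see, here is a minimal self-contained sketch of the same 1 / log2(rank + 5) formula applied to toy data. The names rank_to_score and build_doc, and the sample dictionaries, are hypothetical stand-ins introduced for illustration; they are not part of the macademia code or its utils module.

```python
import math


def rank_to_score(rank):
    """Map a similarity rank to a weight: 1 / log2(rank + 5)."""
    return 1.0 / (math.log(rank + 5) / math.log(2))


def build_doc(ranks, token2id):
    """Turn {article_id: rank} into a sparse (token_id, score) list.

    Articles missing from token2id are skipped, as in make_doc.
    """
    return [
        (token2id[article_id], rank_to_score(rank))
        for article_id, rank in ranks.items()
        if article_id in token2id
    ]


# Hypothetical data standing in for utils.get_article_similarity_ranks()
# and for a dictionary's token2id mapping.
example_ranks = {101: 3, 102: 11, 999: 27}
example_token2id = {101: 0, 102: 1}
example_doc = build_doc(example_ranks, example_token2id)
```

With these inputs, article 999 is dropped because it has no token id, and the two remaining weights come out close to 1/3 and 1/4, since log2(8) = 3 and log2(16) = 4. The + 5 offset keeps the denominator well above zero for small ranks.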

Example #2

File: write_sparse_matrix.py Project: shilad/macademia

def build_article_adjacencies(interests):
    article_sims = collections.defaultdict(list)
    for i in interests:
        article_id = utils.get_article_id_for_interest(i)
        if not article_id:
            continue
        index1 = id_to_index(article_id)
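        ranks = utils.get_article_similarity_ranks(article_id, 2000).items()
        # sorted() accepts both a Python 2 list and a Python 3 dict view,
        # whereas .sort() only exists on lists.
        ranks = sorted(ranks, key=lambda pair: pair[1])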
        for (article_id2, rank) in ranks:
            article_sims[index1].append(article_id2)

    return article_sims
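
The following sketch runs the same adjacency-building idea on toy data so the resulting order is visible. It rests on several assumptions: the two lookup functions are in-memory stubs rather than the real utils functions, the sample ids and ranks are invented, and the result is keyed by article id directly because the id_to_index helper is not shown in the excerpt.

```python
import collections

# Hypothetical in-memory data standing in for the utils module's lookups.
INTEREST_TO_ARTICLE = {"jazz": 10, "knitting": 20}
SIMILARITY_RANKS = {
    10: {11: 2, 12: 1, 13: 3},
    20: {21: 1},
}


def get_article_id_for_interest(interest):
    """Stub: return the mapped article id, or None if there is none."""
    return INTEREST_TO_ARTICLE.get(interest)


def get_article_similarity_ranks(article_id, limit):
    """Stub: return {similar_article_id: rank} with at most `limit` entries."""
    ranks = SIMILARITY_RANKS.get(article_id, {})
    best = sorted(ranks.items(), key=lambda pair: pair[1])[:limit]
    return dict(best)


def build_article_adjacencies(interests):
    """Map each mapped article id to its neighbours, lowest rank first."""
    article_sims = collections.defaultdict(list)
    for interest in interests:
        article_id = get_article_id_for_interest(interest)
        if not article_id:
            continue  # no article for this interest, so skip it
        ranks = get_article_similarity_ranks(article_id, 2000)
        ordered = sorted(ranks.items(), key=lambda pair: pair[1])
        for neighbour_id, _rank in ordered:
            article_sims[article_id].append(neighbour_id)
    return article_sims


adjacencies = build_article_adjacencies(["jazz", "opera", "knitting"])
```

Here "opera" has no mapped article and is skipped by the same falsy check the examples use, while the neighbours of article 10 are stored in rank order as 12, 11, 13.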

Example #3

File: write_sparse_matrix.py Project: emmyrlim/macademia

def build_article_adjacencies(interests):
    article_sims = collections.defaultdict(list)
    for i in interests:
        article_id = utils.get_article_id_for_interest(i)
        if not article_id:
            continue
        index1 = id_to_index(article_id)
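        ranks = utils.get_article_similarity_ranks(article_id, 2000).items()
        # sorted() accepts both a Python 2 list and a Python 3 dict view,
        # whereas .sort() only exists on lists.
        ranks = sorted(ranks, key=lambda pair: pair[1])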
        for (article_id2, rank) in ranks:
            article_sims[index1].append(article_id2)

    return article_sims

Example #4

File: test_svd.py Project: shilad/macademia

def describe_lda():
    utils.init()
    model = gensim.models.ldamodel.LdaModel.load('svd/lda.txt')
    def article_name(article_id):
        name = utils.get_article_name(article_id)
        return name.encode('ascii', 'ignore') if name else 'unknown'

#    print 'information about topics:'
#    for i in random.sample(range(model.num_topics), 50):
#        print 'topic %d:' % i
#        topic = model.state.get_lambda()[i]
#        topic = topic / topic.sum() # normalize to probability dist
#        for id in numpy.argsort(topic)[::-1][:10]:
#            score = topic[id]
#            article_id = model.id2word[id]
#            print '\t%.6f: %s' % (score, article_name(article_id))

    dictionary = model.id2word
    interests = list(utils.get_all_interests())
    for i in random.sample(interests, 50):
        article_id1 = utils.get_article_id_for_interest(i)
        if not article_id1:
            continue
        doc = make_doc(i, dictionary)

        doc_lda = model[doc]
        doc_lda.sort(key=lambda pair: pair[1])
        doc_lda.reverse()
        sys.stdout.write('topics for %s (article %s):\n' % (i.text, article_name(article_id1)))
        for (topic_id, topic_score) in doc_lda:
            sys.stdout.write('\t%.6f topic %d:' % (topic_score, topic_id))
            topic = model.state.get_lambda()[topic_id]
            topic = topic / topic.sum() # normalize to probability dist
            for id in numpy.argsort(topic)[::-1][:10]:
                score = topic[id]
                article_id = model.id2word[id]
                sys.stdout.write(', ' + article_name(article_id))
            sys.stdout.write('\n')

Example #5

File: test_svd.py Project: shilad/macademia

def build_interests_to_articles(self):
    for i in self.interests:
        article_id = utils.get_article_id_for_interest(i)
        if article_id:
            self.mapped_interests.append((i, article_id))