Python recover_topics Examples

Programming language: Python

Namespace/package name: ankura

Method/function: recover_topics

Examples at hotexamples.com: 5

Python recover_topics - 5 examples found. These are the top-rated real-world Python examples of ankura.recover_topics, extracted from open source projects.

Example #1

File: newsgroups.py Project: lwthatcher/ankura

    def run(name, anchors):
        topics = ankura.recover_topics(dataset, anchors)
        features = ankura.topic_combine(topics, dataset)
        train, test = ankura.pipeline.train_test_split(features, .9)

        vw_contingency = ankura.measure.vowpal_contingency(train, test, 'dirname')
        print(name, 'accuracy:', ankura.measure.vowpal_accuracy(train, test, 'dirname'))
        print(name, 'f-Measure:', vw_contingency.fmeasure())
        print(name, 'ari:', vw_contingency.ari())
        print(name, 'rand:', vw_contingency.rand())
        print(name, 'vi:', vw_contingency.vi())

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 10):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-10:', numpy.mean(coherence))

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 15):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-15:', numpy.mean(coherence))

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 20):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-20:', numpy.mean(coherence))
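
The example above repeats the same averaging step for topic summaries of 10, 15, and 20 words. A minimal sketch of that pattern as a single helper might look like the following. It assumes nothing about ankura's internals: `mean_coherence_by_size`, `summarize`, and `score` are hypothetical names, and the toy callables only mimic the shape of the `ankura.topic.topic_summary_indices` and `ankura.measure.topic_coherence` calls so that the sketch runs without the library. It also uses `statistics.fmean` from the standard library in place of `numpy.mean` to stay self-contained.

```python
from statistics import fmean


def mean_coherence_by_size(summarize, score, sizes=(10, 15, 20)):
    """Average a per-topic score over topic summaries of each size.

    summarize(n) -> iterable of topic summaries with n entries each
    score(summary) -> float
    Both callables are hypothetical stand-ins supplied by the caller.
    """
    return {n: fmean(score(summary) for summary in summarize(n)) for n in sizes}


# Toy stand-ins (not ankura code): two "topics", each a ranked list of word ids.
ranked_topics = [list(range(0, 30)), list(range(100, 130))]


def toy_summarize(n):
    # Keep the first n word ids of every topic.
    return [topic[:n] for topic in ranked_topics]


def toy_score(summary):
    # Placeholder score: the summary length, so the result is easy to check.
    return float(len(summary))


result = mean_coherence_by_size(toy_summarize, toy_score)
for n, value in result.items():
    print('coherence-%d:' % n, value)
```

With ankura available, the two callables could presumably wrap the calls shown in the example, for instance `lambda n: ankura.topic.topic_summary_indices(topics, dataset, n)` and `lambda t: ankura.measure.topic_coherence(t, dataset)`.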

Example #2

    def run(name, anchors):
        topics = ankura.recover_topics(dataset, anchors)
        features = ankura.topic_combine(topics, dataset)
        train, test = ankura.pipeline.train_test_split(features, .9)

        vw_contingency = ankura.measure.vowpal_contingency(
            train, test, 'dirname')
        print(name, 'accuracy:',
              ankura.measure.vowpal_accuracy(train, test, 'dirname'))
        print(name, 'f-Measure:', vw_contingency.fmeasure())
        print(name, 'ari:', vw_contingency.ari())
        print(name, 'rand:', vw_contingency.rand())
        print(name, 'vi:', vw_contingency.vi())

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 10):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-10:', numpy.mean(coherence))

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 15):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-15:', numpy.mean(coherence))

        coherence = []
        for topic in ankura.topic.topic_summary_indices(topics, dataset, 20):
            coherence.append(ankura.measure.topic_coherence(topic, dataset))
        print(name, 'coherence-20:', numpy.mean(coherence))

Example #3

File: newsgroups.py Project: nOkuda/ankura

def demo():
    """Runs the newsgroups demo"""
    dataset = get_newsgroups()
    anchors = ankura.gramschmidt_anchors(dataset, 20, 500)
    topics = ankura.recover_topics(dataset, anchors)

    for topic in ankura.topic.topic_summary_tokens(topics, dataset, 20):
        print(' '.join(topic))

Example #4

File: server.py Project: nOkuda/ankura

def topic_inference(raw_anchors):
    """Returns inferred topic info from raw anchors"""
    dataset = args.get_dataset()

    if raw_anchors is None:
        anchor_tokens, anchors = args.default_anchors()
    else:
        anchor_tokens = ankura.util.tuplize(json.loads(raw_anchors))
        anchors = user_anchors(anchor_tokens)

    topics = ankura.recover_topics(dataset, anchors, epsilon=1e-6)
    topic_summary = ankura.topic.topic_summary_tokens(topics, dataset, n=15)

    return topics, topic_summary, anchor_tokens
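
JSON has no tuple type, so `json.loads` returns the anchors as nested lists, which are mutable and unhashable. The call to `ankura.util.tuplize` in the example presumably converts them to tuples. The sketch below illustrates that idea with a hypothetical helper, `to_nested_tuples`; it is an assumption about the behavior, not ankura's actual implementation, and the sample anchor words are made up for illustration.

```python
import json


def to_nested_tuples(obj):
    """Recursively turn lists (as produced by json.loads) into tuples."""
    if isinstance(obj, list):
        return tuple(to_nested_tuples(item) for item in obj)
    return obj


# Made-up payload: each inner list is one anchor made of one or more words.
raw_anchors = '[["space", "nasa"], ["hockey"]]'
anchor_tokens = to_nested_tuples(json.loads(raw_anchors))
print(anchor_tokens)

# Tuples are hashable, so the anchors can now serve as dictionary or cache keys.
cache = {anchor_tokens: 'topics for these anchors'}
```

Whether the server in this example actually relies on hashability is not shown in the excerpt; the sketch only demonstrates why a list-to-tuple conversion is a common step after decoding JSON.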

Example #5

File: server.py Project: lwthatcher/ankura

def topic_inference(raw_anchors):
    """Returns inferred topic info from raw anchors"""
    dataset = args.get_dataset()

    if raw_anchors is None:
        anchor_tokens, anchors = args.default_anchors()
    else:
        anchor_tokens = ankura.util.tuplize(json.loads(raw_anchors))
        anchors = user_anchors(anchor_tokens)

    topics = ankura.recover_topics(dataset, anchors, epsilon=1e-6)
    topic_summary = ankura.topic.topic_summary_tokens(topics, dataset, n=15)

    return topics, topic_summary, anchor_tokens