Python LDA.fit_transform_one 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: creme.decomposition

클래스/타입: LDA

메소드/함수: fit_transform_one

hotexamples.com에서의 예제들: 4

Python LDA.fit_transform_one - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 creme.decomposition.LDA.fit_transform_one에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

LDA(11)

_update_indexes(5)

fit_transform_one(4)

_compute_statistics_components(3)

_update_weights(3)

_get_text(2)

fit_one(2)

preprocess(2)

process_text(2)

tokenizer(2)

transform_one(2)

예제 #1

파일 보기

파일: test_.py 프로젝트: zie225/creme

def test_five_components():
    '''
    Assert that components computed are identical to the original version for n dimensions.
    '''
    np.random.seed(42)

    n_components = 5

    online_lda = LDA(
        n_components=n_components,
        number_of_documents=60,
        maximum_size_vocabulary=100,
        alpha_beta=100,
        alpha_theta=0.5,
    )

    components_list = []

    for document in DOC_SET:
        components_list.append(online_lda.fit_transform_one(document))

    for index, component in enumerate(components_list):
        assert np.array_equal(
            a1=list(component.values()),
            a2=REFERENCE_FIVE_COMPONENTS[index],
        )

예제 #2

파일 보기

파일: test_.py 프로젝트: zeta1999/creme

def test_five_components():
    """
    Assert that components computed are identical to the original version for n dimensions.
    """

    n_components = 5

    lda = LDA(n_components=n_components,
              number_of_documents=60,
              maximum_size_vocabulary=100,
              alpha_beta=100,
              alpha_theta=0.5,
              seed=42)

    components_list = []

    for document in DOC_SET:
        tokens = {token: 1 for token in document.split(' ')}
        components_list.append(lda.fit_transform_one(tokens))

    for index, component in enumerate(components_list):
        assert np.array_equal(
            a1=list(component.values()),
            a2=REFERENCE_FIVE_COMPONENTS[index],
        )

예제 #3

파일 보기

파일: test_.py 프로젝트: zie225/creme

def test_prunning_vocabulary():
    '''
    Vocabulary prunning is available to improve accuracy and limit memory usage.
    You can perform vocabulary prunning with parameters vocab_prune_interval (int) and
    maximum_size_vocabulary (int).
    '''
    np.random.seed(42)

    online_lda = LDA(n_components=2,
                     number_of_documents=60,
                     vocab_prune_interval=2,
                     maximum_size_vocabulary=3)

    components_list = []

    for document in DOC_SET:
        components_list.append(online_lda.fit_transform_one(x=document))

    for index, component in enumerate(components_list):
        assert np.array_equal(a1=list(component.values()),
                              a2=REFERENCE_COMPONENTS_WITH_PRUNNING[index])

예제 #4

파일 보기

파일: test_.py 프로젝트: zeta1999/creme

def test_prunning_vocabulary():
    """
    Vocabulary prunning is available to improve accuracy and limit memory usage.
    You can perform vocabulary prunning with parameters vocab_prune_interval (int) and
    maximum_size_vocabulary (int).
    """

    lda = LDA(n_components=2,
              number_of_documents=60,
              vocab_prune_interval=2,
              maximum_size_vocabulary=3,
              seed=42)

    components_list = []

    for document in DOC_SET:
        tokens = {token: 1 for token in document.split(' ')}
        components_list.append(lda.fit_transform_one(tokens))

    for index, component in enumerate(components_list):
        assert np.array_equal(a1=list(component.values()),
                              a2=REFERENCE_COMPONENTS_WITH_PRUNNING[index])