Python TfidfTransformer.idf_ Examples

Programming Language: Python

Namespace/Package Name: sklearn.feature_extraction.text

Class/Type: TfidfTransformer

Method/Function: idf_

Examples at hotexamples.com: 4

Python TfidfTransformer.idf_ - 4 examples found. These are the top rated real world Python examples of sklearn.feature_extraction.text.TfidfTransformer.idf_ extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

TfidfTransformer(30)

fit(30)

fit_transform(30)

todense(12)

transform(8)

toarray(7)

_idf_diag(6)

get_feature_names(6)

get_params(4)

idf_(3)

astype(2)

_get_param_names(2)

set_params(2)

tocsc(2)

tocoo(2)

tolist(1)

tolil(1)

tocsr(1)

stop_words_(1)

getrow(1)

nonzero(1)

mean(1)

max(1)

__dict__(1)

get_shape(1)

fit_transformer(1)

fit_tansform(1)

eliminate_zeros(1)

build_analyzer(1)

__init__(1)

transpose(1)

Example #1

Show file

def test_transformer_idf_setter():
    X = CountVectorizer().fit_transform(JUNK_FOOD_DOCS)
    orig = TfidfTransformer().fit(X)
    copy = TfidfTransformer()
    copy.idf_ = orig.idf_
    assert_array_equal(
        copy.transform(X).toarray(),
        orig.transform(X).toarray())

Example #2

Show file

File: test_text.py Project: LoveYakamoz/scikit-learn

def test_transformer_idf_setter():
    X = CountVectorizer().fit_transform(JUNK_FOOD_DOCS)
    orig = TfidfTransformer().fit(X)
    copy = TfidfTransformer()
    copy.idf_ = orig.idf_
    assert_array_equal(
        copy.transform(X).toarray(),
        orig.transform(X).toarray())

Example #3

Show file

File: tfidf.py Project: wangqi1996/Essay_Scoring

def tfidf_test(data, tf_vocab, idf_diag):
    """ input: sentences """
    vectorizer = CountVectorizer(vocabulary=tf_vocab)
    tf = vectorizer.transform(data)  # 返回的是稀疏表示

    transformer = TfidfTransformer()
    transformer.idf_ = idf_diag
    tfidf = transformer.transform(tf)
    tfidf = tfidf.toarray()

    return tfidf

Example #4

Show file

File: classify_ConsumerOfTopic.py Project: howckeye20071/More2News_Flume_Kafka

def model_forTypeFinal(tags_final):
    f_open = open('/home/stu/model/new_feature_names1.txt',
                  'r',
                  encoding='UTF-8')
    f_text = f_open.read()
    f_list = eval(f_text)  # 將字符串str當成有效的表達式來求值並返回計算結果
    file_set = set(f_list)
    type(f_list)

    # info_forModel = {}
    # info_forModel = info_jieba
    # info_forModel['tags_final'] = tags_final
    # info_forModel['weight'] = weight

    tags_final_forModel = tags_final.split("、")
    tags_setted = list(set(tags_final_forModel) & file_set)
    x_test = [' '.join(tags_setted)]

    f_open = open('/home/stu/model/new_vocabulary.txt', 'r', encoding='UTF-8')
    f_text = f_open.read()
    vocab = eval(f_text)

    f_open = open('/home/stu/model/new_idf_all.txt', 'r', encoding='UTF-8')
    f_text = f_open.read()
    idf_all = np.asarray(eval(f_text))

    count_v2 = CountVectorizer(vocabulary=vocab)
    counts_test = count_v2.transform(x_test)
    # print("the shape of test is " + repr(counts_test.shape))

    tfidftransformer = TfidfTransformer()
    tfidftransformer.idf_ = idf_all
    x_test = tfidftransformer.transform(counts_test)

    model_path = '/home/stu/model/new_clf.pickle'
    model = pickle.load(open(model_path, "rb"))

    y_pred = model.predict(x_test)
    preds = y_pred.tolist()
    id2c = id2c_mapping[preds[0]]

    return id2c