Python TfidfTransformer.tocoo Examples

Programming Language: Python

Namespace/Package Name: sklearn.feature_extraction.text

Class/Type: TfidfTransformer

Method/Function: tocoo

Examples at hotexamples.com: 2

Python TfidfTransformer.tocoo - 2 examples found. These are the top rated real world Python examples of sklearn.feature_extraction.text.TfidfTransformer.tocoo extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

TfidfTransformer(30)

fit(30)

fit_transform(30)

todense(12)

transform(8)

toarray(7)

_idf_diag(6)

get_feature_names(6)

get_params(4)

idf_(3)

astype(2)

_get_param_names(2)

set_params(2)

tocsc(2)

tocoo(2)

tolist(1)

tolil(1)

tocsr(1)

stop_words_(1)

getrow(1)

nonzero(1)

mean(1)

max(1)

__dict__(1)

get_shape(1)

fit_transformer(1)

fit_tansform(1)

eliminate_zeros(1)

build_analyzer(1)

__init__(1)

transpose(1)

Example #1

Show file

File: text_mining.py Project: Hamza-619/Text-Mining-Project

def useful_words(ls):
    """
    this function takes a list of strings and return a list with strings used more than ones
    """
    bow_transformer = CountVectorizer().fit(ls)
    csr_matrix = bow_transformer.transform([" ".join(ls)])
    tfidf_transfrom = TfidfTransformer().fit_transform(csr_matrix)
    """eliminate"""
    tmp_list = []  # tmp_list contains the elements to be eliminated
    Mc = tfidf_transfrom.tocoo()
    for i in Mc.col:
        if Mc.data[Mc.col ==
                   i][0] == Mc.data.min() and Mc.data.min() != Mc.data.max():
            tmp_list.append(bow_transformer.get_feature_names()[i])
    return list(set(ls) - set(tmp_list))

Example #2

Show file

File: cloud.py Project: recski/vwa-tools

def get_tfidf(count_by_plz, vocab):
    print('building array...')
    counts = np.array([[count_by_plz[plz][vocab.get_word(i)] for plz in PLZ]
                       for i in range(len(vocab))])
    counts = counts.transpose()
    print('done, shape:', counts.shape)

    print('calculating TFIDF...')
    tfidf = TfidfTransformer().fit_transform(counts)
    print('done, type and shape:', type(tfidf), tfidf.shape)

    cx = tfidf.tocoo()
    tfidf_by_plz = {plz: defaultdict(int) for plz in PLZ}
    for i, j, v in zip(cx.row, cx.col, cx.data):
        tfidf_by_plz[PLZ[i]][vocab.get_word(j)] = v

    return tfidf_by_plz