Python CountVectorizer.get_features Examples

Programming Language: Python

Namespace/Package Name: sklearn.feature_extraction.text

Class/Type: CountVectorizer

Method/Function: get_features

Examples at hotexamples.com: 1

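Python CountVectorizer.get_features: 1 example found, extracted from an open source project that uses sklearn.feature_extraction.text.CountVectorizer. Note that scikit-learn's CountVectorizer does not define a get_features method, so calling it raises AttributeError. The method that returns the learned vocabulary terms is get_feature_names_out (scikit-learn 1.0 and later) or get_feature_names (older releases; removed in 1.2).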

Frequently Used Methods

CountVectorizer(30)

_validate_vocabulary(30)

fit_transform(30)

fit(30)

build_tokenizer(30)

build_analyzer(30)

get_stop_words(30)

get_params(21)

get_feature_names_out(15)

build_preprocessor(13)

__init__(10)

get_feature_names(9)

dictionary_freeze(6)

count(4)

analyzer(4)

fixed_vocabulary(3)

astype(3)

_count_vocab(2)

copy(2)

fit_trainsform(2)

get_features_names(2)

append(2)

_word_ngrams(2)

get_feature_name(1)

getSenVec(1)

_sort_features(1)

get_features(1)

get_sentence_vector(1)

get_shape(1)

getOutputCol(1)

fit_Transform(1)

fit_trasform(1)

fit_transfrom(1)

fit_transforn(1)

__repr__(1)

fir_transform(1)

__dict__(1)

extract_ngrams(1)

delete_temporary_training_data(1)

count_features(1)

_limit_features(1)

fir(1)

Example #1

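    # Method excerpt from a class; self.corpus is expected to be an iterable of document strings.
    # Assumes: from sklearn.feature_extraction.text import CountVectorizer, TfidfTransformer
    def tfidf_basic(self):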
        vectorizer = CountVectorizer()
        transformer = TfidfTransformer()
        X = vectorizer.fit_transform(self.corpus)
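        print("\nTransform matrix: ")
        print(X.toarray())
        print("\nTransform matrix shape: ")
        print(X.shape)

        # Vocabulary terms across all documents. CountVectorizer has no get_features();
        # get_feature_names_out() is the equivalent call in scikit-learn >= 1.0.
        words = vectorizer.get_feature_names_out()
        print("\nAll features (keywords) across all documents: ")
        print(words)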

        # Matrix with one row per document and one column per token (e.g. word) occurring in the corpus.
        # tfidf_matrix: [n_samples, n_features_new]
        tfidf_matrix = transformer.fit_transform(X)

        tfidf_weight = tfidf_matrix.toarray()  # dense tf-idf matrix corresponding to the counts in X
        print("\ntf-idf matrix: ")
        print(tfidf_weight)
        print(tfidf_weight.shape)  # (n_documents, n_features); 4 x 9 per the original comment
        return [words, tfidf_weight]