# Example #1  (stray notebook residue; original cell output was: 0)
# Keep only the word from each (word, count) pair that Counter.most_common()
# produced; `word[:][0]` made a redundant copy of the tuple before indexing,
# so it is simplified to `word[0]` (same result).
vocab = [word[0] for word in vocab]

print(vocab)

# now we get 300 words as vocab and content_final (content that has been cleared)

# Build the TF-IDF matrix restricted to the pre-selected vocabulary.
# NOTE: this can take some time (sklearn TfidfVectorizer).
tfidf = TfidfVectorizer(analyzer='word',
                        stop_words=nltk.corpus.stopwords.words('indonesian'),
                        ngram_range=(1, 1),
                        min_df=0.04,
                        vocabulary=vocab)
tfidf_hasil = tfidf.fit_transform(content_final)
# BUG FIX: `get_feature_name()` does not exist on TfidfVectorizer; the
# correct API is get_feature_names_out() (sklearn >= 1.0; on versions
# older than 1.0 use get_feature_names()).
features = tfidf.get_feature_names_out()
print(features)
print(tfidf_hasil.toarray())

# In[5]:

import numpy
# Export the dense TF-IDF matrix to CSV.
# Changed .todense() -> .toarray(): todense() returns the discouraged
# numpy.matrix type, while toarray() returns a plain ndarray with the same
# values and is the form already used for printing above.
numpy.savetxt('D:/SKRIPSI/percobaan/tfidf1332.csv',
              tfidf_hasil.toarray(),
              delimiter=',')

# In[1]:

#df = pd.DataFrame(data = vocab)
#df