Python CountVectorizer.get_feature_name Examples

Programming language: Python

Namespace/package name: sklearn.feature_extraction.text

Class/type: CountVectorizer

Method/function: get_feature_name

Examples at hotexamples.com: 1

Python CountVectorizer.get_feature_name - 1 example found. This is the top-rated real-world Python example of sklearn.feature_extraction.text.CountVectorizer.get_feature_name, extracted from open source projects.

Example #1

from sklearn.feature_extraction.text import CountVectorizer
import pandas as pd

data = pd.read_csv("x.txt", sep='\t')
data.columns = ['label','body_text']


# clearn_text is a user-defined analyzer function (not shown in this excerpt)
count_vect = CountVectorizer(analyzer=clearn_text)
X_counts = count_vect.fit_transform(data['body_text'])
print(X_counts.shape)
# CountVectorizer has no get_feature_name(); use get_feature_names_out() (scikit-learn >= 1.0)
print(count_vect.get_feature_names_out())

# Each cell holds the number of times a term appears in a document
X_counts_df = pd.DataFrame(X_counts.toarray())


# With N-grams ---------------------------------------------------------------------------------
ngram_vect = CountVectorizer(ngram_range=(1, 3))  # unigrams, bigrams and trigrams
X_counts = ngram_vect.fit_transform(data['body_text'])
print(X_counts.shape)
print(ngram_vect.get_feature_names_out())

X_counts_df = pd.DataFrame(X_counts.toarray())
X_counts_df.columns = ngram_vect.get_feature_names_out()

'''
# TF-IDF ----------------------------------------------------------------------------------------
# need to learn more
1st: count how many times a word appears in a sentence
2nd: count how many sentences also contain this word
3rd: show the result as a percentage