Python CountVectorizer.transform Examples

Programming language: Python

Namespace/package name: scikits.learn.feature_extraction.text

Class/type: CountVectorizer

Method/function: transform

Examples at hotexamples.com: 2

Python CountVectorizer.transform - 2 examples found. These are the top-rated real-world Python examples of scikits.learn.feature_extraction.text.CountVectorizer.transform, extracted from open source projects.

Example #1

File: test_text.py Project: jolos/scikit-learn

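from scikits.learn.feature_extraction.text import CountVectorizer

# assert_equal and JUNK_FOOD_DOCS are defined elsewhere in test_text.py.
def test_countvectorizer_custom_vocabulary():
    what_we_like = ["pizza", "beer"]
    # Passing a vocabulary fixes the feature set up front.
    vect = CountVectorizer(vocabulary=what_we_like)
    vect.fit(JUNK_FOOD_DOCS)
    # Fitting must not add terms beyond the supplied vocabulary.
    assert_equal(set(vect.vocabulary), set(what_we_like))
    X = vect.transform(JUNK_FOOD_DOCS)
    # One column per vocabulary term.
    assert_equal(X.shape[1], len(what_we_like))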
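
The test above checks that a custom vocabulary fixes the number of output columns. The following is a minimal, dependency-free sketch of that idea, not the library's implementation. The corpus `docs`, the helper `count_with_fixed_vocabulary`, and the tokenization rule (lowercase, word tokens of two or more characters) are assumptions made for illustration; the real `JUNK_FOOD_DOCS` fixture lives in test_text.py.

```python
import re

# Hypothetical stand-in corpus, invented for this sketch.
docs = ["the pizza pizza beer", "the burger beer", "the salad water"]
vocabulary = ["pizza", "beer"]


def count_with_fixed_vocabulary(documents, vocab):
    """Return one row of term counts per document, one column per vocabulary term."""
    column = {term: j for j, term in enumerate(vocab)}
    rows = []
    for doc in documents:
        row = [0] * len(vocab)
        # Assumed tokenization: lowercase, word tokens of two or more characters.
        for token in re.findall(r"\b\w\w+\b", doc.lower()):
            if token in column:  # tokens outside the vocabulary are ignored
                row[column[token]] += 1
        rows.append(row)
    return rows


X = count_with_fixed_vocabulary(docs, vocabulary)
```

Each row of `X` has exactly `len(vocabulary)` entries regardless of how many distinct words the documents contain, which is the property the final assertion in the test relies on.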

Example #2

File: classify.py Project: quinnchr/twitter-classify

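# Assumed imports: the snippet omits them; `load` is taken to be numpy.load.
from numpy import load
from scikits.learn.feature_extraction.text import CountVectorizer
from scikits.learn.svm import LinearSVC


class SVM:

    def __init__(self, training, classes, vocabulary):
        # The three arguments are presumably paths to saved arrays.
        vocabulary = load(vocabulary)
        # No fit() is called on the vectorizer; the supplied vocabulary defines the columns.
        self.cv = CountVectorizer(vocabulary=vocabulary.tolist())
        self.samples = load(training).tolist()
        self.classes = load(classes)
        self.classifier = LinearSVC()
        self.classifier.fit(self.samples, self.classes)

    def classify(self, text):
        # transform() takes an iterable of documents, so wrap the single string.
        features = self.cv.transform([text])
        # predict() returns one label per row; return the only one.
        return self.classifier.predict(features)[0]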
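
For readers on current releases, the same pattern can be sketched against the modern `sklearn` package rather than the older `scikits.learn` namespace used above. This is an illustrative sketch under stated assumptions: the vocabulary, training documents, and labels are toy data invented here, and the original project loads its arrays from files instead.

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.svm import LinearSVC

# Toy data invented for this sketch; not taken from the original project.
vocabulary = ["pizza", "beer", "salad", "water"]
train_docs = ["pizza beer", "beer pizza pizza", "salad water", "water salad salad"]
train_labels = ["junk", "junk", "healthy", "healthy"]

# With a vocabulary passed to the constructor, transform works without a prior fit.
cv = CountVectorizer(vocabulary=vocabulary)
classifier = LinearSVC(random_state=0)
classifier.fit(cv.transform(train_docs), train_labels)


def classify(text):
    # transform expects an iterable of documents, so wrap the single string in a list.
    features = cv.transform([text])
    # predict returns one label per input row; take the only one.
    return classifier.predict(features)[0]


label = classify("pizza and beer")
```

Passing a bare string to `transform` instead of a one-element list is a common mistake, which is why both the original `classify` method and this sketch wrap the text in a list.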