Exemplos de TFIDF.doc2weight em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: microtc.weighting

Classe / Tipo: TFIDF

Método / Função: doc2weight

Exemplos em hotexamples.com: 2

TFIDF.doc2weight em Python - 2 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de microtc.weighting.TFIDF.doc2weight em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

TFIDF(6)

counter(3)

doc2weight(2)

Métodos Frequentes

TFIDF (6)

counter (3)

doc2weight (2)

Exemplo n.º 1

0

Exibir arquivo

def test_doc2weight(): from microtc.textmodel import TextModel from microtc.weighting import TFIDF from microtc.utils import tweet_iterator import os fname = join(os.path.dirname(__file__), 'text.json') tw = list(tweet_iterator(fname)) docs = [x['text'] for x in tw] text = TextModel(docs, token_list=[-1, 3]) # print(text['buenos dias']) docs = [text.tokenize(d) for d in docs] sp = TFIDF(docs) assert len(sp.doc2weight(text.tokenize('odio odio los los'))) == 3

Exemplo n.º 2

0

Exibir arquivo

def test_getitem(): from microtc.textmodel import TextModel from microtc.weighting import TFIDF from microtc.utils import tweet_iterator import os fname = join(os.path.dirname(__file__), 'text.json') tw = list(tweet_iterator(fname)) docs = [x['text'] for x in tw] text = TextModel(docs, token_list=[-1, 3]) # print(text['buenos dias']) docs = [text.tokenize(d) for d in docs] sp = TFIDF(docs) tok = text.tokenize('buenos dias') bow = sp.doc2weight(tok) ids = bow[0] assert len(ids) == len(sp[tok])