Example #1
import nltk

def process_content():
    for i in tokenized:
        words = nltk.word_tokenize(i)
        tagged = nltk.pos_tag(words)

        # Chunk grammar: optional adverbs and verbs, then one or more
        # proper nouns, then an optional common noun.
        chunkgram = r"""chunk: {<RB.?>*<VB.?>*<NNP>+<NN>?}"""
        chunkParser = nltk.RegexpParser(chunkgram)
        chunked = chunkParser.parse(tagged)  # parse with the parser, not the grammar string

        chunked.draw()
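
Both of the first two examples assume a `tokenized` list of sentences that is never shown. A minimal, hypothetical setup for running them (the sample text is invented for illustration):

import nltk

# Hypothetical driver code: tokenized must be a list of sentences.
sample_text = "Mr. Smith briefly visited the White House on Monday."
tokenized = nltk.sent_tokenize(sample_text)

process_content()  # opens one chunk-tree window per sentence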
Example #2
import nltk

def process_content():
    for i in tokenized:
        words = nltk.word_tokenize(i)
        tagged = nltk.pos_tag(words)

        # Chunk everything, then chink (remove) verbs, prepositions,
        # determiners, and "to". The chink rule must sit on its own line.
        chunkgram = r"""chunk: {<.*>+}
                               }<VB.?|IN|DT|TO>+{"""
        chunkParser = nltk.RegexpParser(chunkgram)
        chunked = chunkParser.parse(tagged)

        chunked.draw()
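
The `}...{` pattern here is a chink: it chunks every tag first, then carves the listed tags back out. On a headless machine chunked.draw() cannot open a window; a small substitute for that last line, printing only the chunks (the filter label must match the "chunk" name used in the grammar):

# Drop-in replacement for chunked.draw(): print each chunk subtree.
for subtree in chunked.subtrees(filter=lambda t: t.label() == "chunk"):
    print(subtree)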
Example #3
import nltk
import numpy
from nltk.stem.lancaster import LancasterStemmer

stemmer = LancasterStemmer()  # assumed: the snippet uses an undefined stemmer

def bag_words(s, words):
    bag = [0 for _ in range(len(words))]

    s_words = nltk.word_tokenize(s)
    s_words = [stemmer.stem(word.lower()) for word in s_words]

    for se in s_words:
        for i, w in enumerate(words):
            if w == se:
                bag[i] = 1  # bag holds ints, so assign rather than append

    return numpy.array(bag)
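
A quick usage sketch, assuming the same Lancaster stemmer; the vocabulary is invented for illustration. Since the vocabulary and the sentence pass through the same stemmer, a position is 1 exactly when that vocabulary word occurs in the sentence:

vocab = sorted({stemmer.stem(w) for w in ["hello", "how", "are", "you"]})
print(bag_words("How are you?", vocab))
# prints a 0/1 array: 1 for "how", "are", "you"; 0 for "hello"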
Example #4
import json
import pickle

import nltk
from nltk.stem.lancaster import LancasterStemmer

stemmer = LancasterStemmer()  # assumed: the snippet uses an undefined stemmer

with open("cauhoi.json") as file:
    data = json.load(file)

try:
    # Reuse the preprocessed data if it has already been pickled.
    with open("data.pickle", "rb") as f:
        words, labels, training, output = pickle.load(f)
except FileNotFoundError:  # catch the missing cache, not everything
    words = []
    labels = []
    docs_x = []
    docs_y = []

    for cauhoi in data["cauhoi"]:
        for question in cauhoi["question"]:
            wrds = nltk.word_tokenize(question)
            words.extend(wrds)
            docs_x.append(wrds)
            docs_y.append(cauhoi["tag"])

        # Collect each tag once, after its questions are tokenized.
        if cauhoi["tag"] not in labels:
            labels.append(cauhoi["tag"])

    # Stem and deduplicate once, outside the loops, so every question
    # contributes to the vocabulary before it is sorted.
    words = [stemmer.stem(w.lower()) for w in words if w != "?"]
    words = sorted(list(set(words)))

    labels = sorted(labels)

    training = []
    output = []

    out_empty = [0 for _ in range(len(labels))]
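
The excerpt stops right after out_empty, leaving training and output empty and the pickle never written. A hedged sketch of the continuation this kind of preprocessing usually has (still inside the except block, and assuming numpy is imported): each question becomes a bag-of-words row plus a one-hot label row, and the result is cached for the try branch above.

    for x, doc in enumerate(docs_x):
        wrds = [stemmer.stem(w.lower()) for w in doc]

        # 1 if the vocabulary stem occurs in this question, else 0.
        bag = [1 if w in wrds else 0 for w in words]

        # One-hot row marking this question's tag.
        output_row = out_empty[:]
        output_row[labels.index(docs_y[x])] = 1

        training.append(bag)
        output.append(output_row)

    training = numpy.array(training)
    output = numpy.array(output)

    # Cache the arrays so the next run skips this whole block.
    with open("data.pickle", "wb") as f:
        pickle.dump((words, labels, training, output), f)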