Example #1
def parse_data(text):
    '''Segment text and return a dict mapping each word to its POS tag.'''
    # segmenter and tagger are assumed to be module-level objects,
    # e.g. tagger = pos_tagger.load_model(lang='zh')
    words = segmenter.seg(text)
    context = {}
    # POS Tagging: tagger.predict returns (word, tag) tuples
    tagging = tagger.predict(words)
    for (w, t) in tagging:
        context[w] = t
    return context
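A minimal usage sketch for parse_data above; the load_model call mirrors the later examples, and the inline result comment follows the #Results block shown in the full script example below (everything in this sketch is illustrative, not part of the original snippet):

from deepnlp import segmenter
from deepnlp import pos_tagger

tagger = pos_tagger.load_model(lang='zh')

tags = parse_data("我爱吃北京烤鸭")
for word, tag in tags.items():
    print(word + "/" + tag)   # e.g. 我/r, 爱/v, 吃/v, 北京/ns, 烤鸭/n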
Example #2
    def analyze(self, string):
        '''Return a list of three strings: segmented text, POS tagging, and NER tagging.'''
        res = []
        #segment
        words = segmenter.seg(string)
        segment_str = " ".join(words)
        res.append(segment_str)

        #POS
        pos_tagging = self.tag_pos(words)
        res.append(_concat_tuples(pos_tagging))

        #NER
        ner_tagging = self.tag_ner(words)
        res.append(_concat_tuples(ner_tagging))
        return res
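A hypothetical call site for the analyze method above; the instance name nlp and the class it belongs to are assumptions for illustration only:

# nlp is assumed to be an instance of the analyzer class that defines analyze()
segment_str, pos_str, ner_str = nlp.analyze("我爱吃北京烤鸭")
print(segment_str)   # space-joined segmented words
print(pos_str)       # "word/tag" pairs joined by spaces
print(ner_str)       # "word/entity" pairs joined by spaces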
Example #3
#coding:utf-8
from __future__ import unicode_literals  # compatible with python3 unicode

from deepnlp import segmenter
from deepnlp import pos_tagger
tagger = pos_tagger.load_model(lang='zh')

#Segmentation
text = "我爱吃北京烤鸭"  # unicode coding, py2 and py3 compatible
words = segmenter.seg(text)
print(" ".join(words).encode('utf-8'))

#POS Tagging
tagging = tagger.predict(words)
for (w, t) in tagging:
    pair = w + "/" + t
    print(pair.encode('utf-8'))

#Results
#我/r
#爱/v
#吃/v
#北京/ns
#烤鸭/n
Example #4
#coding=utf-8
from __future__ import unicode_literals

from deepnlp import segmenter

text = "我刚刚在浙江卫视看了电视剧老九门,觉得陈伟霆很帅"
segList = segmenter.seg(text)
text_seg = " ".join(segList)

print(text.encode('utf-8'))
print(text_seg.encode('utf-8'))
Example #5
import codecs
import os

# BASE_DIR, segmenter, tagger_pos and tagger_ner are assumed to be defined
# elsewhere (e.g. tagger_pos = pos_tagger.load_model(lang='zh')).
def _concat_tuples(tagging):
    '''Join (word, tag) tuples into a "word/tag word/tag ..." string.'''
    TOKEN_BLANK = " "
    wl = []  # word list
    for (x, y) in tagging:
        wl.append(x + "/" + y)
    concat_str = TOKEN_BLANK.join(wl)
    return concat_str

# read input file
docs = []
input_file = codecs.open(os.path.join(BASE_DIR, 'docs_test.txt'), 'r', encoding='utf-8')
for line in input_file:
    line = line.replace("\n", "").replace("\r", "")
    docs.append(line)

# Test each individual module
# output file
fileOut = codecs.open(os.path.join(BASE_DIR, 'modules_test_results.txt'), 'w', encoding='utf-8')
words = segmenter.seg(docs[0])
pos_tagging = _concat_tuples(tagger_pos.predict(words))
ner_tagging = _concat_tuples(tagger_ner.predict(words))

fileOut.write(" ".join(words) + "\n")
fileOut.write(pos_tagging + "\n")
fileOut.write(ner_tagging + "\n")
fileOut.close()

print (" ".join(words).encode('utf-8'))
print (pos_tagging.encode('utf-8'))
print (ner_tagging.encode('utf-8'))
Example #6
    def segment(self, string):
        '''Return a list of segmented words.'''
        words = segmenter.seg(string)
        return words
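A quick illustrative call, assuming an analyzer instance (hypothetically nlp) and the sentence used in the other examples; the expected segmentation matches the #Results block above:

print(nlp.segment("我爱吃北京烤鸭"))   # ['我', '爱', '吃', '北京', '烤鸭']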