Python txt2tmp 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: tokenizer

메소드/함수: txt2tmp

hotexamples.com에서의 예제들: 6

Python txt2tmp - 6개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 tokenizer.txt2tmp에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: splicer.py 프로젝트: alvations/DLTK

def bananasplit(text):
  """ Dictionary + string search splitter. Only two element splits."""
  txt2tmp(text)
  command = "java -jar banana-split-standalone-0.4.0.jar "+ \
            "igerman98_all.xml < /tmp/tmp.in > /tmp/tmp.out"
  os.system(command)
  for i in codecs.open("/tmp/tmp.out","r","utf8"):
    return " ".join([i for i in i.split() if u']' not in i and u'[' not in i])

예제 #2

파일 보기

파일: splicer.py 프로젝트: ikonov/DLTK

def bananasplit(text):
    """ Dictionary + string search splitter. Only two element splits."""
    txt2tmp(text)
    command = "java -jar banana-split-standalone-0.4.0.jar "+ \
              "igerman98_all.xml < /tmp/tmp.in > /tmp/tmp.out"
    os.system(command)
    for i in codecs.open("/tmp/tmp.out", "r", "utf8"):
        return " ".join(
            [i for i in i.split() if u']' not in i and u'[' not in i])

예제 #3

파일 보기

파일: splicer.py 프로젝트: alvations/DLTK

def jwordsplitter(text): # Source: http://www.danielnaber.de/jwordsplitter/
  """ Dictionary based compound splitter. Supports multiple splits."""
  txt2tmp(text)
  os.system("java -jar jwordsplitter-3.4.jar /tmp/tmp.in > /tmp/tmp.out")
  for i in codecs.open("/tmp/tmp.out","r","utf8"):
    return "".join([j for j in i.strip().split(",")])

예제 #4

파일 보기

파일: splicer.py 프로젝트: alvations/DLTK

def smor(text):
  """ Morphological anlaysis with SMOR. you need SMOR in /usr/bin/ """
  txt2tmp(text)
  os.system("smor < /tmp/tmp.in > /tmp/tmp.out")
  return [i.strip() for i in \
          codecs.open("/tmp/tmp.out","r","utf8").readlines()[3:]]

예제 #5

파일 보기

파일: splicer.py 프로젝트: ikonov/DLTK

def smor(text):
    """ Morphological anlaysis with SMOR. you need SMOR in /usr/bin/ """
    txt2tmp(text)
    os.system("smor < /tmp/tmp.in > /tmp/tmp.out")
    return [i.strip() for i in \
            codecs.open("/tmp/tmp.out","r","utf8").readlines()[3:]]

예제 #6

파일 보기

파일: splicer.py 프로젝트: ikonov/DLTK

def jwordsplitter(text):  # Source: http://www.danielnaber.de/jwordsplitter/
    """ Dictionary based compound splitter. Supports multiple splits."""
    txt2tmp(text)
    os.system("java -jar jwordsplitter-3.4.jar /tmp/tmp.in > /tmp/tmp.out")
    for i in codecs.open("/tmp/tmp.out", "r", "utf8"):
        return "".join([j for j in i.strip().split(",")])