Python split_words 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: rusyllab

메소드/함수: split_words

hotexamples.com에서의 예제들: 9

Python split_words - 9개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 rusyllab.split_words에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: main.py 프로젝트: zanatar/Name_generator

 def Parser(self, txt):       
     f = open(txt)
     x = f.read().lower().replace(',', '').replace("'", "").replace("[", "").replace("]", "").replace(".", "").replace("(", "").replace(")", "").replace(";", "").replace(":", "").replace("-", "").split()
     for item in x:
         split = rusyllab.split_words(item.split())
         verbs_collection.extend(split)
     f.close()

예제 #2

파일 보기

def count_of_syllables():
    arr_syllables = []
    file = open("comments/clean_comments.txt", "r")
    for line in file:
        syllables = rusyllab.split_words(line.strip().lower().split())
        for syllable in syllables:
            arr_syllables.append(syllable)
    return dict(Counter(arr_syllables))

예제 #3

파일 보기

파일: lermonet.py 프로젝트: D-Tretyakov/poetrybot

def get_rhyme_ending(word):
    stress_pos = accent.put_stress(word).find('\'')
    if stress_pos == -1:
        return word

    lst = list(word)
    lst[stress_pos - 1] = lst[stress_pos - 1].upper()
    word = ''.join(lst)
    sx = rusyllab.split_words([word])
    for i in range(len(sx)):
        if not sx[i].islower():
            return ''.join(sx[i:]).lower()

예제 #4

파일 보기

파일: main - копия.py 프로젝트: zanatar/Name_generator

 def Parser_input(self):
     x = self.plainTextEdit.toPlainText().lower().replace(',', '').replace(
         "'",
         "").replace("[", "").replace("]", "").replace(".", "").replace(
             "(",
             "").replace(")",
                         "").replace(";",
                                     "").replace(":",
                                                 "").replace("-",
                                                             "").split()
     for item in x:
         split = rusyllab.split_words(item.split())
         verbs_collection.extend(split)

예제 #5

파일 보기

def check(word):
    if (len(word.split()) > 1):
        return "Слишком много слов"
    elif len(word) == 1 and word in cons:
        return 'чё'
    elif word == "/start":
        return "Здаров, епт"
    elif(len(word) < 20):
        syllables = rusyllab.split_words([word])
        syl = syllables[0]
        if syl[0] in cons:
            return check_cons(syllables, word)
        else:
            return check_vow(syllables, word)
    else:
        return "Браток, помедленней"

예제 #6

파일 보기

def check_cons(syllables, word):
    syl = syllables[0]
    tmp = list(syl[:])
    excons = ""
    i = 0
    for let in tmp:
        if let in cons:
            tmp[i] = ""
            i = i + 1
        if i >= 3 and len(syllables) < 2:
            return "хуе" + "".join(syllables)
        if let in conc or let in vow:
            break
    excons = "".join(tmp) + "".join(syllables[1:])
    new_word = rusyllab.split_words([excons])
    result = check_vow(new_word, word)
    return result

예제 #7

파일 보기

def answer2pieces(answer_str, max_answer_len):
    if answer_representation == 'chars':
        # вариант для разбивки на символы
        return rpad_chars(BEG_CHAR + answer_str + END_CHAR, max_answer_len)
    elif answer_representation == 'syllables':
        # вариант для разбивки на слоги
        seq = [BEG_CHAR] + rusyllab.split_words(answer_str.split()) + [END_CHAR]
        l = len(seq)
        if l < max_answer_len:
            seq = seq + list(itertools.repeat(PAD_CHAR, (max_answer_len - l)))
        return seq
    elif answer_representation == 'sentencepiece':
        seq = [BEG_CHAR] + spm_encoder.EncodeAsPieces(answer_str) + [END_CHAR]
        l = len(seq)
        if l < max_answer_len:
            seq = seq + list(itertools.repeat(PAD_CHAR, (max_answer_len - l)))
        return seq
    else:
        raise NotImplementedError()

예제 #8

파일 보기

파일: ru_syllab_tokenizer.py 프로젝트: maks5507/cognitive-complexity

    def tokenize(self,
                 text,
                 use_preproc=False,
                 use_stem=False,
                 use_lemm=False,
                 check_length=True,
                 check_stopwords=True):

        preprocessed_text = text

        if use_preproc:
            preprocessed_text, _ = self.preprocessor.preproc(
                text,
                use_lemm=use_lemm,
                use_stem=use_stem,
                check_stopwords=check_stopwords,
                check_length=check_length)

        syllables = rusyllab.split_words(preprocessed_text.split())
        return list(filter(lambda syl: syl != ' ', syllables))

예제 #9

파일 보기

파일: check_syllable.py 프로젝트: wb-08/frequency_analysis

def split_word(text):
    syllables_lst = rusyllab.split_words(text.strip().lower().split())
    return syllables_lst