from collections import defaultdict


def process_each_prune(tup):
    """Worker function called by the pool.

    If this were defined inside the class, the whole class would be
    copied and sent to each process; defining it at module level avoids
    that. Batching also helps: passing (sent_list, piece, trie) is
    faster than one call per sentence with (sent, piece, trie).
    """
    (items, piece, trie) = tup
    vsum = 0
    freq = defaultdict(int)
    inverted = defaultdict(int)
    L = Lattice()
    for item in items:
        if item is None:
            continue
        (s, score) = item
        vsum += score
        L.set_sentence(s)
        L.populate_nodes(piece, trie)
        for word in L.Viterbi(ret_piece=True):
            freq[word] += score
            inverted[word] += score
    return (vsum, freq, inverted)
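

# The sketch below is illustrative, not part of the trainer: it shows one
# way a batched worker like process_each_prune might be driven and its
# partial results merged. `run_prune_pool` and all parameter names are
# hypothetical.
def run_prune_pool(sentence_freqs, piece, trie, n_proc=4):
    from multiprocessing import Pool
    # Shard the (sentence, score) pairs so each process gets one batch.
    chunks = [sentence_freqs[i::n_proc] for i in range(n_proc)]
    with Pool(n_proc) as pool:
        results = pool.map(process_each_prune,
                           [(chunk, piece, trie) for chunk in chunks])
    # Merge the per-process partial sums.
    vsum, freq, inverted = 0, defaultdict(int), defaultdict(int)
    for v, f, inv in results:
        vsum += v
        for k, c in f.items():
            freq[k] += c
        for k, c in inv.items():
            inverted[k] += c
    return vsum, freq, inverted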


def prune_step_1_always_keep_alternative(self):
    """
    Returns:
        always_keep(dict): piece -> whether the piece must be kept
        alternatives(dict): piece -> its second-best segmentation
    """
    current_piece = self.SentencePiece.get_pieces()
    # Track the results in dicts keyed by the piece string.
    always_keep = dict()
    alternatives = defaultdict(list)
    # First, segment each current sentencepiece to know how it would be
    # resegmented if it were removed from the vocabulary.
    for key, score in current_piece.items():
        L = Lattice()
        L.set_sentence(key)
        L.populate_nodes(current_piece, self.Trie)
        nbests = L.NBest(2, ret_piece=True)
        if len(nbests) == 1:
            # Only one way to segment this piece: it must be kept.
            always_keep[key] = True
        elif len(nbests[0]) >= 2:
            # The best path already splits it into other pieces, so it
            # is redundant and safe to drop.
            always_keep[key] = False
        elif len(nbests[0]) == 1:
            # The piece itself is the best path; record the runner-up
            # as its alternative segmentation.
            always_keep[key] = True
            alternatives[key] = nbests[1]
    return always_keep, alternatives
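

# For context, a condensed sketch (hypothetical, simplified from the
# SentencePiece unigram pruning criterion) of how always_keep and
# alternatives are consumed together with the (vsum, freq, inverted)
# totals produced by process_each_prune. The real algorithm also adjusts
# the frequency sums when a piece is removed; that is omitted here.
def prune_candidates_sketch(pieces, always_keep, alternatives,
                            vsum, freq, inverted):
    import math
    sum_freq = sum(freq.values())
    candidates = []
    for key in pieces:
        if freq[key] == 0 or not always_keep[key]:
            continue  # unused or redundant: droppable outright
        if not alternatives[key]:
            continue  # no alternative segmentation: keep unconditionally
        # Fraction of the corpus whose Viterbi path uses this piece.
        f = inverted[key] / vsum
        # Log-likelihood of the piece itself versus its alternative
        # segmentation emitted piece by piece (counts floored at 1 to
        # avoid log(0) for unseen alternatives).
        logprob = math.log(freq[key]) - math.log(sum_freq)
        logprob_alt = sum(math.log(max(freq[p], 1)) - math.log(sum_freq)
                          for p in alternatives[key])
        candidates.append((key, f * (logprob - logprob_alt)))
    # Pieces whose removal would cost the most likelihood rank first.
    return sorted(candidates, key=lambda x: -x[1])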


def encode_one_sent(self, sent):
    # TODO: can be deleted if encode_pool works well.
    """
    Arguments:
        sent(str): sentence to segment with the sentencepiece vocabulary
    Returns:
        tokenize_sent(str): space-delimited tokenized sentence
    """
    L = Lattice()
    L.set_sentence(sent)
    L.populate_nodes(self.SentencePiece.get_pieces(), self.Trie)
    tokenize_sent = " ".join(L.Viterbi(ret_piece=True))
    # Tokenization must be lossless: joining the pieces restores the input.
    assert "".join(tokenize_sent.split(" ")) == sent
    return tokenize_sent
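

# Hypothetical usage, assuming `trainer` is an instance of the enclosing
# class with its vocabulary (SentencePiece) and Trie already built:
#
#     out = trainer.encode_one_sent("これはぺんです")
#     print(out)                                # e.g. "これは ぺん です"
#     assert "".join(out.split(" ")) == "これはぺんです"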


def process_each(tup):
    """Pool worker: Viterbi-segment a batch of sentences.

    tup: tuple(sentence_list, piece, trie)
    Returns a list of piece lists, one per sentence.
    """
    (items, piece, trie) = tup
    L = Lattice()
    ret = []
    for item in items:
        if item is None:
            continue
        L.set_sentence(item)
        L.populate_nodes(piece, trie)
        ret.append(L.Viterbi(ret_piece=True))
    return ret


def process_each_encode(tup):
    """
    Arguments:
        tup: tuple(sentence_list, piece, trie)
    Returns:
        ret: list of tokenized (space-delimited) sentences
    """
    (items, piece, trie) = tup
    ret = []
    L = Lattice()
    for sent in items:
        if sent is None:
            continue
        L.set_sentence(sent)
        L.populate_nodes(piece, trie)
        tokenize_sent = " ".join(L.Viterbi(ret_piece=True))
        ret.append(tokenize_sent)
        assert "".join(tokenize_sent.split(" ")) == sent
    return ret


def process_each_estep(tup):
    """Pool worker for the EM E-step.

    tup: tuple(sentence_freq_list, pieces, trie)
    Returns (expected, objective, num_tokens) accumulated over the batch.
    """
    expected = defaultdict(int)
    objective = 0
    num_tokens = 0
    (items, pieces, trie) = tup
    L = Lattice()
    for item in items:
        if item is None:
            continue
        (key, freq) = item
        L.set_sentence(key)
        L.populate_nodes(pieces, trie)
        # Marginal piece expectations, weighted by the sentence frequency.
        Z, ret_expected = L.populate_marginal(freq)
        for piece, val in ret_expected.items():  # avoid shadowing `key`
            expected[piece] += val
        num_tokens += len(L.Viterbi())
        objective -= Z
    return (expected, objective, num_tokens)
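

# Hypothetical driver for one EM iteration, showing how the E-step worker
# above might be sharded and its results merged. `run_em_step` and all
# names are illustrative; note that the real SentencePiece M-step applies
# digamma-based (Bayesian) smoothing rather than the plain normalization
# shown here.
def run_em_step(sentence_freqs, pieces, trie, n_proc=4):
    import math
    from multiprocessing import Pool
    chunks = [sentence_freqs[i::n_proc] for i in range(n_proc)]
    with Pool(n_proc) as pool:
        results = pool.map(process_each_estep,
                           [(chunk, pieces, trie) for chunk in chunks])
    # Merge the per-process expected counts, objective, and token counts.
    expected = defaultdict(int)
    objective, num_tokens = 0, 0
    for exp, obj, n in results:
        for k, v in exp.items():
            expected[k] += v
        objective += obj
        num_tokens += n
    # M-step: renormalize expected counts into new piece log-probabilities.
    total = sum(expected.values())
    new_pieces = {k: math.log(v) - math.log(total)
                  for k, v in expected.items()}
    return new_pieces, objective / max(num_tokens, 1)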