Python Ngram.interpolation Beispiele

Programmiersprache: Python

Namespace / Paketname: ngram

Klasse / Typ: Ngram

Methode / Funktion: interpolation

Beispiele auf hotexamples.com: 2

Python Ngram.interpolation - 2 Beispiele gefunden. Dies sind die am besten bewerteten Python Beispiele für die ngram.Ngram.interpolation, die aus Open Source-Projekten extrahiert wurden. Sie können Beispiele bewerten, um die Qualität der Beispiele zu verbessern.

Häufig verwendete Methoden

Anzeigen Verbergen

Ngram(30)

jaccard(5)

cmdline_ngram(5)

getNgrams(4)

deriveNgrams(4)

interpolation(2)

generate(2)

count(2)

calculate_bigram_discounting(1)

dump(1)

__hash__(1)

good_turing_smooting(1)

get_prob(1)

get_before(1)

get_after(1)

getSeg(1)

addCount(1)

addEntry(1)

gen_next_word(1)

gen_next_pos(1)

frequency(1)

bigram_generate_sentences(1)

calculate_bigram_prob(1)

create_unigram_lm(1)

create_top_bigrams(1)

create_bigram_lm(1)

bigram_good_turing(1)

construct(1)

build(1)

chunck_predict(1)

calculate_trigram_prob(1)

calculate_trigram_discounting(1)

calculate_probabilities(1)

calculate_onegram_prob(1)

N(1)

Beispiel #1

Datei anzeigen

Datei: bigram-query.py Projekt: elvis-alexander/StonyBrookCS

    # load bigram
    bigrams = load_bigram(open(argv[1]))
    # load unigram
    uni = load_unigram(open(argv[2]))
    words = uni['words']
    total_tokens = uni['total_tokens']
    training_gram = Ngram(total_tokens, words, bigrams)
    # load x, y, smooth_method
    x = argv[3]
    y = argv[4]
    # exit if x is non-existent in training
    if argv[3] not in words:
        print 'We are incredibly sorry, but the word you requested was not found in the training set'
        exit()
    # return probability if bigram has been seen in training
    if (x, y) in bigrams:
        print 'Pr({}|{}) = {}'.format(
            y, x, training_gram.get_prob(x, y, smooth_index))
    else:
        # bigram (x,y) has not been seen, calculate probability for specific smoothing for bigram (x,y)
        if smooth_method == 'M':
            print "Pr({}|{}) = {}".format(y, x, training_gram.mle(x, y))
        elif smooth_method == 'L':
            print "Pr({}|{}) = {}".format(y, x,
                                          training_gram.laplace_bigram(x, y))
        elif smooth_method == 'I':
            print "Pr({}|{}) = {}".format(
                y, x, training_gram.interpolation(x, y, 0.3))
        else:
            print "Pr({}|{}) = {}".format(y, x, training_gram.pr_k(x, y))

Beispiel #2

Datei anzeigen

 # calculate perplexities
 test_words = []
 sentences = sentence_segmentation(open(argv[3]))
 for sen in sentences:
     tokens = tokenization(start_sym + ' ' + sen + ' ' + end_sym)
     for tok in tokens:
         if tok == '':
             continue
         tok = tok.lower()
         test_words.append(tok)
 test_size = len(test_words)
 # calculating perplexities
 bi_perplexity = 0
 inter_perplexity = 0
 uni_perplexity = 0
 x = start_sym
 # calculate summation
 uni_perplexity += log(training_gram.laplace_unigram(x), 2)
 for y in test_words[1:]:
     bi_perplexity += log(training_gram.laplace_bigram(x, y), 2)
     inter_perplexity += log(training_gram.interpolation(x, y, 0.3), 2)
     uni_perplexity += log(training_gram.laplace_unigram(y), 2)
     x = y
 # calculate perplexities
 bi_perplexity = pow(2, (-1 / float(test_size)) * bi_perplexity)
 inter_perplexity = pow(2, (-1 / float(test_size)) * inter_perplexity)
 uni_perplexity = pow(2, (-1 / float(test_size)) * uni_perplexity)
 # output perplexities
 out = "Laplace Bigram: {}\nInterpolated Bigram: {}\nLaplace Unigram: {}".format(
     str(bi_perplexity), str(inter_perplexity), str(uni_perplexity))
 print out