Python examples for Document.get_candidates

Programming language: Python

Namespace/package name: Document

Class/type: Document

Method/function: get_candidates

Examples on hotexamples.com: 1

Document.get_candidates in Python: 1 example found. This is the top-rated real-world Python example of Document.Document.get_candidates, extracted from open source projects.

Frequently used methods

Document(30)

all_sentences(11)

__str__(5)

__init__(4)

append(3)

addMention(2)

numOfWords(2)

generateWhole(2)

factory(2)

edit(2)

addMeSH(1)

get_candidates(1)

generate_candidate_anaphor_data(1)

generate_candidate_mention_pairs(1)

generate_document(1)

generate_gold_anaphor_data(1)

generate_gold_mention_pairs(1)

get(1)

getID(1)

getIdentifiant(1)

getUID(1)

get_article(1)

get_clean(1)

from_json(1)

get_cls_byname(1)

get_cluster_data(1)

get_stems(1)

name(1)

__dict__(1)

save_collection(1)

set_body_length(1)

set_url(1)

termFrequency(1)

to_json(1)

write2DB(1)

_edit(1)

from_data_frame(1)

addLien(1)

build_n_grams(1)

addRef(1)

addTexte(1)

addTitre(1)

add_anchor_text(1)

add_body_hits(1)

add_sentence(1)

allDocumentsID(1)

addDocument(1)

addAuteur(1)

availableReplacements(1)

calculate_vectors(1)

Example #1

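import math

# The helper functions and the Document, Corpus and ClusteredCorpus classes
# are defined elsewhere in the source project and are not shown here.


def main():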
    clustered_corpus_path = 'clustered_corpus'
    clustered_corpus = read_clustered_corpus(clustered_corpus_path)
    corpus = merge_clustered_corpus_into_a_single_corpus(clustered_corpus)

    target_file_path = 'target.txt'
    text = read_text_file(target_file_path)
    document = Document(text)

    corpus = Corpus(corpus)
    clustered_corpus = ClusteredCorpus(clustered_corpus)

    candidate_to_rank_mapping = {}
    candidate_to_params_mapping = {}
    candidate_to_dfs_in_each_cluster_mapping = {}

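    for candidate in document.get_candidates():
        # Log-scaled term frequency of the candidate in the target document.
        tf = math.log(1.0 + document.get_tf_for(candidate), 10.0)
        # tf = document.get_tf_for(candidate)
        # Inverse document frequency; raises ZeroDivisionError if the
        # corpus reports a document frequency of 0 for the candidate.
        idf = math.log(1.0 + 1.0 / corpus.get_df_for(candidate), 2.0)
        # Cluster-based score returned by ClusteredCorpus.
        cu = clustered_corpus.get_cu_for(candidate)

        # Only cu drives the ranking; tf and idf are stored alongside it.
        rank = cu
        # rank = tf * cu
        # rank = tf * idf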

        dfs_in_each_cluster = clustered_corpus.get_dfs_in_each_cluster_for(candidate)

        candidate_representative = corpus.get_representative_for(candidate)
        candidate_to_rank_mapping[candidate_representative] = rank
        candidate_to_params_mapping[candidate_representative] = (tf, idf, cu)
        candidate_to_dfs_in_each_cluster_mapping[candidate_representative] = dfs_in_each_cluster

    table = generate_table_based_on(
        candidate_to_rank_mapping,
        candidate_to_params_mapping,
        candidate_to_dfs_in_each_cluster_mapping
    )

    save_as_file(table)
    print('Done.')
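
The example above relies on project classes that are not shown on this page. To make the loop over get_candidates() easier to follow, here is a minimal, self-contained sketch. Everything in it is an assumption for illustration: these Document and Corpus classes are hypothetical stand-ins (candidates are simply lowercase word tokens), rank_candidates is a made-up helper, the ranking uses the commented-out tf * idf variant because the cluster-based score cannot be reconstructed from the snippet, and the document frequency gets add-one smoothing, which the original does not apply.

```python
import math
import re
from collections import Counter


def _tokenize(text):
    # Hypothetical tokenizer: lowercase alphabetic words only.
    return re.findall(r"[a-z]+", text.lower())


class Document:
    """Hypothetical stand-in for the project's Document class."""

    def __init__(self, text):
        self._counts = Counter(_tokenize(text))

    def get_candidates(self):
        # Sorted so that iteration order is deterministic.
        return sorted(self._counts)

    def get_tf_for(self, candidate):
        # Raw number of occurrences of the candidate in this document.
        return self._counts[candidate]


class Corpus:
    """Hypothetical stand-in for the project's Corpus class."""

    def __init__(self, texts):
        self._df = Counter()
        for text in texts:
            # Count each token at most once per text (document frequency).
            self._df.update(set(_tokenize(text)))

    def get_df_for(self, candidate):
        return self._df[candidate]


def rank_candidates(document, corpus):
    """Score every candidate with the tf * idf variant from the example."""
    ranks = {}
    for candidate in document.get_candidates():
        tf = math.log(1.0 + document.get_tf_for(candidate), 10.0)
        # Assumption: add 1 to df so unseen candidates do not divide by zero.
        df = 1 + corpus.get_df_for(candidate)
        idf = math.log(1.0 + 1.0 / df, 2.0)
        ranks[candidate] = tf * idf
    return ranks


if __name__ == "__main__":
    doc = Document("cat cat dog")
    corpus = Corpus(["dog bird", "dog fish"])
    for candidate, rank in rank_candidates(doc, corpus).items():
        print(candidate, round(rank, 4))
```

With this toy input, "cat" ranks above "dog": it occurs more often in the target document and does not occur in the corpus at all.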