# NOTE(review): truncated fragment — the opening of the enclosing function and
# cross-validation fold loop is not visible in this file, and this first line
# has lost its leading indentation (likely an extraction artifact).
# Code below is kept byte-identical; only comments were added.
merge_dictionaries(wd_td_ys_bytag, cv_wd_td_ys_by_tag)
                merge_dictionaries(wd_vd_ys_bytag, cv_wd_vd_ys_by_tag)
                merge_dictionaries(td_wd_predictions_by_code,
                                   cv_wd_td_predictions_by_tag)
                merge_dictionaries(vd_wd_predictions_by_code,
                                   cv_wd_vd_predictions_by_tag)

            logger.info("Training completed")
            """ Persist Results to Mongo DB """
            # Collection names for the train (TD) / validation (VD) tagging
            # results; SUFFIX marks the algorithm variant for this run.
            wd_algo = "CRF_MOST_COMMON_TAG"
            SUFFIX = "_CRF_MOST_COMMON_TAG"
            SC_TAGGING_TD, SC_TAGGING_VD = "TEST_SC_TAGGING_TD" + SUFFIX, "TEST_SC_TAGGING_VD" + SUFFIX

            # Run metadata stored alongside the results. Python 2: map()
            # returns a list, and fn.func_name is the Py2 spelling of
            # fn.__name__.
            parameters = dict(train_config)
            parameters["extractors"] = map(lambda fn: fn.func_name, extractors)
            parameters["min_feat_freq"] = MIN_FEAT_FREQ
            parameters.update(training_opt_copy)

            # Persist the accumulated cross-validation gold labels and
            # predictions for train and validation partitions.
            wd_td_objectid = processor.persist_results(
                SC_TAGGING_TD, cv_wd_td_ys_by_tag, cv_wd_td_predictions_by_tag,
                parameters, wd_algo)
            wd_vd_objectid = processor.persist_results(
                SC_TAGGING_VD, cv_wd_vd_ys_by_tag, cv_wd_vd_predictions_by_tag,
                parameters, wd_algo)

            # This outputs 0's for MEAN CONCEPT CODES as we aren't including those in the outputs
            print processor.results_to_string(wd_td_objectid, SC_TAGGING_TD,
                                              wd_vd_objectid, SC_TAGGING_VD,
                                              "TAGGING")
            logger.info("Results Processed")
        # Keep the per-tag classifier, then score it on this fold's train (td)
        # and validation (vd) feature sets.
        tag2word_classifier[tag] = tagger

        td_wd_predictions_by_code[tag] = tagger.classify_many(td_feats)
        vd_wd_predictions_by_code[tag] = tagger.classify_many(vd_feats)

        # Per-tag metrics for this fold, printed for monitoring progress.
        td_metrics = toDict(compute_metrics(wd_td_ys_bytag,  td_wd_predictions_by_code)[tag])
        vd_metrics = toDict(compute_metrics(wd_vd_ys_bytag,  vd_wd_predictions_by_code)[tag])
        print "Fold:", i, "Tag:", tag
        print processor.__metrics_to_str__(pad_str, tag, td_metrics, vd_metrics)

    # Fold finished: merge this fold's labels/predictions into the
    # cross-validation accumulators (cv_*) for overall scoring.
    merge_dictionaries(wd_td_ys_bytag, cv_wd_td_ys_by_tag)
    merge_dictionaries(wd_vd_ys_bytag, cv_wd_vd_ys_by_tag)
    merge_dictionaries(td_wd_predictions_by_code, cv_wd_td_predictions_by_tag)
    merge_dictionaries(vd_wd_predictions_by_code, cv_wd_vd_predictions_by_tag)
    pass

# Persist the window-based binary-MaxEnt tagging results (train + validation)
# and print a combined summary.
CB_TAGGING_TD, CB_TAGGING_VD = "CB_TAGGING_TD", "CB_TAGGING_VD"

parameters = dict(config)
# Earlier experiments (kept for the record):
#   parameters["no_bias"] = True          # better with
#   parameters["AverageWeights"] = False  # Bad - averaging really helps
parameters["extractors"] = [fn.func_name for fn in extractors]
parameters["min_feat_freq"] = MIN_FEAT_FREQ

wd_algo = "MaxEnt-Binary-NLTK"

wd_td_objectid = processor.persist_results(
    CB_TAGGING_TD, cv_wd_td_ys_by_tag, cv_wd_td_predictions_by_tag,
    parameters, wd_algo)
wd_vd_objectid = processor.persist_results(
    CB_TAGGING_VD, cv_wd_vd_ys_by_tag, cv_wd_vd_predictions_by_tag,
    parameters, wd_algo)

print(processor.results_to_string(
    wd_td_objectid, CB_TAGGING_TD, wd_vd_objectid, CB_TAGGING_VD, "TAGGING"))

""" WEIGHTED MEAN F1 CONCEPT CODES = 0.727. Better than WINDOW BASED """
parameters["min_feat_freq"] = MIN_FEAT_FREQ

parameters["num_extractors"] = len(extractors)
parameters["cost_function"] = cost_fn.__name__
parameters["beta"] = BETA
parameters["max_epochs"] = MAX_EPOCHS
parameters["algorithm"] = str(LogisticRegression())
parameters["ngrams"] = str(NGRAMS)
parameters["stemmed"] = False

sent_td_objectid = processor.persist_results(CB_SENT_TD, cv_sent_td_ys_by_tag, cv_sent_td_predictions_by_tag,
                                             parameters, sent_algo)
sent_vd_objectid = processor.persist_results(CB_SENT_VD, cv_sent_vd_ys_by_tag, cv_sent_vd_predictions_by_tag,
                                             parameters, sent_algo)

print(processor.results_to_string(sent_td_objectid, CB_SENT_TD, sent_vd_objectid, CB_SENT_VD, "SENTENCE"))

## TODO
# - Re-train tagging model, adding tags where reg tag is missing but is included in a causer or result tag.
# - Also include explicit in the predicted tags.
# - Need to handle relations where same code -> same code

# -TODO - Neat Ideas
# Inject a random action (uniform distribution) with a specified probability during training also
# Ensures better exploration of the policy space. Initial algo predictions will be random but converges very quickly so this may be lost

# TODO * Need to make sure the tagger tags EXPLICIT tags. These can then be skipped by the parser, but will be included in the features used to train the parser and tagger. Do we want to train a separate tagger that determines if a tagged word is a cause, explicit or result. That will then resolve the direction of the relation?
# TODO - recall is v low on training data. Test it with perfect tagging predictions

# TODO Issues
# 1. Unsupported relations
# 예제 #4 ("Example #4" — separator left over from the source-listing extraction)
# 0      (extraction artifact from the same listing)
# Persist the cause/effect-labelled tagging and sentence results under a
# common suffix, linking each sentence run to the tagger run that fed it.
sent_algo = str(fn_create_sent_cls())

SUFFIX = "_CAUSE_EFFECT_LBLS"
SC_TAGGING_TD = "SC_TAGGING_TD" + SUFFIX
SC_TAGGING_VD = "SC_TAGGING_VD" + SUFFIX
SC_SENT_TD = "SC_SENT_TD" + SUFFIX
SC_SENT_VD = "SC_SENT_VD" + SUFFIX

parameters = dict(config)
parameters["extractors"] = [fn.func_name for fn in extractors]
parameters["min_feat_freq"] = MIN_FEAT_FREQ

# Word-level tagging results (train + validation).
wd_td_objectid = processor.persist_results(
    SC_TAGGING_TD, cv_wd_td_ys_by_tag, cv_wd_td_predictions_by_tag,
    parameters, wd_algo)
wd_vd_objectid = processor.persist_results(
    SC_TAGGING_VD, cv_wd_vd_ys_by_tag, cv_wd_vd_predictions_by_tag,
    parameters, wd_algo)

# Sentence-level results, tagged with the object ids of the tagger runs above.
sent_td_objectid = processor.persist_results(
    SC_SENT_TD, cv_sent_td_ys_by_tag, cv_sent_td_predictions_by_tag,
    parameters, sent_algo, tagger_id=wd_td_objectid)
sent_vd_objectid = processor.persist_results(
    SC_SENT_VD, cv_sent_vd_ys_by_tag, cv_sent_vd_predictions_by_tag,
    parameters, sent_algo, tagger_id=wd_vd_objectid)

print(processor.results_to_string(wd_td_objectid, SC_TAGGING_TD, wd_vd_objectid, SC_TAGGING_VD, "TAGGING"))
print(processor.results_to_string(sent_td_objectid, SC_SENT_TD, sent_vd_objectid, SC_SENT_VD, "SENTENCE"))
logger.info("Results Processed")
"""
# PLAN
#   WORD LEVEL FEATURE EXTRACTION - use functions specific to the individual word, but that can look around at the
#       previous and next words and sentences if needed. This can handle every scenario where I want to leverage features
#       across sentences and at the essay level.
#   MEMOIZE SENTENCE LEVEL FEATS (e.g. deps) -  Will need memoizing when extracting dependency parse features per sentence (as called once for every word in sentence)
#   WORD \ SENTENCE PARTITIONING FOR WORD AND SENTENCE LEVEL TAGGING
#       Need a class that can transform the feature dictionaries (from essay structure form) into training and test data
#       for word tagging and also for sentence classifying. Suggest do k fold cross validation at the essay level.
#   LOAD RESULTS INTO A DB
#TODO Feed into the sentence classifier the number of words tagged with each category, the proportion of words (to control for sentence length variations) and
    also the number of contiguous segments of each in case some codes occur more than once (in separate segments - probably with > 1 word gaps in between)
#TODO Switch to micro and macro-average F1 scores as described in http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.104.8244&rep=rep1&type=pdf, page 6.
parameters["cost_function"] = cost_fn.__name__
parameters["beta"] = BETA
parameters["max_epochs"] = MAX_EPOCHS
parameters["algorithm"] = str(LogisticRegression())
parameters["ngrams"] = str(NGRAMS)
parameters["stemmed"] = False

sent_td_objectid = processor.persist_results(CB_SENT_TD, cv_sent_td_ys_by_tag,
                                             cv_sent_td_predictions_by_tag,
                                             parameters, sent_algo)
sent_vd_objectid = processor.persist_results(CB_SENT_VD, cv_sent_vd_ys_by_tag,
                                             cv_sent_vd_predictions_by_tag,
                                             parameters, sent_algo)

print(
    processor.results_to_string(sent_td_objectid, CB_SENT_TD, sent_vd_objectid,
                                CB_SENT_VD, "SENTENCE"))

## TODO
# - Re-train tagging model, adding tags where reg tag is missing but is included in a causer or result tag.
# - Also include explicit in the predicted tags.
# - Need to handle relations where same code -> same code

# -TODO - Neat Ideas
# Inject a random action (uniform distribution) with a specified probability during training also
# Ensures better exploration of the policy space. Initial algo predictions will be random but converges very quickly so this may be lost

# TODO * Need to make sure the tagger tags EXPLICIT tags. These can then be skipped by the parser, but will be included in the features used to train the parser and tagger. Do we want to train a separate tagger that determines if a tagged word is a cause, explicit or result. That will then resolve the direction of the relation?
# TODO - recall is v low on training data. Test it with perfect tagging predictions

# TODO Issues
# 1. Unsupported relations