import os

from pyspark.sql.functions import monotonically_increasing_id

# config, Hdfs2Df, PreProcessor, SavaTools, XGBoostClassifier and Evaluator are
# project modules assumed to be imported elsewhere in this file.


def train(spark):
    # Optionally clear the XGBoost checkpoint directory on HDFS before training.
    # os.system returns the shell exit status, so 0 means the delete succeeded.
    if config['XGBOOST']['checkpointInitialization'] == 'true':
        checkpoint_path = config['XGBOOST']['checkpoint_path']
        op = os.system("hadoop fs -rmr %s/*" % checkpoint_path)
        if not op:
            print("initialize checkpoint successfully.")

    # Load the train and test sets from HDFS as Spark DataFrames.
    train_df = Hdfs2Df.readHdfsCsv(spark=spark, data_path=config['TRAIN']['train_path'])
    test_df = Hdfs2Df.readHdfsCsv(spark=spark, data_path=config['TRAIN']['test_path'])

    # Cast column types and apply the configured missing-value marker.
    missing = config['XGBOOST']['missing']
    train_df = PreProcessor.transColType(train_df, missing)
    test_df = PreProcessor.transColType(test_df, missing)

    # Assemble the raw columns into a single 'features' vector column.
    train, train_col = PreProcessor.transVector(train_df, 'features')
    test, test_col = PreProcessor.transVector(test_df, 'features')  # was train_df: the test set must be vectorized, not the train set again
    SavaTools.saveModelFeature(train_col, config['TRAIN']['local_model_feature_path'])

    # Train the model, persist it to HDFS, then score both sets.
    xgb_handle = XGBoostClassifier(config['XGBOOST'])
    xgbModel = xgb_handle.trainAndSave(spark, train, config['TRAIN']['hdfs_model_path'])
    train_res, train_auc = xgb_handle.predict(spark, train, xgbModel)
    test_res, test_auc = xgb_handle.predict(spark, test, xgbModel)
    train_res.cache()
    test_res.cache()

    # Evaluate KS and AUC on the scored train and test sets.
    evaluator_handle = Evaluator(spark)
    train_ks = evaluator_handle.evaluateKs(train_res, 'train_res', 'score')
    train_auc = evaluator_handle.evaluateAuc(train_res, "score")
    test_ks = evaluator_handle.evaluateKs(test_res, 'test_ks', 'score')
    test_auc = evaluator_handle.evaluateAuc(test_res, "score")

    # Save feature importances together with the evaluation metrics.
    fscore = xgbModel.booster.getFeatureScore()
    xgb_handle.saveFeatureImportance(
        train_col, fscore, config['TRAIN']['local_model_feature_weights_path'],
        train_auc, test_auc, train_ks, test_ks)

    # Persist the scored DataFrames back to HDFS.
    SavaTools.saveHdfsFile(train_res, config['TRAIN']['train_res_path'])
    SavaTools.saveHdfsFile(test_res, config['TRAIN']['test_res_path'])  # was train_res: the test scores belong in the test output path
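# --- Illustrative sketch, not part of the original pipeline ---
# PreProcessor.transVector is assumed to wrap Spark ML's VectorAssembler and to
# return both the transformed DataFrame and the list of feature column names
# (which is why its result is unpacked as a tuple above). A minimal standalone
# version, under that assumption, could look like this; the function name and
# the excluded identity/label columns are hypothetical.
from pyspark.ml.feature import VectorAssembler


def trans_vector_sketch(df, output_col='features',
                        exclude=('label', 'name', 'idcard', 'phone')):
    # Treat every column not listed in `exclude` as a numeric feature.
    feature_cols = [c for c in df.columns if c not in exclude]
    assembler = VectorAssembler(inputCols=feature_cols, outputCol=output_col)
    return assembler.transform(df), feature_cols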
# Method of XGBoostClassifier (the class definition is omitted in this excerpt).
def predict(self, spark, tmp, xgb):
    # transVector returns (DataFrame, feature columns); only the DataFrame is needed here.
    data, _ = PreProcessor.transVector(tmp, 'features')

    # Score with the trained model; keep the positive-class probability and the label.
    predictions = xgb.predict(data, -999) \
        .map(lambda row: (row['predictions'][1], row['label']))
    predictions = predictions.toDF("score", "label")

    # Re-attach the identity columns by joining on a monotonically increasing index.
    right = predictions.withColumn("idx", monotonically_increasing_id())
    left = tmp.select(['name', 'idcard', 'phone']).withColumn("idx", monotonically_increasing_id())
    res_df = left.join(right, ['idx'], 'inner').drop('idx')

    # Pass the score column explicitly, matching the evaluateAuc usage in train().
    evaluator_handle = Evaluator(spark)
    auc = evaluator_handle.evaluateAuc(res_df, "score")
    print("AUC: ", auc)
    return res_df, auc
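# --- Illustrative entry point, a sketch rather than code from the original source ---
# One way train() might be wired up: `config` is assumed to be loaded elsewhere
# in the module (e.g. via configparser), and the application name is hypothetical.
if __name__ == '__main__':
    from pyspark.sql import SparkSession

    spark = SparkSession.builder \
        .appName("xgboost-train") \
        .getOrCreate()
    train(spark)
    spark.stop()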