Example #1
						# random.shuffle(tmplist)

						# TODO: for each feature group, make one plot of all 
						#		the AUCs compared to each other
						# need: for each featurelist:
						#				- a name for it
						#				- the AUC + confidence interval

						for featurelist in itertools.combinations(featuregroup,pr):
						# for featurelist in tmplist[:10]:

							print "Features: \n\t%s"%'\n\t'.join(featurelist)

							m = getattr(am,modelname)(**thispdict)
							e = Experiment(model=m, feature_list=featurelist, 
										   dloader=dload, id=None,nan_handling=cfg['nan_handling'],
										   logFolder=cfg['logFolder'], looplog=cfg['looplog'],
										   summary_only=cfg['summary_only'])

							e.run_experiment()

							# tmp[tuple(featurelist)] = (e.auc, e.auc_train)
							# tmp.loc[rowIdx] = [e.auc,e.auc_train,featurelist]
							# if e.auc>e.auc_train:
							# 	print "THIS NO GOOD"
							# 	print ', '.join(featurelist)
							# 	print "++++++++++++++++++++"
							# rowIdx+=1
							
							# print "Best so far: "
							# bla = max(tmp.items(),key=lambda x: x[1][0])
							# print '\tModel: ', bla[0][0]
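
The TODO in this example asks for one plot per feature group comparing the AUCs of all feature-list combinations, each with a confidence interval. Below is a minimal sketch of such a plot, assuming the loop collects its results as (name, auc, ci_low, ci_high) tuples; the function name and the result format are illustrative assumptions, not part of the original code.

import matplotlib.pyplot as plt

def plot_auc_comparison(results, group_name):
    """Plot the AUC of every feature list in one group, with error bars.

    `results` is assumed to be a list of
    (featurelist_name, auc, ci_low, ci_high) tuples collected in the loop above.
    """
    names = [r[0] for r in results]
    aucs = [r[1] for r in results]
    # asymmetric error bars: distance from the AUC down to ci_low and up to ci_high
    lower_err = [r[1] - r[2] for r in results]
    upper_err = [r[3] - r[1] for r in results]

    fig, ax = plt.subplots(figsize=(8, 0.4 * len(results) + 1))
    ax.errorbar(aucs, range(len(results)), xerr=[lower_err, upper_err], fmt='o')
    ax.set_yticks(range(len(results)))
    ax.set_yticklabels(names)
    ax.set_xlabel('AUC')
    ax.set_title('AUC per feature list: %s' % group_name)
    fig.tight_layout()
    return fig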
Example #2
                        #		the AUCs compared to each other
                        # need: for each featurelist:
                        #				- a name for it
                        #				- the AUC + confidence interval

                        for featurelist in itertools.combinations(
                                featuregroup, pr):
                            # for featurelist in tmplist[:10]:

                            print "Features: \n\t%s" % '\n\t'.join(featurelist)

                            m = getattr(am, modelname)(**thispdict)
                            e = Experiment(model=m,
                                           feature_list=featurelist,
                                           dloader=dload,
                                           id=None,
                                           nan_handling=cfg['nan_handling'],
                                           logFolder=cfg['logFolder'],
                                           looplog=cfg['looplog'],
                                           summary_only=cfg['summary_only'])

                            e.run_experiment()

                            # tmp[tuple(featurelist)] = (e.auc, e.auc_train)
                            # tmp.loc[rowIdx] = [e.auc,e.auc_train,featurelist]
                            # if e.auc>e.auc_train:
                            # 	print "THIS NO GOOD"
                            # 	print ', '.join(featurelist)
                            # 	print "++++++++++++++++++++"
                            # rowIdx+=1

                            # print "Best so far: "
	# go over feature groups
	res = {}
	for featuregroup in cfg['features']:

		model = cfg['model']
		if (len(model.keys()) > 1) or (len(model.values()) > 1):
			raise IOError("A model is not specified correctly.")

		modelname = model.keys()[0]
		paramdict = model.values()[0]

		m = getattr(am,modelname)(**paramdict)

		e = Experiment(model=m, feature_list=featuregroup.values()[0], 
					   dloader=dload, id=None,nan_handling=cfg['nan_handling'],
					   logFolder=cfg['logFolder'], looplog=cfg['looplog'])

		e.apply_postprocessors()
		e.handle_NAs()

		# keep only the columns present in both train and test, i.e. their intersection
		test_cols = set(e.test_rows.columns.values)
		train_cols = set(e.train_rows.columns.values)
		predictor_cols = list(test_cols & train_cols)
		predictor_cols.remove(e.target_col)

		# take out the dataframe we'll be working with
		df = e.train_rows[predictor_cols + [e.target_col]]
		randIdxs = np.random.randint(df.shape[0],size=(df.shape[0],cfg['n_boot']))
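
`randIdxs` holds `cfg['n_boot']` columns of row indices drawn with replacement, the usual setup for a bootstrap. Below is a minimal sketch of how such an index matrix could be turned into a bootstrap confidence interval for the AUC mentioned in the TODO above, assuming per-row scores are available; `bootstrap_auc_ci` and the use of sklearn's `roc_auc_score` are illustrative assumptions, not part of the original code.

import numpy as np
from sklearn.metrics import roc_auc_score

def bootstrap_auc_ci(y_true, y_score, rand_idxs, alpha=0.05):
    """Bootstrap confidence interval for the AUC.

    `rand_idxs` is assumed to have shape (n_samples, n_boot): one column of
    row indices, drawn with replacement, per bootstrap replicate.
    """
    y_true = np.asarray(y_true)
    y_score = np.asarray(y_score)
    aucs = []
    for b in range(rand_idxs.shape[1]):
        idx = rand_idxs[:, b]
        # skip degenerate resamples that contain only one class
        if len(np.unique(y_true[idx])) < 2:
            continue
        aucs.append(roc_auc_score(y_true[idx], y_score[idx]))
    lo, hi = np.percentile(aucs, [100 * alpha / 2.0, 100 * (1 - alpha / 2.0)])
    return lo, hi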