Python LeaveOneSubGroupOut 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: museotoolbox.cross_validation

메소드/함수: LeaveOneSubGroupOut

hotexamples.com에서의 예제들: 2

Python LeaveOneSubGroupOut - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 museotoolbox.cross_validation.LeaveOneSubGroupOut에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: test_cross_validation.py 프로젝트: marclang/MuseoToolBox

    def test_LeaveOneSubGroupOut(self):
        cv = cross_validation.LeaveOneSubGroupOut(verbose=2)
        # if only one subgroup
        tempG = np.copy(g)
        tempG[np.where(y == 5)] = 1
        self.assertRaises(Exception, cv.get_n_splits, X, y, tempG)

        # if all is ok
        cv = cross_validation.LeaveOneSubGroupOut(verbose=2)
        y_vl = np.array([])
        for tr, vl in cv.split(X, y, g):
            y_vl = np.concatenate((y_vl, vl))
            assert (not np.unique(np.in1d([1, 2], [3, 4]))[0])
        assert (np.all(
            np.unique(np.asarray(y_vl), return_counts=True)[1] == 1))

        list_files = cv.save_to_vector(vector,
                                       'Class',
                                       group='uniquefid',
                                       out_vector='/tmp/cv_g.gpkg')

        assert (len(list_files) == cv.get_n_splits(X, y, g))

예제 #2

파일 보기

group = 'uniquefid'
X, y, g = extract_ROI(raster, vector, field, group)
##############################################################################
# Initialize Random-Forest
# ---------------------------

classifier = RandomForestClassifier(random_state=12, n_jobs=1)

##############################################################################
# Create list of different CV
# ---------------------------

CVs = [
    cross_validation.RandomStratifiedKFold(n_splits=2),
    cross_validation.LeavePSubGroupOut(valid_size=0.5),
    cross_validation.LeaveOneSubGroupOut(),
    StratifiedKFold(n_splits=2, shuffle=True)  #from sklearn
]

kappas = []

for cv in CVs:
    SL = SuperLearner(classifier=classifier,
                      param_grid=dict(n_estimators=[50, 100]),
                      n_jobs=1)
    SL.fit(X, y, group=g, cv=cv)
    print('Kappa for ' + str(type(cv).__name__))
    cvKappa = []

    for stats in SL.get_stats_from_cv(confusion_matrix=False, kappa=True):
        print(stats['kappa'])