Python BaseEstimator.fit_transform 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: sklearn.base

클래스/타입: BaseEstimator

메소드/함수: fit_transform

hotexamples.com에서의 예제들: 4

Python BaseEstimator.fit_transform - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 sklearn.base.BaseEstimator.fit_transform에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

fit(30)

__init__(30)

predict(30)

set_params(29)

get_params(26)

BaseEstimator(20)

predict_proba(19)

transform(7)

score(5)

inverse_transform(4)

__setstate__(4)

fit_transform(4)

decision_function(2)

__getstate__(2)

is_fitted_(2)

get_feature_names(2)

get_metadata(1)

_more_tags(1)

get_support(1)

__class__(1)

get_feature_importance(1)

partial_fit(1)

__subclasscheck__(1)

_validate_data(1)

save(1)

save_model(1)

features_(1)

detect(1)

simulate_noise(1)

train_(1)

attr(1)

예제 #1

파일 보기

파일: feature_extraction_utils.py 프로젝트: lambdaofgod/mlutil

def get_reduced_embeddings_df(data, embedder: embeddings.EmbeddingVectorizer,
                              reducer: base.BaseEstimator):
    """
    run feature extraction with `embedder` and
    then dimensionality reduction with `reducer`
    """
    data_embeddings = embedder.transform(data)
    reduced_task_embeddings = reducer.fit_transform(data_embeddings)
    return reduced_task_embeddings

예제 #2

파일 보기

파일: preprocessors.py 프로젝트: Mgancita/bowline

    def _impute(self, imputer: BaseEstimator, target: str) -> None:
        """Impute any missing data within 'numeric_features'.

        This method skips imputing the target variable.

        Args:
            imputer (BaseEstimator): Class instance to impute the data. Must have valid
                    'fit_transform' method.
            target (str): Column name for the target variable.

        """
        numeric_features_wo_target = list(
            set(self.numeric_features) - set([target]))
        if numeric_features_wo_target:
            self.processed_data.loc[:,
                                    numeric_features_wo_target] = imputer.fit_transform(
                                        self.processed_data.
                                        loc[:, numeric_features_wo_target])

예제 #3

파일 보기

파일: preprocessors.py 프로젝트: Mgancita/bowline

    def _scale_data(self, scaler: BaseEstimator, target: str,
                    scale_target: bool) -> None:
        """Scale numeric features.

        This method can either be used to scale the target variable or not.

        Args:
            scaler (BaseEstimator): Class instance to scale the data. Must have valid
                    'fit_transform' method.
            target (str): Column name of target variable.
            scale_target (bool): Whether to scale the target variable or not.

        """
        if scale_target:
            features_to_scale = self.numeric_features
        else:
            features_to_scale = list(
                set(self.numeric_features) - set([target]))

        if features_to_scale:
            self.processed_data.loc[:,
                                    features_to_scale] = scaler.fit_transform(
                                        self.processed_data.
                                        loc[:, features_to_scale])

예제 #4

파일 보기

파일: worker.py 프로젝트: Ennosigaeon/meta-learning-base

    def transform_dataset(self, algorithm: BaseEstimator, n_folds: int = 5) -> Tuple[pd.DataFrame, Dict[str, float]]:
        """
        Given a set of fully-qualified hyperparameters, create and not working a algorithm model.
        Returns: Model object and metrics dictionary
        """

        """Load input dataset and class_column"""
        df = self.dataset.load(self.s3_config, self.s3_bucket)
        class_column = self.dataset.class_column

        """Split input dataset in X and y"""
        X, y = df.drop(class_column, axis=1), df[class_column]

        """
        Checks if algorithm (BaseEstimator) is a classifier. 
        
        If True, predict y_pred with the method cross_val_predict. Then calculate the evaluation metrics for the
        algorithm model and return them as a dict. Convert y_pred to pd Series and concatenate X & y_pred.
        
        If False, call fit_transform or fit and then transform on X, y and return the transformed dataset as Dataframe.
        """

        if is_classifier(algorithm):

            """Predict labels with n fold cross validation"""
            y_pred = cross_val_predict(algorithm, X, y, cv=n_folds)

            """Calculate evaluation metrics"""
            accuracy = accuracy_score(y, y_pred)
            precision = precision_score(y, y_pred, average='weighted')
            recall = recall_score(y, y_pred, average='weighted')
            f1 = f1_score(y, y_pred, average='weighted')
            # TODO
            log_loss = logloss(y, y_pred)
            roc_auc = multiclass_roc_auc_score(y, y_pred, average='weighted')

            """Convert np array y_pred to pd series and add it to X"""
            y_pred = pd.Series(y_pred)
            X = pd.concat([X, y_pred], axis=1)
            X.columns = range(X.shape[1])

            return X, {'accuracy': accuracy,
                       'precision': precision,
                       'recall': recall,
                       'f1': f1,
                       'neg_log_loss': log_loss,
                       'roc_auc': roc_auc
                       }
        else:
            """
            If algorithm object has method fit_transform, call fit_transform on X, y. Else, first call fit on X, y,
            then transform on X. Safe the transformed dataset in X
            """
            if hasattr(algorithm, 'fit_transform'):
                X = algorithm.fit_transform(X, y)
            else:
                # noinspection PyUnresolvedReferences
                X = algorithm.fit(X, y).transform(X)

            X = pd.DataFrame(data=X, index=range(X.shape[0]), columns=range(X.shape[1]))

            return X, {}