Python DataFrameConvertColumnToNumeric 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: healthcareai.common.transformers

메소드/함수: DataFrameConvertColumnToNumeric

hotexamples.com에서의 예제들: 2

Python DataFrameConvertColumnToNumeric - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 healthcareai.common.transformers.DataFrameConvertColumnToNumeric에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: data_preparation.py 프로젝트: joshsalvi/healthcareai-py

def full_pipeline(model_type,
                  predicted_column,
                  grain_column,
                  impute=True,
                  verbose=True):
    """
    Builds the data preparation pipeline. Sequentially runs transformers and filters to clean and prepare the data.
    
    Note advanced users may wish to use their own custom pipeline.
    """

    # Note: this could be done more elegantly using FeatureUnions _if_ you are not using pandas dataframes for
    #   inputs of the later pipelines as FeatureUnion intrinsically converts outputs to numpy arrays.
    pipeline = Pipeline([
        ('remove_DTS_columns', hcai_filters.DataframeColumnSuffixFilter()),
        ('remove_grain_column',
         hcai_filters.DataframeColumnRemover(grain_column)),
        # Perform one of two basic imputation methods
        # TODO we need to think about making this optional to solve the problem of rare and very predictive values
        ('imputation',
         hcai_transformers.DataFrameImputer(impute=impute, verbose=verbose)),
        ('null_row_filter',
         hcai_filters.DataframeNullValueFilter(excluded_columns=None)),
        ('convert_target_to_binary',
         hcai_transformers.DataFrameConvertTargetToBinary(
             model_type, predicted_column)),
        ('prediction_to_numeric',
         hcai_transformers.DataFrameConvertColumnToNumeric(predicted_column)),
        ('create_dummy_variables',
         hcai_transformers.DataFrameCreateDummyVariables(
             excluded_columns=[predicted_column])),
    ])
    return pipeline

예제 #2

파일 보기

    def test_integer(self):
        df = pd.DataFrame({
            'binary_category': ['a', 'b', 'a'],
            'numeric': [1, 2, 1],
        })
        expected = pd.DataFrame({
            'binary_category': ['a', 'b', 'a'],
            'numeric': [1, 2, 1],
        })

        result = transformers.DataFrameConvertColumnToNumeric('numeric').fit_transform(df)

        # Sort each because column order matters for equality checks
        expected = expected.sort_index(axis=1)
        result = result.sort_index(axis=1)

        self.assertTrue(result.equals(expected))