Python FunctionLib.corr_feats Examples

Programming Language: Python

Namespace/Package Name: Model

Class/Type: FunctionLib

Method/Function: corr_feats

Examples at hotexamples.com: 2

Python FunctionLib.corr_feats - 2 examples found. These are the top rated real world Python examples of Model.FunctionLib.corr_feats extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

get_params(8)

distinct_feats(7)

change_type(7)

get_missing_value_feats(6)

ScoreDataFrame(3)

get_aggregate_features_num(3)

get_model_performance(3)

TurkyOutliers(2)

impute_knn_classifier(2)

GetScaledModel(2)

get_rowcnt_most_missing_val(2)

GetBasedModel(2)

cv_score(2)

corr_feats(2)

GetScaledModelwithfactorizedCW(2)

plot_bar(2)

missing_val_perc(2)

impute_values(2)

log_transform(2)

PlotBoxR(2)

match_strings(1)

hist_perc(1)

hist_compare(1)

get_unique_val_list(1)

plot_stats(1)

min_len_col(1)

AdaBoostClassifier(1)

get_corr(1)

feature_stats(1)

default_ratio(1)

cv_metrics(1)

concat_model_score(1)

RandomSearch(1)

RandomForestClassifier(1)

LogisticRegression(1)

KNeighborsClassifier(1)

GridSearch(1)

GradientBoostingClassifier(1)

GetScaledModelwithbestparams(1)

train_test_split(1)

Example #1

Show file

    def create_dataset_remove_corr_feats(self, target_var, filter_val,
                                         corr_threshold, feats_ignore):
        df = self.df.copy()
        x_df_dum = pd.get_dummies(df)
        x_df_Default_dum = x_df_dum[x_df_dum[target_var] == filter_val]

        x_df_dum.columns = x_df_dum.columns.map(f.remove_space)
        x_df_Default_dum.columns = x_df_Default_dum.columns.map(f.remove_space)

        _corr_threshold = corr_threshold
        get_highly_corr_feats = f.corr_feats(x_df_dum, x_df_dum.columns,
                                             _corr_threshold)

        get_highly_corr_feats = pd.DataFrame(get_highly_corr_feats)
        print('Highly correlated features description more than pearsonsr',
              _corr_threshold)

        corr_lst = []
        for i in range(len(get_highly_corr_feats.index) - 1):

            lst_feat = get_highly_corr_feats.iloc[i, 0]
            lst_corr_feat = get_highly_corr_feats.iloc[i, 1]

            for j in range(len(lst_corr_feat)):
                _str = f.match_strings(lst_feat, lst_corr_feat[j])
                if len(_str) > f.min_len_col(df.drop(df[feats_ignore],
                                                     axis=1)):
                    corr_lst.append(lst_corr_feat[j])

        corr_lst = pd.DataFrame(corr_lst)[0].unique().tolist()
        print(corr_lst)
        _train_drop_cols_df = x_df_dum.copy()
        _train_drop_cols_df.drop(_train_drop_cols_df[corr_lst],
                                 axis=1,
                                 inplace=True)
        self.dim_red_by_corr_df = _train_drop_cols_df.copy()

Example #2

Show file

File: EDA.py Project: rkparyani/KAGGLE---Home-Credit-Default-Risk

x_df_Default_dum = x_df_dum[x_df_dum['TARGET']==1]


# In[11]:


# General correlations wrt Correlations in case of default.
x_corr_default = x_df_Default_dum.corr()
x_corr = x_df_dum.corr()


# In[12]:


corr_threshold = 0.6
get_highly_corr_feats = f.corr_feats (x_df_dum,x_df_dum.columns,corr_threshold)
get_highly_corr_feats = pd.DataFrame(get_highly_corr_feats)
print('Highly correlated features description more than pearsonsr',corr_threshold)
get_highly_corr_feats


# ##### EXPLORATORY DATA ANALYSIS

# ##### TARGET

# In[13]:


# Corr
val= x_corr['TARGET'].sort_values(ascending=False)*100
val = val[val.where(val>5)>0]