Python add_new_data_to_rows 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: clustering_module

메소드/함수: add_new_data_to_rows

hotexamples.com에서의 예제들: 4

Python add_new_data_to_rows - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 clustering_module.add_new_data_to_rows에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: test_main.py 프로젝트: dent424/clustering_project

 def test_add_new_data_to_row_row(self):
     output, keys = add_new_data_to_rows(self.distances,self.data_list, self.feature_names, self.distance_names)
     np.testing.assert_array_equal(output, self.data_with_distances)
     print output
     self.assertEqual(keys, self.new_feature_names_distance)

예제 #2

파일 보기

파일: test_main.py 프로젝트: dent424/clustering_project

 def test_add_new_data_to_rows_individual(self):
     output, keys = add_new_data_to_rows(self.clusters,self.data_list, self.feature_names, ["E"])        
     np.testing.assert_array_equal(output, self.data_with_clusters)
     self.assertEqual(keys, self.new_feature_names)

예제 #3

파일 보기

파일: test_main.py 프로젝트: dent424/clustering_project

 def test_add_new_data_before(self):
     output, _ = add_new_data_to_rows(self.clusters,self.data_list, self.feature_names, ["E"], "before")
     np.testing.assert_array_equal(output, self.data_with_clusters_before)

예제 #4

-1

파일 보기

파일: clustering_suite.py 프로젝트: dent424/clustering_project

def cluster(final_data_dict, cluster_range, list_or_dict):
    final_data_list= clustering_module.convert_to_list(final_data_dict) 
    respondent_IDs = np.array(map(int, final_data_dict.keys()))
    feature_names = final_data_dict.values()[0].keys()
    final_data_list_imputed = clustering_module.preprocess(final_data_list)
    Scaler = MinMaxScaler()    
    final_data_list_scaled = Scaler.fit_transform(final_data_list_imputed)
    #Transformed is distance of each respondent from each cluster center
    #Predicted is the cluster membership of each respondent
    merging_list = clustering_module.convert_to_list(final_data_dict,remove_NaN=0 )
    data = list(merging_list)
    ignore_set_added = set(['ids'])
    for num_clusters in cluster_range:    
        transformed, predicted, score = clustering_module.clustering(final_data_list_scaled, num_clusters)
        cluster_name = "%s_clusters" % num_clusters
        ignore_set_added.add(cluster_name)    
        data, feature_names = clustering_module.add_new_data_to_rows(predicted, data, feature_names, [cluster_name])
    data, feature_names = clustering_module.add_new_data_to_rows(respondent_IDs, data, feature_names, ["ids"], "before")
    if list_or_dict == "dict":        
        temp = dictionary_conversion.create_dictionary(data, feature_names)    
        num_converted = dictionary_conversion.convert_values_to_int(temp)    
        #Set of features that should be different due to being categorical
        ignore_set_changed = set(['busgrn', 'peopgrn', 'sex', 'race', 'topprob1', 'topprob2'])    
        verdict = compare_respondent_dicts(respondent_IDs, num_converted, final_data_dict, ignore_set_changed, ignore_set_added)
        return num_converted, verdict
    elif list_or_dict == "list":
        return data, feature_names