Python HW3.separate_X_and_y Examples

Programming Language: Python

Namespace/Package Name: CS6140_A_MacLeay.Homeworks

Class/Type: HW3

Method/Function: separate_X_and_y

Examples at hotexamples.com: 2

Python HW3.separate_X_and_y - 2 examples found. These are the top rated real world Python examples of CS6140_A_MacLeay.Homeworks.HW3.separate_X_and_y extracted from open source projects. You can rate examples to help us improve the quality of examples.

Frequently Used Methods

Show Hide

partition_folds(19)

load_and_normalize_spambase(10)

pandas_to_data(10)

transpose_array(7)

get_mus(5)

get_accuracy(4)

get_covar(4)

get_data_and_mus(4)

get_sub_at_value(4)

separate_X_and_y(2)

bins_per_column(1)

set_diag_min(1)

get_std_dev(1)

bins_per_column_by_col(1)

classify_x(1)

classify(1)

univariate_normal(1)

Example #1

Show file

File: hw3.py Project: alliemacleay/MachineLearning_CS6140

def q1():
    """GDA """
    """Run the Gaussian Discriminant Analysis on the spambase data. Use the k-folds from the previous problem (1 for testing, k-1 for training, for each fold)
Since you have 57 real value features, each of the  2gaussians (for + class and for - class) will have a mean  vector with 57 components, and a they will have
either a common (shared) covariance matrix size 57x57. This covariance is estimated from all training data (both classes)
or two separate covariance 57x57 matrices (estimated separately for each class)
(you can use a Matlab or Python of Java built in function to estimated covariance matrices, but the estimator is easy to code up).
Looking at the training and testing performance, does it appear that the gaussian assumption (normal distributed data) holds for this particular dataset?
"""

    spamData = hw3.pandas_to_data(hw3.load_and_normalize_spambase())  # returns an array of arrays - this is by row
    k = 10
    train_acc_sum = 0
    k_folds = hw3.partition_folds(spamData, k)
    gdas = []
    for ki in range(k - 1):
        subset = []
        gda = hw3.GDA()
        X, truth = hw3.separate_X_and_y(k_folds[ki])
        covariance_matrix = hw3.get_covar(X)
        gda.p_y = float(sum(truth)) / len(truth)
        gda.train(X, covariance_matrix, truth)
        predictions = gda.predict(X)
        #print predictions
        accuracy = mystats.get_error(predictions, truth, True)
        #gdas.append(gda)
        print_output(ki, accuracy)
        #print gda.prob
        gdas.append(gda)

Example #2

Show file

File: hw3_tests.py Project: alliemacleay/MachineLearning_CS6140

def test_separate_x_and_y():
    array = get_test_data(5, 3)
    print array
    print hw3.separate_X_and_y(array)