Python predictions 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: excercises.ps03.ps03_04

메소드/함수: predictions

hotexamples.com에서의 예제들: 2

Python predictions - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 excercises.ps03.ps03_04.predictions에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: ps03_05.py 프로젝트: darshandpatel/Analyzing-the-New-York-Subway-Dataset

    per hour for our data, why don't you make a histogram of the residuals
    (that is, the difference between the original hourly entry data and the predicted values).
    Try different binwidths for your histogram.

    Based on this residual histogram, do you have any insight into how our model
    performed?  Reading a bit on this webpage might be useful:

    http://www.itl.nist.gov/div898/handbook/pri/section2/pri24.htm
    """

    plt.figure()
    (turnstile_weather["ENTRIESn_hourly"] - predictions).plot(kind="hist", bins=10)
    plt.title("Histogram of Residuals")
    plt.ylabel("Frequency")
    plt.xlabel("Prediction Error")
    plt.show()

    """
    # QQ Plot
    z = (turnstile_weather['ENTRIESn_hourly'] - np.mean(turnstile_weather['ENTRIESn_hourly']))/np.std(turnstile_weather['ENTRIESn_hourly'])
    stats.probplot(z,dist="norm",plot=plt)
    plt.show()
    """


if __name__ == "__main__":
    file_path = "../data/turnstile_weather_v2.csv"
    file_pointer = open(file_path)
    turnstile_weather = pandas.read_csv(file_pointer)
    plot_residuals(turnstile_weather, predictions(turnstile_weather))

예제 #2

파일 보기

파일: ps03_07.py 프로젝트: darshandpatel/Analyzing-the-New-York-Subway-Dataset

def compute_r_squared(data, predictions):
    '''
    In exercise 5, we calculated the R^2 value for you. But why don't you try and
    and calculate the R^2 value yourself.
    
    Given a list of original data points, and also a list of predicted data points,
    write a function that will compute and return the coefficient of determination (R^2)
    for this data.  numpy.mean() and numpy.sum() might both be useful here, but
    not necessary.

    Documentation about numpy.mean() and numpy.sum() below:
    http://docs.scipy.org/doc/numpy/reference/generated/numpy.mean.html
    http://docs.scipy.org/doc/numpy/reference/generated/numpy.sum.html
    '''
    
    # your code here
    data_avg = np.mean(data)
    partial_denominator = data - data_avg
    denominator = np.sum(partial_denominator*partial_denominator)
    partial_numerator = data - predictions
    numerator = np.sum(partial_numerator*partial_numerator)
    r_squared = 1 - float(numerator/denominator)
    return r_squared

if __name__ == '__main__':
    file_path = "../data/turnstile_weather_v2.csv"
    file_pointer = open(file_path)
    turnstile_weather = pandas.read_csv(file_pointer)
    print(compute_r_squared(turnstile_weather['ENTRIESn_hourly'],predictions(turnstile_weather)))