Python DataPot 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: datapot

메소드/함수: DataPot

hotexamples.com에서의 예제들: 2

Python DataPot - 2개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 datapot.DataPot에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

예제 #1

파일 보기

파일: job_eval.py 프로젝트: neurale/datapot

from __future__ import print_function

import sys
import bz2
import time
import xgboost as xgb
import pandas as pd
from sklearn.model_selection import cross_val_score
import datapot as dp
from datapot.datasets import load_job_salary

data = load_job_salary()
datapot = dp.DataPot()

t0 = time.time()
datapot.detect(data)
print('detect time:', time.time() - t0)

t0 = time.time()
datapot.fit(data, verbose=True)
print('fit time:', time.time() - t0)

t0 = time.time()
df = datapot.transform(data)
print('transform time:', time.time() - t0)

X = df.drop(['SalaryNormalized', 'Id'], axis=1)
y = pd.qcut(df['SalaryNormalized'].values, q=2, labels=[0, 1]).ravel()

model = xgb.XGBClassifier()
cv_score = cross_val_score(model, X, y, cv=5)

예제 #2

파일 보기

파일: simple_test.py 프로젝트: MokriyYuriy/datapot

from sklearn.model_selection import cross_val_score
import xgboost as xgb

import datapot as dp

dummy_data = [
    '{"name": "Gilbert", "wins": [3, 4, 12], "rating": 32}',
    '{"name": "Alexa", "wins": [1, 2, 5, 7], "rating": 24}',
    '{"name": "May", "wins": [], "rating": 1240}',
    '{"name": "Deloise", "wins": [6, 8, 9, 10, 11], "rating": 25}',
]

# create DataPot instance
data = dp.DataPot()
print(data)

# fit it with data
data.fit(dummy_data)
print(data)
print(data.fields())

# apply transformers
df = data.transform(dummy_data, drop_non_numerical=True)
print(df)

# we are going to predict rating
y = df['rating']
X = df.drop('rating', axis=1)

# evaluate prediction score using xgboost
model = xgb.XGBRegressor()