Python MohammadDataSet 예제들

프로그래밍 언어: Python

네임스페이스/패키지 이름: ml.datasets.mohammad

클래스/타입: MohammadDataSet

hotexamples.com에서의 예제들: 4

Python MohammadDataSet - 4개의 예제가 발견되었습니다. 이것들은 오픈소스 프로젝트에서 추출된 Python의 ml.datasets.mohammad.MohammadDataSet에 대한 실세계 최고 등급의 예제들입니다. 예제들을 평가하여 예제의 품질 향상에 도움을 줄 수 있습니다.

자주 사용되는 메소드들

보기 숨기기

MohammadDataSet(4)

자주 사용되는 메소드들

MohammadDataSet (4)

예제 #1

파일 보기

파일: testTax.py 프로젝트: yuancz/ExampleDrivenErrorDetection

from ml.datasets.mohammad import MohammadDataSet
from ml.tools.openrefine.OpenRefine import OpenRefine

#one rule for all columns:
# if(contains(value, "x"), "error", value)
# takes 3 mins to execute

data = MohammadDataSet("tax", 20, 30, 10)

tool = OpenRefine(
    "/home/felix/SequentialPatternErrorDetection/OpenRefine/tax/result/tax_o20_r30_p10-csv-with-minus-rule.tsv",
    data=data)

print "Fscore: " + str(tool.calculate_total_fscore())
print "Precision: " + str(tool.calculate_total_precision())
print "Recall: " + str(tool.calculate_total_recall())

for c in range(data.shape[1]):
    print tool.calculate_fscore_by_column(c)

예제 #2

파일 보기

from sets import Set

from ml.datasets.mohammad import MohammadDataSet
from ml.tools.nadeef_repair.FD import FD
from ml.tools.nadeef_repair.NadeefAll import NadeefAll

data = MohammadDataSet("books", 30, 30, 10)

rules = []

#'''
#Mohammad's rule
rules.append(FD(Set(["first_author_varchar"]), "language_varchar"))
#'''

#rules.append(FD(Set(["first_author_varchar", "publish_date_varchar", "rating_varchar"]), "language_varchar"))
rules.append(FD(Set(["isbn13_varchar", "publisher_varchar", "rating_varchar", "title_varchar"]), "first_author_varchar"))
rules.append(FD(Set(["description_varchar", "first_author_varchar", "format_varchar", "title_varchar"]), "isbn13_varchar"))





nadeef = NadeefAll(data, rules)

예제 #3

파일 보기

파일: TestBikes.py 프로젝트: yuancz/ExampleDrivenErrorDetection

from ml.datasets.mohammad import MohammadDataSet
from ml.tools.dboost.TestDBoost import test

data = MohammadDataSet("bikes", 30, 0, 20)

sample_size = 10
steps = 100

test(data, sample_size, steps)

예제 #4

파일 보기

from ml.datasets.mohammad import MohammadDataSet
from ml.tools.dboost.TestDBoost import run_params_gaussian

data = MohammadDataSet("cars", 30, 20, 20)

sample_size = 10
steps = 100

best_params = {}
best_params['gaussian'] = 1.0
best_params['statistical'] = 0.5
run_params_gaussian(data, best_params)