Python NumericStatsMixin Examples

Programming language: Python

Namespace/package name: dataprofiler.profilers

Class/type: NumericStatsMixin

Examples at hotexamples.com: 6

Python NumericStatsMixin - 6 examples found. These are top-rated real-world Python examples of dataprofiler.profilers.NumericStatsMixin, extracted from open source projects.

Frequently used methods

NumericStatsMixin(2)

__init__(1)

_get_percentile(1)

is_float(1)

is_int(1)

match_count(1)

report(1)

times(1)

Example #1

File: test_numeric_stats_mixin_profile.py Project: bballamudi/DataProfiler

    def test_check_int(self):
        """
        Checks if number is integer.
        :return:
        """
        true_asserts = [
            1,
            1345,
            -13,
            0,
            -0,  # numeric values
            "1"  # strings
        ]
        for assert_val in true_asserts:
            self.assertTrue(NumericStatsMixin.is_int(assert_val))

        false_asserts = [
            1.3,  # float
            float("nan"),
            np.nan,  # nan value
            "nan",
            "1a",
            "abc",
            "",
            "1.3"  # strings
        ]
        for assert_val in false_asserts:
            self.assertFalse(NumericStatsMixin.is_int(assert_val))
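
The accept/reject lists in this test can be reproduced with a small standalone check. The following is a minimal sketch, not the library's implementation: `looks_like_int` is a hypothetical helper that assumes a value counts as an integer when `int()` and `float()` both accept it and agree on the result.

```python
def looks_like_int(value):
    """Hypothetical helper: True when value converts cleanly to an integer."""
    try:
        # int("1.3"), int("abc"), int("") and int(float("nan")) raise ValueError;
        # int(float("inf")) raises OverflowError.
        return int(value) == float(value)
    except (TypeError, ValueError, OverflowError):
        return False


accepted = [1, 1345, -13, 0, "1"]
rejected = [1.3, float("nan"), "nan", "1a", "abc", "", "1.3"]
print(all(looks_like_int(v) for v in accepted))
print(any(looks_like_int(v) for v in rejected))
```

Comparing the two conversions is what rejects `1.3`: `int(1.3)` truncates to `1`, which no longer equals the float value.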

Example #2

File: test_numeric_stats_mixin_profile.py Project: bballamudi/DataProfiler

    def test_base(self):

        # validate requires NumericalOptions
        with self.assertRaisesRegex(
                ValueError, "NumericalStatsMixin parameter 'options' "
                "must be of type NumericalOptions."):
            profile = NumericStatsMixin(options='bad options')

        try:
            # validate doesn't fail
            profile = NumericStatsMixin()
            profile = NumericStatsMixin(NumericalOptions())
        except Exception as e:
            self.fail(e)
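
The behavior under test is a constructor that rejects an `options` argument of the wrong type while accepting either no options or a proper options object. A minimal sketch of that pattern follows; `SketchOptions` and `SketchProfile` are hypothetical stand-ins, not DataProfiler classes, and the error text is only modeled on the message in the test.

```python
class SketchOptions:
    """Hypothetical stand-in for an options object."""

    def __init__(self, enabled=True):
        self.enabled = enabled


class SketchProfile:
    """Hypothetical profile that validates its options in the constructor."""

    def __init__(self, options=None):
        # Accept None (use defaults) or a SketchOptions instance; reject the rest.
        if options is not None and not isinstance(options, SketchOptions):
            raise ValueError(
                "SketchProfile parameter 'options' must be of type SketchOptions."
            )
        self.options = options if options is not None else SketchOptions()


SketchProfile()                 # defaults are fine
SketchProfile(SketchOptions())  # explicit options are fine
try:
    SketchProfile(options="bad options")
except ValueError as err:
    print(err)
```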

Example #3

File: test_numeric_stats_mixin_profile.py Project: capitalone/DataProfiler

    def test_get_percentile_median(self):
        num_profiler = TestColumn()
        # Dummy histogram for calculating percentiles
        num_profiler._stored_histogram = {
            "histogram": {
                "bin_counts": np.array([1, 2, 0, 2, 1]),
                "bin_edges": np.array([0.0, 4.0, 8.0, 12.0, 16.0, 20.0]),
            }
        }
        median = NumericStatsMixin._get_percentile(num_profiler,
                                                   percentiles=[50, 50])
        self.assertListEqual([10, 10], median)
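
To see why the expected median is 10 for this histogram, the percentile can be estimated by hand from the cumulative bin counts. The sketch below is an illustration under stated assumptions, not the library's `_get_percentile`: `percentile_from_histogram` is a hypothetical helper that treats the counts as uniformly spread inside each bin and, when the cumulative distribution is flat at the requested level (here across the empty bin from 8 to 12), returns the midpoint of that flat stretch.

```python
from bisect import bisect_left, bisect_right
from itertools import accumulate


def percentile_from_histogram(bin_counts, bin_edges, percentiles):
    """Hypothetical helper: estimate percentiles from histogram bins."""
    total = float(sum(bin_counts))
    # Cumulative distribution evaluated at every bin edge, starting at 0.
    cdf = [0.0] + [c / total for c in accumulate(bin_counts)]

    def interpolate(idx, q):
        # Linear interpolation inside the bin between edges idx-1 and idx.
        frac = (q - cdf[idx - 1]) / (cdf[idx] - cdf[idx - 1])
        return bin_edges[idx - 1] + frac * (bin_edges[idx] - bin_edges[idx - 1])

    results = []
    for p in percentiles:
        q = p / 100.0
        lo_idx = bisect_left(cdf, q)   # first edge whose CDF is >= q
        hi_idx = bisect_right(cdf, q)  # first edge whose CDF is > q
        lower = bin_edges[0] if lo_idx == 0 else interpolate(lo_idx, q)
        upper = bin_edges[-1] if hi_idx == len(cdf) else interpolate(hi_idx, q)
        results.append(float((lower + upper) / 2.0))
    return results


print(percentile_from_histogram(
    [1, 2, 0, 2, 1], [0.0, 4.0, 8.0, 12.0, 16.0, 20.0], [50, 50]))
# [10.0, 10.0]
```

Half of the six counts lie below 8 and half lie above 12, so every value between 8 and 12 is a valid median; taking the midpoint gives 10.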

Example #4

File: test_numeric_stats_mixin_profile.py Project: capitalone/DataProfiler

    def test_report(self):
        options = NumericalOptions()
        options.max.is_enabled = False
        options.min.is_enabled = False
        options.histogram_and_quantiles.is_enabled = False
        options.variance.is_enabled = False

        num_profiler = NumericStatsMixin(options=options)

        num_profiler.match_count = 0
        num_profiler.times = defaultdict(float)

        report = num_profiler.report(remove_disabled_flag=True)
        report_keys = list(report.keys())

        for disabled_key in [
                "max", "min", "variance", "histogram", "quantiles"
        ]:
            self.assertNotIn(disabled_key, report_keys)

        # test report default `remove_disabled_flag`
        # value and no NumericalOptions
        report = num_profiler.report()
        report_keys = list(report.keys())

        for disabled_key in [
                "max", "min", "variance", "histogram", "quantiles"
        ]:
            self.assertIn(disabled_key, report_keys)
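
The test exercises two modes of `report`: with `remove_disabled_flag=True` the keys of disabled statistics are dropped, and by default they are kept. A rough sketch of that filtering idea is shown below. `build_report` and `OPTION_TO_REPORT_KEYS` are hypothetical names chosen for illustration, and the mapping of one option to several report keys is an assumption inferred from the test, where disabling `histogram_and_quantiles` removes both `histogram` and `quantiles`.

```python
# Hypothetical mapping from an option name to the report keys it controls.
OPTION_TO_REPORT_KEYS = {
    "max": ("max",),
    "min": ("min",),
    "variance": ("variance",),
    "histogram_and_quantiles": ("histogram", "quantiles"),
}


def build_report(stats, enabled_options, remove_disabled_flag=False):
    """Hypothetical helper: optionally drop keys of disabled statistics."""
    report = dict(stats)
    if not remove_disabled_flag:
        return report  # default: keep every key, enabled or not
    for option_name, report_keys in OPTION_TO_REPORT_KEYS.items():
        if not enabled_options.get(option_name, True):
            for key in report_keys:
                report.pop(key, None)
    return report


stats = {"max": None, "min": None, "variance": None,
         "histogram": None, "quantiles": None, "sum": 0}
enabled = {name: False for name in OPTION_TO_REPORT_KEYS}
print(sorted(build_report(stats, enabled, remove_disabled_flag=True)))
print(sorted(build_report(stats, enabled)))
```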

Example #5

File: test_numeric_stats_mixin_profile.py Project: bballamudi/DataProfiler

    def test_check_float(self):
        """
        Checks if number is float.
        :return:
        """
        true_asserts = [
            1.3,
            1.345,
            -1.3,
            0.03,
            0.0,
            -0.0,
            1,  # numeric values
            float("nan"),
            np.nan,  # nan values
            "1.3",
            "nan"  # strings
        ]
        for assert_val in true_asserts:
            self.assertTrue(NumericStatsMixin.is_float(assert_val))

        false_asserts = ["1.3a", "abc", "", "1.23.45"]
        for assert_val in false_asserts:
            self.assertFalse(NumericStatsMixin.is_float(assert_val))
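
Unlike the integer check, this test accepts NaN values and plain integers as floats. A minimal sketch consistent with those lists is given below; `looks_like_float` is a hypothetical helper, not the library's implementation, and it assumes a value counts as a float whenever the built-in `float()` accepts it.

```python
def looks_like_float(value):
    """Hypothetical helper: True when float() can parse the value."""
    try:
        float(value)  # accepts 1, 1.3, "1.3", "nan"; rejects "1.3a", "", "1.23.45"
        return True
    except (TypeError, ValueError):
        return False


accepted = [1.3, -0.0, 1, float("nan"), "1.3", "nan"]
rejected = ["1.3a", "abc", "", "1.23.45"]
print(all(looks_like_float(v) for v in accepted))
print(any(looks_like_float(v) for v in rejected))
```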

Example #6

File: test_numeric_stats_mixin_profile.py Project: bballamudi/DataProfiler

    def __init__(self):
        NumericStatsMixin.__init__(self)
        self.match_count = 0
        self.times = defaultdict(float)