Python PreprocessingUtils.get_most_frequent примеры использования

Язык программирования: Python

Пространство имен/Пакет: supervised.preprocessing.preprocessing_utils

Класс/Тип: PreprocessingUtils

Метод/Функция: get_most_frequent

Примеров на hotexamples.com: 2

Python PreprocessingUtils.get_most_frequent - 2 примера найдено. Это лучшие примеры Python кода для supervised.preprocessing.preprocessing_utils.PreprocessingUtils.get_most_frequent, полученные из open source проектов. Вы можете ставить оценку каждому примеру, чтобы помочь нам улучшить качество примеров.

Основные методы

Показать Скрыть

is_categorical(8)

get_type(8)

is_0_1(3)

is_log_scale_needed(3)

is_na(3)

is_scale_needed(3)

get_mean(2)

get_median(2)

get_min(2)

get_most_frequent(2)

is_datetime(1)

is_text(1)

num_class(1)

Пример #1

Показать файл

Файл: test_preprocessing_utils.py Проект: zhweiliu/mljar-supervised

    def test_get_stats(self):
        tmp = np.array([1, np.nan, 2, 3, np.nan, np.nan])
        self.assertEqual(1, PreprocessingUtils.get_min(tmp))
        self.assertEqual(2, PreprocessingUtils.get_mean(tmp))
        self.assertEqual(2, PreprocessingUtils.get_median(tmp))
        d = {"col1": [1, 2, 1, 3, 1, np.nan], "col2": ["a", np.nan, "b", "a", "c", "a"]}
        df = pd.DataFrame(data=d)
        self.assertEqual(1, PreprocessingUtils.get_min(df["col1"]))
        self.assertEqual(8.0 / 5.0, PreprocessingUtils.get_mean(df["col1"]))
        self.assertEqual(1, PreprocessingUtils.get_median(df["col1"]))

        self.assertEqual(1, PreprocessingUtils.get_most_frequent(df["col1"]))
        self.assertEqual("a", PreprocessingUtils.get_most_frequent(df["col2"]))

Пример #2

Показать файл

Файл: preprocessing_missing.py Проект: tinomaxthayil/mljar-supervised

    def _get_fill_value(self, x):
        # categorical type
        if PreprocessingUtils.get_type(x) == PreprocessingUtils.CATEGORICAL:
            if self._na_fill_method == PreprocessingMissingValues.FILL_NA_MIN:
                return (
                    PreprocessingMissingValues.MISSING_VALUE
                )  # add new categorical value
            return PreprocessingUtils.get_most_frequent(x)

        if PreprocessingUtils.get_type(x) == PreprocessingUtils.DATETIME:
            return PreprocessingUtils.get_most_frequent(x)

        # numerical type
        if self._na_fill_method == PreprocessingMissingValues.FILL_NA_MIN:
            return PreprocessingUtils.get_min(x) - 1.0
        if self._na_fill_method == PreprocessingMissingValues.FILL_NA_MEAN:
            return PreprocessingUtils.get_mean(x)
        return PreprocessingUtils.get_median(x)