Python DataFrame.as_gpu_matrix Examples

Programming language: Python

Namespace/package name: cudf

Class/type: DataFrame

Method/function: as_gpu_matrix

Code examples on hotexamples.com: 1

Python DataFrame.as_gpu_matrix - 1 code example found. These are the top-rated real-world Python examples of cudf.DataFrame.as_gpu_matrix, all extracted from open source projects. Rating the examples helps surface higher-quality ones.


Example #1

File: encoders.py Project: teju85/cuml

    def inverse_transform(self, X):
        """
        Convert the data back to the original representation.
        In case unknown categories are encountered (all zeros in the
        one-hot encoding), ``None`` is used to represent this category.

        The return type is the same as the type of the input used by the first
        call to fit on this estimator instance.

        Parameters
        ----------
        X : array-like or sparse matrix, shape [n_samples, n_encoded_features]
            The transformed data.

        Returns
        -------
        X_tr : cudf.DataFrame or cupy.ndarray
            Inverse transformed array.
        """
        self._check_is_fitted()
        if cp.sparse.issparse(X):
            # cupy.sparse 7.x does not support argmax. When we upgrade cupy to
            # 8.x, we should add a condition to the if clause,
            # `and not cp.sparse.issparsecsc(X)`,
            # and replace the following line with `X = X.tocsc()`
            X = X.toarray()
        result = DataFrame(columns=self._encoders.keys())
        j = 0
        for feature in self._encoders.keys():
            feature_enc = self._encoders[feature]
            cats = feature_enc.classes_

            if self.drop is not None:
                # Remove dropped categories
                dropped_class_idx = Series(self.drop_idx_[feature])
                dropped_class_mask = Series(cats).isin(cats[dropped_class_idx])
                if len(cats) == 1:
                    inv = Series(GenericIndex(cats[0]).repeat(X.shape[0]))
                    result[feature] = inv
                    continue
                cats = cats[~dropped_class_mask]

            enc_size = len(cats)
            x_feature = X[:, j:j + enc_size]
            idx = cp.argmax(x_feature, axis=1)
            inv = Series(cats.iloc[idx]).reset_index(drop=True)

            if self.handle_unknown == 'ignore':
                not_null_idx = x_feature.any(axis=1)
                inv.iloc[~not_null_idx] = None
            elif self.drop is not None:
                # drop will either be None or handle_unknown will be error. If
                # self.drop is not None, then we can safely assume that all of
                # the nulls in each column are the dropped value
                dropped_mask = cp.asarray(x_feature.sum(axis=1) == 0).flatten()
                if dropped_mask.any():
                    inv[dropped_mask] = feature_enc.inverse_transform(
                        Series(self.drop_idx_[feature]))[0]

            result[feature] = inv
            j += enc_size
        if self.input_type == 'array':
            try:
                result = cp.asarray(result.as_gpu_matrix())
            except ValueError:
                warnings.warn("The input one hot encoding contains rows with "
                              "unknown categories. Arrays do not support null "
                              "values. Returning output as a DataFrame "
                              "instead.")
        return result
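
The core of the method above is the per-feature decoding step: slice out each feature's one-hot block, take the argmax along each row to recover the category position, and mark all-zero rows as unknown. The following is a minimal CPU sketch of that step using NumPy as a stand-in for cupy and cudf, so it runs without a GPU. The helper name `decode_one_hot` and its parameters are hypothetical and are not part of cuml or cudf; the `drop` handling from the original is left out.

```python
import numpy as np


def decode_one_hot(X, categories, handle_unknown="ignore"):
    """Hypothetical helper: map one-hot blocks back to category labels.

    X is a dense 2-D array whose columns are the concatenated one-hot
    blocks of each feature; categories is one list of labels per feature.
    """
    X = np.asarray(X)
    columns = []
    j = 0  # column offset of the current feature's one-hot block
    for cats in categories:
        cats = np.asarray(cats, dtype=object)
        block = X[:, j:j + len(cats)]
        # argmax finds the position of the 1 in each row of the block
        inv = cats[np.argmax(block, axis=1)]
        if handle_unknown == "ignore":
            # an all-zero row means the category was unknown when encoded
            inv[~block.any(axis=1)] = None
        columns.append(inv)
        j += len(cats)
    # one output column per original feature
    return np.stack(columns, axis=1)


# Two features: the first has 3 categories, the second has 2.
X = np.array([
    [1, 0, 0, 1, 0],
    [0, 1, 0, 0, 1],
    [0, 0, 0, 1, 0],  # first feature is all zeros: unknown category
])
decoded = decode_one_hot(X, [["a", "b", "c"], ["x", "y"]])
print(decoded)
```

In the third row the first feature's block is all zeros, so it decodes to `None`. Null entries like this are what the original method guards against when it wraps the `as_gpu_matrix()` call in `try`/`except ValueError` and returns the DataFrame instead of an array.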