Python SparkDataSet.exists Examples

Programming language: Python

Namespace/package name: kedro.contrib.io.pyspark

Class/type: SparkDataSet

Method/function: exists

Examples at hotexamples.com: 4

Python SparkDataSet.exists - 4 examples found. These are the top-rated real-world Python examples of kedro.contrib.io.pyspark.SparkDataSet.exists extracted from open source projects.

Example #1

    def test_exists(self, file_format, tmp_path, sample_spark_df):
        filepath = str(tmp_path / "test_data")
        spark_data_set = SparkDataSet(filepath=filepath, file_format=file_format)

        assert not spark_data_set.exists()

        spark_data_set.save(sample_spark_df)
        assert spark_data_set.exists()

Example #2

File: test_spark_data_set.py Project: zulyang/kedro

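import tempfile
from os.path import join

from kedro.contrib.io.pyspark import SparkDataSet


# `file_format` is supplied by pytest (fixture or parametrization not shown here);
# `_get_sample_spark_data_frame` is a helper from the original test module.
def test_exists(file_format):
    with tempfile.TemporaryDirectory() as temp_dir:
        temp_path = join(temp_dir, "test_data")
        spark_data_set = SparkDataSet(filepath=temp_path,
                                      file_format=file_format)
        spark_df = _get_sample_spark_data_frame().coalesce(1)

        # Nothing has been written yet, so the data set must not exist.
        assert not spark_data_set.exists()

        # After saving, the same path must be reported as existing.
        spark_data_set.save(spark_df)
        assert spark_data_set.exists()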
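
Examples #1 and #2 check the same contract: exists() is false before the first save and true afterwards. The sketch below reproduces that round trip without Spark or kedro, for readers who want to run it directly. `LocalTextDataSet` and `check_round_trip` are hypothetical names made up for this illustration; they are not part of kedro, and the real SparkDataSet checks existence through Spark rather than through `os.path`.

```python
import os
import tempfile


class LocalTextDataSet:
    """Hypothetical stand-in for a data set; NOT part of kedro."""

    def __init__(self, filepath):
        self._filepath = filepath

    def exists(self):
        # Assumption: existence is a plain filesystem check in this sketch.
        return os.path.exists(self._filepath)

    def save(self, text):
        with open(self._filepath, "w", encoding="utf-8") as handle:
            handle.write(text)


def check_round_trip():
    """Return (exists before save, exists after save)."""
    with tempfile.TemporaryDirectory() as temp_dir:
        data_set = LocalTextDataSet(os.path.join(temp_dir, "test_data"))
        before = data_set.exists()
        data_set.save("a,b\n1,2\n")
        after = data_set.exists()
    return before, after
```

Calling `check_round_trip()` runs the same before/after assertion pattern as the tests above, using a temporary directory that is cleaned up automatically.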

Example #3

File: test_spark_data_set.py Project: sylinuxhy/kedro

    def test_exists_raises_error(self, mocker):
        # exists should raise all errors except for
        # AnalysisExceptions clearly indicating a missing file
        spark_data_set = SparkDataSet(filepath="")
        mocker.patch.object(
            spark_data_set,
            "_get_spark",
            side_effect=AnalysisException("Other Exception", []),
        )

        with pytest.raises(DataSetError, match="Other Exception"):
            spark_data_set.exists()

Example #4

File: test_spark_data_set.py Project: zulyang/kedro

import pytest
from pyspark.sql.utils import AnalysisException

from kedro.contrib.io.pyspark import SparkDataSet
from kedro.io import DataSetError


def test_exists_raises_error(monkeypatch):
    # exists should raise all errors except for
    # AnalysisExceptions clearly indicating a missing file
    def faulty_get_spark():
        raise AnalysisException("Other Exception", [])

    spark_data_set = SparkDataSet(filepath="")
    monkeypatch.setattr(spark_data_set, "_get_spark", faulty_get_spark)

    with pytest.raises(DataSetError) as error:
        spark_data_set.exists()
    assert "Other Exception" in str(error.value)
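
The comment in Examples #3 and #4 describes the rule being tested: only an AnalysisException that clearly indicates a missing file means "does not exist"; every other error is raised. The sketch below shows one way that rule could be written. It is an illustration under stated assumptions, not kedro's actual implementation: `AnalysisException` and `DataSetError` are local stand-in classes, `SketchDataSet` and its `loader` argument are hypothetical, and the "Path does not exist" prefix is an assumed message format.

```python
class AnalysisException(Exception):
    """Stand-in for pyspark.sql.utils.AnalysisException (assumed to expose .desc)."""

    def __init__(self, desc, stack_trace=None):
        super().__init__(desc)
        self.desc = desc
        self.stack_trace = stack_trace


class DataSetError(Exception):
    """Stand-in for kedro.io.DataSetError."""


class SketchDataSet:
    """Hypothetical data set; `loader` is any callable that reads a path or raises."""

    def __init__(self, filepath, loader):
        self._filepath = filepath
        self._loader = loader

    def exists(self):
        try:
            self._loader(self._filepath)
        except AnalysisException as exc:
            # Assumption: a missing path is reported with this message prefix.
            if exc.desc.startswith("Path does not exist"):
                return False
            # Any other analysis error is surfaced instead of being swallowed.
            raise DataSetError("Failed during exists check: " + exc.desc) from exc
        return True


def present(path):
    return object()  # pretend the read succeeded


def missing(path):
    raise AnalysisException("Path does not exist: " + path, [])


def broken(path):
    raise AnalysisException("Other Exception", [])
```

With these stand-ins, `SketchDataSet("x", missing).exists()` returns False, while `SketchDataSet("x", broken).exists()` raises a DataSetError whose message contains "Other Exception", which is the behaviour the two tests assert.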