def test_standalone_pandas_datasource(test_folder_connection_path):
    """A standalone PandasDatasource pointed at a base_directory should discover
    the CSV asset, build matching batch_kwargs, and pass extra reader kwargs
    through to the loaded batch."""
    datasource = PandasDatasource(
        "PandasCSV", base_directory=test_folder_connection_path
    )

    assert datasource.get_available_data_asset_names() == {"default": {"test"}}
    manual_batch_kwargs = datasource.build_batch_kwargs(
        os.path.join(str(test_folder_connection_path), "test.csv")
    )

    # Get the default (subdir_path) generator
    generator = datasource.get_generator()
    auto_batch_kwargs = generator.yield_batch_kwargs("test")

    assert manual_batch_kwargs["path"] == auto_batch_kwargs["path"]

    # Pass extra reader kwargs (sep, header, index_col) through get_batch
    # alongside the generated batch_kwargs.
    dataset = datasource.get_batch(
        "test", batch_kwargs=auto_batch_kwargs, sep=",", header=0, index_col=0
    )
    assert isinstance(dataset, PandasDataset)
    assert (dataset["col_1"] == [1, 2, 3, 4, 5]).all()


def test_pandas_datasource_processes_dataset_options(test_folder_connection_path):
    """dataset_options passed in batch_kwargs (here: caching disabled) should be
    forwarded to the resulting PandasDataset."""
    datasource = PandasDatasource(
        "PandasCSV",
        batch_kwargs_generators={
            "subdir_reader": {
                "class_name": "SubdirReaderBatchKwargsGenerator",
                "base_directory": test_folder_connection_path,
            }
        },
    )
    batch_kwargs = datasource.build_batch_kwargs("subdir_reader", name="test")
    batch_kwargs["dataset_options"] = {"caching": False}
    batch = datasource.get_batch(batch_kwargs)
    validator = Validator(batch, ExpectationSuite(expectation_suite_name="foo"))
    dataset = validator.get_dataset()
    assert dataset.caching is False
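

# For context: `caching` above is the Dataset option that controls whether
# computed column values are cached between expectations, and the
# `dataset_options` entry in batch_kwargs appears to be forwarded to the
# dataset's constructor. Below is a minimal, self-contained sketch of the
# equivalent direct call, assuming PandasDataset accepts a `caching` keyword
# (which the assertion above implies); `example_caching_disabled_directly` is
# an illustrative helper, not part of the original suite.
def example_caching_disabled_directly():
    import pandas as pd

    from great_expectations.dataset import PandasDataset

    dataset = PandasDataset(pd.DataFrame({"col_1": [1, 2, 3]}), caching=False)
    assert dataset.caching is False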