Ejemplos de PandasExecutionEngine._s3 en Python

Lenguaje de programación: Python

Namespace/Package Name: great_expectations.execution_engine.pandas_execution_engine

Método / Función: _s3

Ejemplos en hotexamples.com: 2

Python PandasExecutionEngine._s3 - 2 ejemplos encontrados. Estos son los ejemplos en Python del mundo real mejor valorados de great_expectations.execution_engine.pandas_execution_engine.PandasExecutionEngine._s3 extraídos de proyectos de código abierto. Puedes valorar ejemplos para ayudarnos a mejorar la calidad de los ejemplos.

Métodos usados con frecuencia

Mostrar Ocultar

PandasExecutionEngine(30)

load_batch_data(10)

get_compute_domain(6)

get_batch_data(5)

get_domain_records(3)

_s3(2)

resolve_metrics(2)

_azure(1)

_get_reader_fn(1)

Ejemplo n.º 1

Mostrar archivo

def test_get_batch_with_no_s3_configured(batch_with_split_on_whole_table_s3):
    # if S3 was not configured
    execution_engine_no_s3 = PandasExecutionEngine()
    execution_engine_no_s3._s3 = None
    with pytest.raises(ge_exceptions.ExecutionEngineError):
        execution_engine_no_s3.get_batch_data(
            batch_spec=batch_with_split_on_whole_table_s3)

Ejemplo n.º 2

Mostrar archivo

Archivo: test_pandas_execution_engine.py Proyecto: MikelDietz/great_expectations

def test_get_batch_with_split_on_whole_table_s3():
    region_name: str = "us-east-1"
    bucket: str = "test_bucket"
    conn = boto3.resource("s3", region_name=region_name)
    conn.create_bucket(Bucket=bucket)
    client = boto3.client("s3", region_name=region_name)

    test_df: pd.DataFrame = pd.DataFrame(data={"col1": [1, 2], "col2": [3, 4]})
    keys: List[str] = [
        "path/A-100.csv",
        "path/A-101.csv",
        "directory/B-1.csv",
        "directory/B-2.csv",
    ]
    for key in keys:
        client.put_object(Bucket=bucket,
                          Body=test_df.to_csv(index=False).encode("utf-8"),
                          Key=key)

    path = "path/A-100.csv"
    full_path = f"s3a://{os.path.join(bucket, path)}"
    test_df = PandasExecutionEngine().get_batch_data(batch_spec=S3BatchSpec(
        path=full_path,
        reader_method="read_csv",
        splitter_method="_split_on_whole_table",
    ))
    assert test_df.dataframe.shape == (2, 2)

    # if S3 was not configured
    execution_engine_no_s3 = PandasExecutionEngine()
    execution_engine_no_s3._s3 = None
    with pytest.raises(ge_exceptions.ExecutionEngineError):
        execution_engine_no_s3.get_batch_data(batch_spec=S3BatchSpec(
            path=full_path,
            reader_method="read_csv",
            splitter_method="_split_on_whole_table",
        ))