def run_only_missing(
    self, pipeline: Pipeline, catalog: DataCatalog
) -> Dict[str, Any]:
    """Run only the missing outputs from the ``Pipeline`` using the
    ``DataSet``s provided by ``catalog`` and save results back to the
    same objects.

    Args:
        pipeline: The ``Pipeline`` to run.
        catalog: The ``DataCatalog`` from which to fetch data.

    Raises:
        ValueError: Raised when ``Pipeline`` inputs cannot be satisfied.

    Returns:
        Any node outputs that cannot be processed by the ``DataCatalog``.
        These are returned in a dictionary, where the keys are defined by
        the node outputs.
    """
    registered = set(catalog.list())
    # Outputs the catalog knows nothing about, plus registered data sets
    # that have not been materialised yet — these are what must be built.
    unregistered_outputs = pipeline.outputs() - registered
    nonexistent = {name for name in registered if not catalog.exists(name)}
    targets = unregistered_outputs | nonexistent

    subpipeline = pipeline.only_nodes_with_outputs(
        *targets
    ) + pipeline.from_inputs(*targets)

    # We also need any memory data sets that feed into that,
    # including chains of memory data sets.
    memory_sets = pipeline.data_sets() - registered
    memory_producers = pipeline.only_nodes_with_outputs(*memory_sets)
    needed_memory_inputs = subpipeline.inputs() & memory_sets
    subpipeline += memory_producers.to_outputs(*needed_memory_inputs)

    return self.run(subpipeline, catalog)
def run(self, pipeline: Pipeline, catalog: DataCatalog, run_id: str = None) -> Dict[str, Any]:
    """
    Run the ``Pipeline`` using the ``DataSet``s provided by ``catalog``.

    When ``self.only_missing`` is truthy, the pipeline is first pruned to
    the nodes whose outputs do not yet exist in ``catalog`` — together
    with every node downstream of those outputs — before delegating to
    the parent runner.

    Parameters
    ----------
    pipeline: Pipeline
        The ``Pipeline`` to run
    catalog: DataCatalog
        The ``DataCatalog`` from which to fetch data.
    run_id: str
        The id of the run. Defaults to ``None``.

    Returns
    -------
    dict
        Any node outputs that cannot be processed by the ``DataCatalog``.
        These are returned in a dictionary, where the keys are defined
        by the node outputs.
    """
    # If missing flag run missing_output pipeline and its child nodes
    if self.only_missing:
        # Registered-but-not-yet-materialised data sets, restricted to
        # those this pipeline actually produces or consumes.
        to_build = {
            ds for ds in catalog.list() if not catalog.exists(ds)
        }.intersection(pipeline.data_sets())
        pipeline = pipeline.only_nodes_with_outputs(
            *to_build
        ) + pipeline.from_inputs(*to_build)
    # Zero-argument super() (Python 3) replaces the legacy
    # super(DatalabRunner, self) spelling; behavior is identical.
    return super().run(pipeline, catalog, run_id)