Example #1
from airflow import DAG
from airflow.contrib.operators.bigquery_operator import BigQueryOperator  # Airflow 1.x contrib import path

# Global variables
# (ENVIRONMENTS, DEFAULT_ARGS and schedule_interval are assumed to be defined elsewhere in this module)
bqproject = 'usage-data-reporting'
datasetenv = 'DEV'

# Create DAG
dag = DAG(ENVIRONMENTS['dev']['dag-name'],
          default_args=DEFAULT_ARGS,
          schedule_interval=schedule_interval,
          description='CK DAG GBQ Test')

MOVE_LDZ_DATA_TO_DWH = BigQueryOperator(
    dag=dag,
    task_id='CK_GBQ_TEST_TASK_01',
    sql='Tests/ckgbqtest.sql',
    params={
        "project": bqproject,
        "environment": datasetenv
    },
    # destination_dataset_table=bqproject + '.' + datasetenv + '_DWH_GBQ_Test.CK_GBQ_Test',  # target table
    write_disposition='WRITE_APPEND',  # write disposition: WRITE_TRUNCATE or WRITE_APPEND
    use_legacy_sql=False,
    bigquery_conn_id=ENVIRONMENTS['dev']['connection-id'])

# References the task defined above (see dag name) and supplies a description, shown in the Airflow UI.
MOVE_LDZ_DATA_TO_DWH.doc_md = """Write data from LDZ to DWH"""

# Define how the different steps in the workflow are executed
# (with a single task there are no upstream/downstream dependencies to declare)

MOVE_LDZ_DATA_TO_DWH
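
Note that the sql argument is a Jinja-templated field, so Tests/ckgbqtest.sql can reference the values passed via params as {{ params.project }} and {{ params.environment }}. With only one task there is nothing to order, but the sketch below (not part of the original example) shows how a dependency would be declared with Airflow's bit-shift syntax if a second operator were added; CLEANUP_TASK, its task id and its SQL file are hypothetical names.

# Hypothetical second task, reusing the connection and project settings from above.
CLEANUP_TASK = BigQueryOperator(
    dag=dag,
    task_id='CK_GBQ_TEST_TASK_02',  # hypothetical task id
    sql='Tests/ckgbqcleanup.sql',  # hypothetical templated SQL file
    use_legacy_sql=False,
    bigquery_conn_id=ENVIRONMENTS['dev']['connection-id'])

# Run the cleanup task only after the load task has finished.
MOVE_LDZ_DATA_TO_DWH >> CLEANUP_TASK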
Example #2
        task_id='my_bq_task_1_' + lob,  # task IDs must be unique within the DAG
        bql='my_qry_1.sql',  # the actual SQL command we want to run on BigQuery is in this file in the same folder; it is also templated
        params={"lob": lob},  # the SQL file above has a template placeholder for a 'lob' parameter - this is how we pass it in
        destination_dataset_table='airflow.' + lob + '_test_task1',  # in this example we also want our target table to be lob and task specific
        write_disposition='WRITE_TRUNCATE',  # drop and recreate this table each time; you could use other options here
        bigquery_conn_id='my_gcp_connection'  # this is the Airflow connection to GCP we defined in the front end. More info here: https://github.com/alexvanboxel/airflow-gcp-examples
    )
    # add documentation for what this task does - this will be displayed in the Airflow UI
    bq_task_1.doc_md = """\
    Append a "Hello World!" message string to the table [airflow.<lob>_test_task1]
    """

    # define the second task, in our case another BigQuery operator
    bq_task_2 = BigQueryOperator(
        dag=dag,  # need to tell Airflow that this task belongs to the DAG we defined above
        task_id='my_bq_task_2_' + lob,  # task IDs must be unique within the DAG
        bql='my_qry_2.sql',  # the actual SQL command we want to run on BigQuery is in this file in the same folder; it is also templated
        params={"lob": lob},  # the SQL file above has a template placeholder for a 'lob' parameter - this is how we pass it in
        destination_dataset_table='airflow.' + lob +