def generate_search_terms(**kwargs):
    """Pick the downstream search task to run for this DAG run.

    Reads every term from the ``twitter_terms`` table and returns the task id
    ``search_<term>_twitter`` for one randomly chosen term, with non-word
    characters stripped so the id is a valid Airflow task id.

    Returns:
        str: the task id of the randomly selected search task.
    """
    dbconn = MySqlHook(mysql_conn_id="mysql_default")
    conn = dbconn.get_connection()
    try:
        # Only the search_term column is used below — no need to SELECT *.
        df = pd.read_sql_query("select search_term from twitter_terms", conn)
    finally:
        # Close the DBAPI connection so we don't leak it per task run.
        conn.close()
    return random.choice([
        "search_{}_twitter".format(re.sub(r"\W+", "", term))
        for term in df.search_term.values
    ])
def fill_terms(my_terms=SEARCH_TERMS, **kwargs):
    """Seed the ``twitter_terms`` table with the given search terms.

    Args:
        my_terms: iterable of search-term strings; defaults to the
            module-level SEARCH_TERMS constant.

    Best-effort: if the table already exists, pandas raises ValueError and
    we deliberately leave the existing contents untouched.
    """
    dbconn = MySqlHook(mysql_conn_id="mysql_default")
    conn = dbconn.get_connection()
    df = pd.DataFrame(my_terms, columns=["search_term"])
    try:
        df.to_sql("twitter_terms", conn)
    except ValueError:
        # Table already exists — keep the current terms rather than failing.
        pass
    finally:
        # Release the connection whether or not the insert succeeded.
        conn.close()
def csv_to_sql(directory=RAW_TWEET_DIR, **kwargs):
    """Load raw tweet CSV files into the ``tweets`` table.

    Appends each unread ``*.csv`` in *directory* to the table, then renames
    the file with a ``_read`` suffix so it is not ingested twice.

    Args:
        directory: str, path containing the raw tweet CSV files.
    """
    dbconn = MySqlHook(mysql_conn_id="mysql_default")
    conn = dbconn.get_connection()
    try:
        for fname in glob.glob("{}/*.csv".format(directory)):
            if "_read" in fname:
                continue  # already ingested on a previous run
            try:
                df = pd.read_csv(fname)
                # BUG FIX: to_sql needs the DB connection, not the hook object.
                df.to_sql("tweets", conn, if_exists="append", index=False)
                # Mark the file as processed so the next run skips it.
                shutil.move(fname, fname.replace(".csv", "_read.csv"))
            except pd.errors.EmptyDataError:
                # Probably an I/O race with another task holding the file open;
                # skip it and pick it up on a later run.
                continue
    finally:
        conn.close()
def identify_popular_links(directory=RAW_TWEET_DIR, write_mode="w", **kwargs):
    """Identify the most popular links from the last day of tweets in the db.

    Writes the five most frequent URLs (with counts) to ``latest_links.txt``
    in RAW_TWEET_DIR (or the *directory* kwarg).

    Args:
        directory: str, output directory for latest_links.txt.
        write_mode: file open mode; "w" overwrites, "a" appends.
    """
    dbconn = MySqlHook(mysql_conn_id="mysql_default")
    conn = dbconn.get_connection()
    # BUG FIX: the previous query used SQLite's date('now', '-1 days'),
    # which is not valid on a MySQL connection; use INTERVAL arithmetic.
    query = """select * from tweets
               where created > now() - interval 1 day
               and urls is not null
               order by favorite_count"""
    try:
        df = pd.read_sql_query(query, conn)
    finally:
        conn.close()
    # urls is stored as a stringified Python list; parse it safely.
    df.urls = df.urls.map(ast.literal_eval)
    cntr = Counter(itertools.chain.from_iterable(df.urls.values))
    with open("{}/latest_links.txt".format(directory), write_mode) as latest:
        wrtr = writer(latest)
        wrtr.writerow(["url", "count"])
        wrtr.writerows(cntr.most_common(5))