# Example #1
from dip.util import timetool
import sys
import random

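# Python 2 idiom: site.py removes sys.setdefaultencoding at interpreter
# startup, so sys is reloaded to restore it and force UTF-8 for
# implicit str/unicode conversions.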
reload(sys)
sys.setdefaultencoding("utf-8")

import re
import json
import time

from pyspark import SparkConf
from dip.spark import SparkContext
from pyspark.sql import HiveContext
from pyspark.sql.types import StructType, StructField, StringType, IntegerType, FloatType, ArrayType

conf = SparkConf().setAppName(
    "app_picserversweibof6vwt_wapvideodownload_to_hdfs")

sc = SparkContext(conf=conf)

hc = HiveContext(sc)

try:
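    # Read the day's raw log. timetool.getHDFSDayDir presumably maps the
    # date argument to a day-partitioned HDFS directory name (inferred
    # from the helper's name; dip.util is an in-house library).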
    source = sc.textFile(
        "/user/hdfs/rawlog/app_picserversweibof6vwt_wapvideodownload/" + timetool.getHDFSDayDir(sys.argv[1]))

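    # Capture the first two backtick-delimited fields of each log line.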
    pattern = re.compile("^([^`]*)`([^`]*)")

    def lineParse(line):
        matcher = pattern.match(line)

        if not matcher:
            return None

        # Return the two captured fields for lines that match.
        return matcher.groups()
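
    # Hypothetical continuation: the source snippet is cut off above, so
    # this is only a minimal sketch that parses each line, drops records
    # the pattern rejects, and closes the try block.
    parsed = source.map(lineParse).filter(lambda fields: fields is not None)
    print parsed.count()
finally:
    sc.stop()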

# Example #2

from pyspark import SparkConf
from dip.spark import SparkContext
from pyspark.sql import HiveContext

conf = SparkConf().setAppName("spark_parse")

sc = SparkContext(conf=conf)

hc = HiveContext(sc)


def printRows(rows):
    for row in rows:
        print row

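# extract_text_to_arr is an extension method on the in-house dip.spark
# SparkContext, not standard PySpark. Judging by its arguments, it
# splits each line on the given delimiter, casts the resulting fields
# to the listed types, and keeps only rows accepted by the trailing
# predicate; this is inferred from the call site, not from docs.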
rows = sc.extract_text_to_arr("hdfs://dip.cdh5.dev:8020/user/yurun/text",
                              "delimiter", " ",
                              [str, int, str, str, str],
                              lambda words: words[0] == "1").collect()

printRows(rows)

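# The same extraction driven by a regex instead of a delimiter, chained
# through two more dip.spark extension methods: transform_arr appears
# to rewrite and re-type each row, and load_arr_to_table to register
# the result as the Hive table queried below (again inferred from the
# call site).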
(sc.extract_text_to_arr("hdfs://dip.cdh5.dev:8020/user/yurun/text",
                        "regex", "(.*) (.*) (.*) (.*) (.*)",
                        filter=lambda words: True)
 .transform_arr(lambda words: [words[0].upper()], [int],
                lambda words: words[0] == 1)
 .load_arr_to_table(hc, "temp_table", [("first", int, False)]))

rows = hc.sql("select * from temp_table").collect()

printRows(rows)

sc.stop()
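
# Example #3
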
from pyspark import SparkConf
from dip.spark import SparkContext

conf = SparkConf().setAppName("spark_textFiles_test")

sc = SparkContext(conf=conf)

dirs = ["hdfs://dip.cdh5.dev:8020/user/yurun/dir1",
        "hdfs://dip.cdh5.dev:8020/user/yurun/dir2"]


def printLines(lines):
    if lines:
        for line in lines:
            print line

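# textFiles (plural) is another dip.spark extension: unlike the
# standard SparkContext.textFile, it takes a list of paths, presumably
# reading them all into one RDD (inferred from this usage).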
lines = sc.textFiles(dirs).collect()

printLines(lines)

sc.stop()