def get_telemetry_crashes(sc, versions, days, product='Firefox'):
    """Load Socorro crash reports from the telemetry parquet bucket.

    Reads one parquet partition per day returned by ``utils.get_days(days)``,
    then restricts the result to the given ``product`` and ``versions``.
    For every product except 'FennecAndroid', the Android-specific columns
    are dropped since they carry no data there.

    :param sc: SparkContext used to build the SQLContext.
    :param versions: iterable of version strings to keep.
    :param days: value understood by ``utils.get_days`` describing the date range.
    :param product: product name to filter on (default 'Firefox').
    :return: a Spark DataFrame of matching crash reports.
    """
    day_list = utils.get_days(days)
    paths = [
        's3://telemetry-parquet/socorro_crash/v2/crash_date=' + d.strftime('%Y%m%d')
        for d in day_list
    ]
    crashes = SQLContext(sc).read.load(paths, 'parquet')

    if product != 'FennecAndroid':
        # These columns are only populated for Android crashes; drop them
        # for every other product.
        android_only = {
            'android_board',
            'android_brand',
            'android_cpu_abi',
            'android_cpu_abi2',
            'android_device',
            'android_hardware',
            'android_manufacturer',
            'android_model',
            'android_version',
        }
        kept = [c for c in crashes.columns if c not in android_only]
        crashes = crashes.select(kept)

    product_match = crashes['product'] == product
    version_match = crashes['version'].isin(versions)
    return crashes.filter(product_match & version_match)
def load_dataFrame_from_csv(self, csvFilePath):
    """Read the matches CSV into a DataFrame with an explicit schema.

    The file is parsed with a header row and the fixed schema below
    (all fields nullable). Rows whose ``no`` column holds the literal
    string "None" are filtered out before returning.

    :param csvFilePath: path to the CSV file to load.
    :return: a Spark DataFrame of the rows whose ``no`` column is not "None".
    """
    # (column name, Spark type) pairs, in the exact CSV column order.
    column_types = [
        ("X2", StringType()),
        ("X4", StringType()),
        ("X5", StringType()),
        ("X6", StringType()),
        ("adversaire", StringType()),
        ("score_france", IntegerType()),
        ("score_adversaire", IntegerType()),
        ("penalty_france", StringType()),
        ("penalty_adversaire", StringType()),
        ("date", DateType()),
        ("year", IntegerType()),
        ("outcome", StringType()),
        ("no", StringType()),
    ]
    schema = StructType(
        [StructField(name, dtype, True) for name, dtype in column_types]
    )

    raw_df = SQLContext(self.spark).read.csv(
        csvFilePath, header=True, schema=schema
    )
    # "None" here is a string sentinel present in the CSV itself,
    # not a SQL NULL.
    return raw_df.filter(raw_df.no != "None")