Python Babe.groupByの例

プログラミング言語: Python

名前空間/パッケージ名: pybabe

クラス/型: Babe

メソッド/関数: groupBy

hotexamples.comのコード掲載数: 4

Python Babe.groupBy - 4件のコード例が見つかりました。すべてオープンソースプロジェクトから抽出されたPythonのpybabe.Babe.groupByの実例で、最も評価が高いものを厳選しています。コード例の評価を行っていただくことで、より質の高いコード例が表示されるようになります。

よく使われるメソッド

表示非表示

Babe(30)

push(30)

to_string(30)

pull(24)

typedetect(9)

mapTo(6)

head(6)

push_sql(4)

join(4)

primary_key_detect(4)

dedup(4)

partition(4)

push_mongo(3)

filterColumns(3)

maxN(3)

rename(2)

push_bigquery(2)

replace_in_string(2)

sort(2)

user_agent(2)

minN(2)

parse_time(2)

filter(2)

filter_values(2)

flatMap(2)

geoip_country_code(2)

groupBy(2)

group(1)

to_list(1)

tee(1)

get_config_with_env(1)

groupAll(1)

bulkMapTo(1)

group_all(1)

has_config(1)

keynormalize(1)

pull_kontagent(1)

pull_command(1)

mail(1)

merge_substreams(1)

windowMap(1)

コード例 #1

ファイルを表示

 def test_groupby(self):
     a = Babe().pull(stream=StringIO('a,b\n1,2\n3,4\n1,4\n'),
                     format="csv").typedetect()
     a = a.groupBy(key="a",
                   reducer=lambda key, rows:
                   (key, sum([row.b for row in rows])))
     buf = StringIO()
     a.push(stream=buf, format='csv')
     self.assertEquals(buf.getvalue(), "a,b\n1,6\n3,4\n")

コード例 #2

ファイルを表示

ファイル: wordcount.py プロジェクト: waytai/PyBabe

def wordcount():
    a = Babe().pull(protocol='http',
                    host='www.ietf.org',
                    filename='rfc/rfc1149.txt')
    a = a.flatMap(lambda row: [(w, 1) for w in re.findall('\w+', row.text)],
                  columns=['word', 'count'])
    a = a.groupBy(key='word',
                  reducer=lambda word, rows:
                  (word, sum([row.count for row in rows])))
    a = a.maxN(column='count', n=10)
    a.push(stream=sys.stdout, format='csv')

コード例 #3

ファイルを表示

ファイル: wordcount.py プロジェクト: IsCoolEntertainment/PyBabe

def wordcount():
    a = Babe().pull(protocol='http',
                    host='www.ietf.org',
                    filename='rfc/rfc1149.txt')
    a = a.flatMap(lambda row: [(w, 1) for w in re.findall('\w+', row.text)],
                  columns=['word', 'count'])
    a = a.groupBy(key='word',
                  reducer=lambda word, rows: (word, sum([row.count for row in rows])))
    a = a.maxN(column='count',
               n=10)
    a.push(stream=sys.stdout,
           format='csv')

コード例 #4

ファイルを表示

ファイル: tests.py プロジェクト: nizox/PyBabe

 def test_groupby(self):
     a = Babe().pull(stream=StringIO('a,b\n1,2\n3,4\n1,4\n'), format="csv").typedetect()
     a = a.groupBy(key="a", reducer=lambda key, rows: (key, sum([row.b for row in rows])))
     buf = StringIO()
     a.push(stream=buf, format='csv')
     self.assertEquals(buf.getvalue(), "a,b\n1,6\n3,4\n")