def createFromArrowRecordBatchesRDD(self, ardd, schema=None, timezone=None):
    # In Spark 3.x these helpers live under pyspark.sql.pandas.types;
    # earlier versions imported from_arrow_schema from pyspark.sql.types
    # and serialized batches with pyspark.serializers.ArrowSerializer.
    from pyspark.sql.pandas.types import from_arrow_schema
    from pyspark.sql.dataframe import DataFrame

    # Keep only Arrow record batches, cache them, then serialize each batch
    # to bytes so it can be shipped across the Py4J boundary to the JVM.
    # `pa` and `_arrow_record_batch_dumps` are module-level names.
    ardd = ardd.filter(lambda x: isinstance(x, pa.RecordBatch)).cache()
    ardd = ardd.map(_arrow_record_batch_dumps)

    # Derive the Spark schema from the module's Arrow schema helpers
    # (`args`, `sam_schema`, and `_schema` are defined elsewhere) unless the
    # caller supplied one. A schema can also be built by hand, e.g.
    # pa.schema([pa.field('c0', pa.int16()), pa.field('c1', pa.int32())],
    #           metadata={b'foo': b'bar'}).
    if schema is None:
        if args.aligner == "BWA":
            schema = from_arrow_schema(sam_schema())
        else:
            schema = from_arrow_schema(_schema())

    # Create the Spark DataFrame directly from the Arrow data and schema.
    jrdd = ardd._to_java_object_rdd()
    jdf = self._jvm.PythonSQLUtils.toDataFrame(
        jrdd, schema.json(), self._wrapped._jsqlContext)
    df = DataFrame(jdf, self._wrapped)
    df._schema = schema
    return df
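
# A minimal sketch of the `_arrow_record_batch_dumps` helper that the method
# above maps over the RDD, assuming it serializes each RecordBatch to Arrow
# IPC-stream bytes so batches travel to the JVM as plain byte arrays. The
# project's actual helper (possibly built on pyspark.serializers.ArrowSerializer)
# may differ.
def _arrow_record_batch_dumps(batch):
    import pyarrow as pa
    sink = pa.BufferOutputStream()
    # Write a single-batch IPC stream and hand back its raw bytes.
    with pa.ipc.new_stream(sink, batch.schema) as writer:
        writer.write_batch(batch)
    return bytearray(sink.getvalue().to_pybytes())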
def test_schema_conversion_roundtrip(self):
    from pyspark.sql.pandas.types import from_arrow_schema, to_arrow_schema
    # Converting a Spark schema to Arrow and back should be lossless.
    arrow_schema = to_arrow_schema(self.schema)
    schema_rt = from_arrow_schema(arrow_schema)
    self.assertEqual(self.schema, schema_rt)
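
# A standalone illustration of the round trip exercised by the test above.
# The field names and types here are hypothetical stand-ins for self.schema,
# but to_arrow_schema and from_arrow_schema are the real
# pyspark.sql.pandas.types helpers.
from pyspark.sql.types import StructType, StructField, IntegerType, StringType
from pyspark.sql.pandas.types import from_arrow_schema, to_arrow_schema

spark_schema = StructType([
    StructField("c0", IntegerType()),
    StructField("c1", StringType()),
])
arrow_schema = to_arrow_schema(spark_schema)  # IntegerType -> pa.int32(), StringType -> pa.string()
assert from_arrow_schema(arrow_schema) == spark_schema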