Exemplos de IndexToString.getOutputCol em Python

Linguagem de programação: Python

Espaço para nome / nome do pacote: pyspark.ml.feature

Classe / Tipo: IndexToString

Método / Função: getOutputCol

Exemplos em hotexamples.com: 3

IndexToString.getOutputCol em Python - 3 exemplos encontrados. Esses são os exemplos do mundo real mais bem avaliados de pyspark.ml.feature.IndexToString.getOutputCol em Python extraídos de projetos de código aberto. Você pode avaliar os exemplos para nos ajudar a melhorar a qualidade deles.

Métodos Frequentes

Exibir Ocultar

IndexToString(30)

transform(29)

getInputCol(3)

getLabels(2)

getOutputCol(2)

drop(1)

setHandleInvalid(1)

setLabels(1)

Métodos Frequentes

IndexToString (30)

transform (29)

getInputCol (3)

getLabels (2)

getOutputCol (2)

drop (1)

setHandleInvalid (1)

setLabels (1)

Exemplo n.º 1

0

Exibir arquivo

Arquivo: index_to_string_example.py Projeto: lhfei/spark-in-action

if __name__ == "__main__": spark = SparkSession\ .builder\ .appName("IndexToStringExample")\ .getOrCreate() # $example on$ df = spark.createDataFrame( [(0, "a"), (1, "b"), (2, "c"), (3, "a"), (4, "a"), (5, "c")], ["id", "category"]) indexer = StringIndexer(inputCol="category", outputCol="categoryIndex") model = indexer.fit(df) indexed = model.transform(df) print("Transformed string column '%s' to indexed column '%s'" % (indexer.getInputCol(), indexer.getOutputCol())) indexed.show() print("StringIndexer will store labels in output column metadata\n") converter = IndexToString(inputCol="categoryIndex", outputCol="originalCategory") converted = converter.transform(indexed) print("Transformed indexed column '%s' back to original string column '%s' using " "labels in metadata" % (converter.getInputCol(), converter.getOutputCol())) converted.select("id", "categoryIndex", "originalCategory").show() # $example off$ spark.stop()

Exemplo n.º 2

0

Exibir arquivo

Arquivo: Data engineering pyspark.py Projeto: hmk88/Pyspark_ML_databricks_ApacheSpark

model = indexer.fit(df) indexed = model.transform(df) print("Transformed string column '%s' to indexed column '%s'" % (indexer.getInputCol(), indexer.getOutputCol())) indexed.show() print("StringIndexer will store labels in output column metadata\n") converter = IndexToString(inputCol="categoryIndex", outputCol="originalCategory") converted = converter.transform(indexed) print( "Transformed indexed column '%s' back to original string column '%s' using " "labels in metadata" % (converter.getInputCol(), converter.getOutputCol())) converted.select("id", "categoryIndex", "originalCategory").show() # COMMAND ---------- ###One hot encode estimator maps the categorical features to binary vector. It is common practice to run string indexer first to convert the raw features into indexed features (Stringindexer) from pyspark.ml.feature import OneHotEncoderEstimator df = spark.createDataFrame([(0.0, 1.0), (1.0, 0.0), (2.0, 1.0), (0.0, 2.0), (0.0, 1.0), (2.0, 0.0)], ["categoryIndex1", "categoryIndex2"]) encoder = OneHotEncoderEstimator( inputCols=["categoryIndex1", "categoryIndex2"], outputCols=["categoryVec1", "categoryVec2"]) model = encoder.fit(df)

Exemplo n.º 3

0

Exibir arquivo

Arquivo: sparkML_FT_indextostring.py Projeto: shivendrakrjha/Projects-1

from pyspark.ml.feature import IndexToString, StringIndexer from pyspark.sql import SparkSession spark = SparkSession.builder.getOrCreate() spark.sparkContext.setLogLevel("ERROR") df = spark.createDataFrame([(0, "a"), (1, "b"), (2, "c"), (3, "a"), (4, "a"), (5, "c")], ["id", "category"]) indexer = StringIndexer(inputCol="category", outputCol="categoryIndex") model = indexer.fit(df) indexed = model.transform(df) print("Transformed string column '%s' to indexed column '%s'" % (indexer.getInputCol(), indexer.getOutputCol())) indexed.show() converter = IndexToString(inputCol="categoryIndex", outputCol="originalCategory") converted = converter.transform(indexed) print( "Transformed indexed column '%s' back to original string column '%s' using labels in metadata" % (converter.getInputCol(), converter.getOutputCol())) converted.select("id", "categoryIndex", "originalCategory").show() spark.stop()