# Persist the best decision-tree results for later analysis.
# NOTE(review): the original passed `header=True`, which is a CSV option and is
# silently ignored by the parquet writer — removed to avoid confusion.
resultsBestDtDf.write.save('/mnt/data/resultsBestDtDf.parquet', format='parquet', mode="overwrite")

# COMMAND ----------

# COMMAND ----------

from pyspark.ml.regression import RandomForestRegressor

# Create a RandomForestRegressor.
rf = RandomForestRegressor()

# Configure the forest: label column is named "6714" in this dataset,
# features come from the upstream vectorizer stage.
# FIX: seed was written as `190088121L` (Python 2 long suffix) — a
# SyntaxError on Python 3; plain int literal is correct on both.
rf.setPredictionCol("Prediction_cuisine")\
  .setLabelCol("6714")\
  .setFeaturesCol("features")\
  .setSeed(190088121)\
  .setMaxDepth(8)\
  .setNumTrees(25)

# Create a Pipeline.
rfPipeline = Pipeline()

# Set the stages of the Pipeline: vectorize features, then fit the forest.
rfPipeline.setStages([vectorizer, rf])

# Let's first train on the entire dataset to see what we get.
rfModel = rfPipeline.fit(trainingSetDF)

# COMMAND ----------
trainingSetDF = split80DF
testSetDF = split20DF

# Cache both splits so repeated Spark actions don't recompute the lineage.
trainingSetDF.cache()
testSetDF.cache()

# Decision-tree ensemble (random forest) regressor.
rf = RandomForestRegressor()

# For information about the available parameters:
print(rf.explainParams())

rf.setPredictionCol('Predicted_PE')\
  .setLabelCol('PE')\
  .setNumTrees(20)\
  .setMaxDepth(5)

# Forest Pipeline: vectorize features, then fit the forest.
pipeline = Pipeline(stages=[vectorizer, rf])

# Train the model.
model = pipeline.fit(trainingSetDF)

# Details of the fitted forest.
# FIX: the original "commented out" these prints by opening a triple-quoted
# string (`"""`) that was never closed — a SyntaxError as written. Converted
# to plain comments instead; uncomment to inspect the fitted model.
# print("Nodos: " + str(model.stages[-1]._java_obj.parent().getNumTrees()))
# print("Profundidad: " + str(model.stages[-1]._java_obj.parent().getMaxDepth()))