Python StringIndexer.count Examples

Programming language: Python

Namespace/Package: pyspark.ml.feature

Class/Type: StringIndexer

Method/Function: count

Examples at hotexamples.com: 2

Python StringIndexer.count - 2 examples found. These are the top-rated real-world examples of pyspark.ml.feature.StringIndexer.count extracted from open source projects. In both examples, count() is called on a DataFrame used alongside StringIndexer, not on the indexer itself.

Frequently used methods

StringIndexer(30)

fit(30)

transform(30)

getOutputCol(22)

show(19)

select(15)

setHandleInvalid(14)

write(10)

drop(9)

randomSplit(8)

toPandas(4)

withColumnRenamed(4)

getInputCol(3)

withColumn(3)

groupBy(3)

where(3)

printSchema(3)

save(2)

setInputCol(2)

count(2)

take(1)

describe(1)

setOutputCol(1)

filter(1)

dropna(1)

fitAsync(1)

orderBy(1)

_call_java(1)

labels(1)

groupby(1)

getOutputCols(1)

fillna(1)

load(1)

Example #1

File: Ex2a.3.py Project: wel51x/Machine_Learning_and_Spark

    header=True, inferSchema=True, nullValue='NA')

# Get number of records
print("The data contain %d records." % flights.count(), '\n')

# Remove records with missing 'delay' values
flights = flights.filter('delay IS NOT NULL')

# Create an indexer for carrier categorical feature
indexer = StringIndexer(inputCol="carrier", outputCol='carrier_idx')

# Indexer identifies categories in the data
indexer_model = indexer.fit(flights)

# Indexer creates a new column with numeric index values
flights_indexed = indexer_model.transform(flights)

# Repeat the process for the org categorical feature
flights_indexed = StringIndexer(
    inputCol="org", outputCol='org_idx').fit(flights_indexed).transform(flights_indexed)

# Check first five records
flights_indexed.show(5)

# Get number of records
print("The data contain %d records." % flights_indexed.count(), '\n')

spark.stop()
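The indexing step in the snippet above can be illustrated without a Spark session. The following is a minimal pure-Python sketch, not PySpark code: it assumes StringIndexer's default ordering (most frequent label gets index 0.0, with ties broken alphabetically as in recent Spark versions). The helper names `fit_string_index` and `transform`, and the sample rows, are hypothetical and exist only for this illustration.

```python
from collections import Counter


def fit_string_index(values):
    """Map each label to a float index, most frequent label first.

    Assumption: ties are broken alphabetically, mirroring the default
    frequency-descending ordering of StringIndexer in recent Spark versions.
    """
    counts = Counter(values)
    ordered = sorted(counts, key=lambda label: (-counts[label], label))
    return {label: float(i) for i, label in enumerate(ordered)}


def transform(rows, column, mapping, output_column):
    """Return new rows with an added index column; no rows are added or removed."""
    return [{**row, output_column: mapping[row[column]]} for row in rows]


# Hypothetical sample data standing in for the flights DataFrame
flights = [
    {"carrier": "UA", "org": "SFO"},
    {"carrier": "AA", "org": "ORD"},
    {"carrier": "UA", "org": "ORD"},
    {"carrier": "OO", "org": "SFO"},
    {"carrier": "UA", "org": "SFO"},
    {"carrier": "AA", "org": "SFO"},
]

carrier_index = fit_string_index(row["carrier"] for row in flights)
flights_indexed = transform(flights, "carrier", carrier_index, "carrier_idx")

print(carrier_index)
print(len(flights), len(flights_indexed))
```

Because indexing only appends a column, the record count before and after the transform is the same, which is what the two count() calls in the example report.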

Example #2

File: Untitled1.py Project: sosam29/pycode

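from pyspark.sql.functions import col

# In[110]:
desidxer_df.describe().show()

# In[115]:
# Rows whose tailnum is the string 'NA' or empty (assumed intent)
desidxer_df.filter((col("tailnum") == "NA") | (col("tailnum") == ""))

# In[118]:
desidxer_df.count()

# In[120]:
# drop() is called without column names, so no columns are removed
df3 = desidxer_df.drop()

# In[123]:
df3.count()

# In[ ]: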