PySpark RDD | count method
Last updated: Jul 1, 2022
Tags: PySpark
PySpark RDD's count(~) method returns the number of values in the RDD as an integer.
Parameters
This method does not take in any parameters.
Return Value
An integer (int).
Examples
Consider a PySpark RDD, created with the parallelize(~) method, that holds the following values:
['A', 'B', 'A', 'B']
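A minimal sketch of how such an RDD could be created. It assumes a local Spark installation and obtains a context through SparkContext.getOrCreate(); the helper names make_rdd and main are hypothetical and used only for illustration.

```python
def make_rdd(sc, values):
    """Distribute a local Python list as an RDD using the given SparkContext."""
    return sc.parallelize(values)


def main():
    # Imported here so that only this entry point needs Spark installed.
    from pyspark import SparkContext

    # Assumption: reuse the active SparkContext if one exists, otherwise start one.
    sc = SparkContext.getOrCreate()
    rdd = make_rdd(sc, ['A', 'B', 'A', 'B'])
    # collect() brings every element back to the driver as a Python list.
    print(rdd.collect())
    sc.stop()
```

Calling main() in an environment where PySpark is available prints the list of values held by the RDD. In a notebook or the pyspark shell, a context is often already available as sc, in which case make_rdd(sc, ...) can be called directly.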
Getting the number of values in a PySpark RDD
To get the number of elements in the RDD, use the count() method:
rdd.count()
4
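Since count() tallies every element, duplicates included, it can also be chained after a transformation to count only a subset. The following is a rough sketch under the same assumptions as the example above (a local Spark installation and a context from SparkContext.getOrCreate()); count_matching and main are hypothetical names, not part of PySpark.

```python
def count_matching(rdd, target):
    """Return (total elements, elements equal to `target`) for an RDD."""
    total = rdd.count()
    # filter() is a lazy transformation; count() is the action that runs it.
    matching = rdd.filter(lambda x: x == target).count()
    return total, matching


def main():
    # Imported here so that only this entry point needs Spark installed.
    from pyspark import SparkContext

    # Assumption: reuse the active SparkContext if one exists, otherwise start one.
    sc = SparkContext.getOrCreate()
    rdd = sc.parallelize(['A', 'B', 'A', 'B'])
    total, matching = count_matching(rdd, 'A')
    print(total)     # number of elements in the whole RDD
    print(matching)  # number of elements equal to 'A'
    sc.stop()
```

For the RDD used on this page, main() should report 4 elements in total, 2 of which equal 'A'.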
Published by Isshin Inada
Official PySpark Documentation
https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.count.html