PySpark DataFrame | show method
Start your free 7-days trial now!
PySpark DataFrame's show(~) method prints the rows of the DataFrame on the console.
Parameters
1. n | int | optional
The number of rows to show. By default, n=20.
2. truncate | boolean or int | optional
If
True, then strings that are longer than 20 characters will be truncated.If
False, then whole strings will be shown.If
int, then strings that are longer thantruncatewill be truncated.
If truncation occurs, then the left part of the string is preserved. By default, truncate=True.
3. vertical | boolean | optional
If True, then the rows are printed with one line for each column value. By default, vertical=False.
Return Value
None.
Examples
Consider the following PySpark DataFrame:
columns = ["name", "age"]data = [("Alex", 15), ("Bob", 20), ("Cathy", 25)]
Printing the first n rows of PySpark DataFrame
To print the first 20 rows of the PySpark DataFrame:
+-----+---+| name|age|+-----+---+| Alex| 15|| Bob| 20||Cathy| 25|+-----+---+
To print the first 2 rows of the DataFrame:
+----+---+|name|age|+----+---+|Alex| 15|| Bob| 20|+----+---+only showing top 2 rows
Truncating strings in printed rows of PySpark DataFrame
To truncate strings that are longer than 2:
df.show(truncate=2)
+----+---+|name|age|+----+---+| Al| 15|| Bo| 20|| Ca| 25|+----+---+
Disabling truncation of strings in printed rows of PySpark DataFrame
To disable truncation of strings in printed rows:
df.show(truncate=False)
+-----+---+|name |age|+-----+---+|Alex |15 ||Bob |20 ||Cathy|25 |+-----+---+
Printing rows of PySpark DataFrame vertically
To print each column value in a separate line:
df.show(vertical=True)
-RECORD 0----- name | Alex age | 15 -RECORD 1----- name | Bob age | 20 -RECORD 2----- name | Cathy age | 25