PySpark
keyboard_arrow_down 147 guides
chevron_leftPySpark Column
check_circle
Mark as learned thumb_up
2
thumb_down
0
chat_bubble_outline
0
Comment auto_stories Bi-column layout
settings
PySpark Column | rlike method
schedule Aug 12, 2023
Last updated local_offer
Tags PySpark
tocTable of Contents
expand_more Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!
Start your free 7-days trial now!
PySpark Column's rlike(~) method returns a Column of booleans where True corresponds to string column values that match the specified regular expression.
NOTE
The rlike(~) method is the same as the RLIKE operator in SQL.
Parameters
1. str | other
The regular expression to match against.
Return Value
A Column object of booleans.
Examples
Consider the following PySpark DataFrame:
+----+---+|name|age|+----+---+|Alex| 20|| Bob| 30|+----+---+
Getting rows where values match some regular expression in PySpark DataFrame
To get rows where values match some regex:
Here, the regular expression "^A" matches strings that begin with "A". Also, F.col("name").rlike("^A") returns a Column object of booleans:
In our solution, we use the filter(~) method to fetch only the rows that correspond to True.
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
Official PySpark Documentation
https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.sql.Column.rlike.html
thumb_up
2
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!