search
Search
Login
Unlock 100+ guides
menu
menu
web
search toc
close
Comments
Log in or sign up
Cancel
Post
account_circle
Profile
exit_to_app
Sign out
What does this mean?
Why is this true?
Give me some examples!
search
keyboard_voice
close
Searching Tips
Search for a recipe:
"Creating a table in MySQL"
Search for an API documentation: "@append"
Search for code: "!dataframe"
Apply a tag filter: "#python"
Useful Shortcuts
/ to open search panel
Esc to close search panel
to navigate between search results
d to clear all current filters
Enter to expand content preview
icon_star
Doc Search
icon_star
Code Search Beta
SORRY NOTHING FOUND!
mic
Start speaking...
Voice search is only supported in Safari and Chrome.
Navigate to

Pandas DataFrame | where method

schedule Aug 12, 2023
Last updated
local_offer
PythonPandas
Tags
mode_heat
Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!

Pandas DataFrame.where(~) uses a boolean mask to selectively replace values in the source DataFrame.

Parameters

1. cond | boolean or array-like or callable | optional

A boolean mask, which is an array-like structure (e.g. Series and DataFrame) that contains either True or False as its entries.

  • If an entry is True, then the corresponding value in the source DataFrame will be left as is.

  • If an entry if False, then the corresponding value in the source DataFrame will be replaced by that in other.

If a callable is passed, then the function takes as argument a DataFrame and returns a DataFrame of booleans. This callable must not modify the source DataFrame.

2. other | scalar or Series or DataFrame or function | optional

The values to replace the entries that have True in the cond.

If a callable is passed. then the function takes in as argument the value to be replaced and returns a new scalar, Series or DataFrame that will be the replacer. Once again, this callable must not modify the source DataFrame.

3. inplace | boolean | optional

Whether or not to perform the method inplace. Methods that are inplace means that they will directly modify the source DataFrame without creating and returning a new DataFrame. By default, inplace=False.

4. axis | int | optional

The axis along which to perform the method. By default, axis=None.

5. level | int | optional

The levels on which to perform the method. This is only relevant if your source DataFrame is a multi-index.

6. errors | string | optional

Whether to raise or suppress errors:

Value

Description

"raise"

Allow for errors to be raised.

"ignore"

When error occurs, return the source DataFrame.

By default, errors="raise".

7. try_cast | boolean | optional

Whether or not to cast the resulting DataFrame into the source DataFrame's type. By default, try_cast=False.

Return Value

A DataFrame.

Examples

Basic usage

Consider the following DataFrame:

df = pd.DataFrame({"A":[3,4],"B":[5,6]})
df
A B
0 3 5
1 4 6

Suppose we have the following DataFrame that acts as the boolean mask:

df_mask = pd.DataFrame({"A":[True,False],"B":[False,True]})
df_mask
A B
0 True False
1 False True

We then call where(~) to selectively replace values in df that where the corresponding entry in df_mask is False:

df.where(df_mask, 10)
A B
0 3 10
1 10 6

Passing in a callable for cond

Consider the same DataFrame as before:

df = pd.DataFrame({"A":[3,4],"B":[5,6]})
df
A B
0 3 5
1 4 6

Instead of specifying an array-like mask as the first parameter, we can also pass in a function like so:

def foo(my_df):
return my_df > 4

df.where(foo, 10)
A B
0 10 5
1 10 6

Here, the function foo takes in as argument the entire DataFrame, and returns a DataFrame of booleans. Again, a boolean of True would mean that the corresponding values will be kept intact, while replacement is carried out for False.

Note that the previous code snippet can be written compactly using lambdas:

df.where(lambda x : x > 4, 10)
A B
0 10 5
1 10 6

Passing in callable for other

Consider the same DataFrame as before:

df = pd.DataFrame({"A":[3,4],"B":[5,6]})
df
A B
0 3 5
1 4 6

Suppose we have a mask like follows:

my_mask = [[True,False],[True,False]]
my_mask
[[True, False],
[True, False]]

Let us pass a callable for the other parameter:

df.where(my_mask, lambda x : x + 10)
A B
0 3 15
1 4 16

The callable takes in as argument the value to be replaced, and returns the new replacer.

robocat
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
thumb_up
thumb_down
Comment
Citation
Ask a question or leave a feedback...
thumb_up
0
thumb_down
0
chat_bubble_outline
0
settings
Enjoy our search
Hit / to insta-search docs and recipes!