menu

login

Log in

Linear Algebra

Prob and Stats

Other math topics

Machine Learning

Dagster (NEW)

search

Search

Login

Unlock 100+ guides

menu

menu

search toc

close

Outline

Comments

Log in or sign up

Cancel

Post

account_circle

exit_to_app

Sign out

What does this mean?

Why is this true?

Give me some examples!

search

keyboard_voice

close

Searching Tips

Search for a recipe:
"Creating a table in MySQL"

Search for an API documentation: "@append"

Search for code: "!dataframe"

Apply a tag filter: "#python"

Useful Shortcuts

/ to open search panel

Esc to close search panel

↑↓ to navigate between search results

⌘d to clear all current filters

⌘Enter to expand content preview

icon_star

Doc Search

icon_star

Code Search Beta

SORRY NOTHING FOUND!

mic

Start speaking...

Voice search is only supported in Safari and Chrome.

fullscreen_exit

Shrink

Navigate to

Pandas

655 guides

keyboard_arrow_down

Linear Algebra

Prob and Stats

Machine Learning

Other math topics

chevron_leftData Aggregation Cookbook

Applying a function to multiple columns in groups Calculating percentiles of a DataFrame Calculating the percentage of each value in each group Computing descriptive statistics of each group Difference between a group's count and size Difference between methods apply and transform for groupby Getting cumulative sum of each group Getting descriptive statistics of DataFrame Getting multiple aggregates of a column after grouping Getting n rows with smallest column value in each group Getting number of distinct rows in each group Getting size of each group Getting specific group after groupby Getting the first row of each group Getting the last row of each group Getting the top n rows with largest column value in each group Getting unique values of each group Grouping by multiple columns Grouping without turning group column into index Merging rows within a group together Naming columns after aggregation Sorting values within groups

check_circle

Mark as learned

thumb_up

1

thumb_down

0

chat_bubble_outline

0

Comment

auto_stories Bi-column layout

settings

Calculating the percentage of each value in each group in Pandas

schedule Aug 12, 2023

Last updated

local_offer

Python●Pandas

Tags

tocTable of Contents

expand_more

Master the mathematics behind data science with 100+ top-tier guides
Start your free 7-days trial now!

Consider the following DataFrame:


        
        
            
                
                
                    df = pd.DataFrame({"A":[2,3,4],"B":[6,7,8],"group":["a","a","b"]})
df
                
            
               A  B  group
0  2  6    a
1  3  7    a
2  4  8    b

To compute the percentage of each value in each distinct group:


        
        
            
                
                
                    df.groupby("group").apply(lambda my_df: my_df / my_df.sum())
                
            
               A    B
0  0.4  0.461538
1  0.6  0.538462
2  1.0  1.000000

Note the following:

the function defined in apply(~) is called twice in this case - once for each group.
the argument (my_df) passed to this function is a DataFrame representing a single group.
the my_df.sum() returns a Series containing the sum of each column of my_df. In this case, for group a, my_df.sum() would evaluate to a Series holding values [5,13].
dividing my_df by this Series involves dividing values in column A by 5, and dividing values in column B by 13.
the return type of argument function is a DataFrame.

* * *

To compute the percentage of a specific column instead of all numeric columns:


        
        
            
                
                
                    df.groupby("group").apply(lambda my_df: my_df["A"] / my_df["A"].sum())
                
            
            group   
a      0    0.4
       1    0.6
b      2    1.0
Name: A, dtype: float64

Related

Pandas DataFrame | groupby method

Divides up your DataFrame into groups based on the specified criteria.

chevron_right

robocat

Published by Isshin Inada

Edited by 0 others

Did you find this page useful?

thumb_up

thumb_down

Comment

Citation

Ask a question or leave a feedback...

thumb_up

1

thumb_down

0

chat_bubble_outline

0

settings

Enjoy our search

Hit / to insta-search docs and recipes!

Navigation

Contact us

Resources

Python Pandas MySQL Beautiful Soup Matplotlib NumPy PySpark

Community

Join our Discord

Join our newsletter for updates on new comprehensive DS/ML guides

|