df = pd.DataFrame({"A":[1,4,6], "B":[3,8,8]})
df
                
            
               A  B
0  1  3
1  4  8
2  6  8

To get the top 2 rows with the highest value for column A:


        
        
            
                
                
                    df.nlargest(2, "A")
                
            
               A  B
2  6  8
1  4  8

Notice how the returned rows are sorted in descending order of the values in A.

Dealing with duplicate values

Consider the same df as above:

Keeping only the first

By default, keep="first", which means that the first occurrence of the row with the largest column value is returned:


        
        
            
                
                
                    df.nlargest(1, "B")   # keep="first"
                
            
               A  B
1  4  8

Notice how row 1 was returned, as opposed to row 2, despite the fact that they both had the same value (8) for column B.

Keeping only the last

To get the last occurrence instead, set keep="last":


        
        
            
                
                
                    df.nlargest(1, "B", keep="last")
                
            
               A  B
2  6  8

Keeping all occurrences

To keep both the occurrences, set keep="all":


        
        
            
                
                
                    df.nlargest(1, "B", keep="all")
                
            
               A  B
1  6  8
2  4  8

Notice how despite the fact that we set n=1, we end up with 2 rows.

Published by Isshin Inada

Edited by 0 others

Did you find this page useful?

thumb_up

thumb_down

Comment

Citation

Ask a question or leave a feedback...

Official Pandas Documentation

https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.nlargest.html

thumb_up

thumb_down

chat_bubble_outline

settings

Enjoy our search

Hit / to insta-search docs and recipes!