Value	Description
`True`	Only numeric rows/columns are considered (e.g. `float`, `int`, `bool`).
`False`	Attempt the computation with all types (e.g. strings and dates), and raise an error whenever the variance cannot be computed.
`None`	Attempt the computation with all types, and silently skip any rows/columns whose variance cannot be computed instead of raising an error.

Note that the variance can only be computed when the required arithmetic (for example, the + operator) is well-defined between the types involved.

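By default, numeric_only=None. This describes pandas versions before 2.0; from pandas 2.0 onwards the default is numeric_only=False, and columns whose variance cannot be computed are no longer silently dropped.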

Return Value

If the level parameter is specified, then a DataFrame will be returned. Otherwise, a Series will be returned.
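As a quick check of the return type when level is not specified, the sketch below builds a small DataFrame (the same values used in the examples that follow) and inspects what var() hands back. The variable name result is only illustrative.

```python
import pandas as pd

df = pd.DataFrame({"A": [3, 5, 7], "B": [2, 5, 8]})

# Without the level parameter, the result is a Series
# indexed by the column labels of the DataFrame.
result = df.var()
print(type(result).__name__)
print(list(result.index))
```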

Examples

Consider the following DataFrame:

import pandas as pd

df = pd.DataFrame({"A":[3,5,7], "B":[2,5,8]})
df

   A  B
0  3  2
1  5  5
2  7  8

Column-wise variance

To compute the variance for each column:

df.var()   # axis=0

A    4.0
B    9.0
dtype: float64
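To see where these numbers come from, the following sketch recomputes the column-wise variance by hand. It assumes the default ddof=1, that is, the sample variance, which divides the sum of squared deviations by n - 1. The names n and manual are illustrative only.

```python
import pandas as pd

df = pd.DataFrame({"A": [3, 5, 7], "B": [2, 5, 8]})

n = len(df)  # number of rows (3)

# Sum of squared deviations from each column's mean,
# divided by n - 1 (the default ddof=1 of var()).
manual = ((df - df.mean()) ** 2).sum() / (n - 1)

print(manual)
print(df.var())
```

For column A the deviations from the mean of 5 are -2, 0 and 2, so the variance is (4 + 0 + 4) / 2 = 4.0, which agrees with the result of df.var().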

Row-wise variance

To compute the variance for each row:

df.var(axis=1)

0    0.5
1    0.0
2    0.5
dtype: float64
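The row-wise result can be cross-checked against NumPy, as sketched below. One caveat: numpy.var defaults to ddof=0 (population variance), so ddof=1 must be passed explicitly to match the pandas default. The name row_var is illustrative.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({"A": [3, 5, 7], "B": [2, 5, 8]})

# axis=1 computes one variance per row; ddof=1 matches the pandas default.
row_var = np.var(df.to_numpy(), axis=1, ddof=1)

print(row_var.tolist())            # [0.5, 0.0, 0.5]
print(df.var(axis=1).tolist())     # [0.5, 0.0, 0.5]
```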

Specifying numeric_only

Consider the following DataFrame:

df = pd.DataFrame({"A":[3,5], "B":[True,5], "C":["x",7]})
df

   A     B  C
0  3  True  x
1  5     5  7

Here, columns B and C each hold values of mixed types.

None

By default, numeric_only=None, which means that rows/columns with mixed types are also considered, and any whose variance cannot be computed are silently omitted from the result:

df.var()   # numeric_only=None

A    2.0
B    8.0
dtype: float64

The variance is still computable for column B because True is treated as 1 in arithmetic. In contrast, the variance for column C cannot be computed since "x"+7 is undefined, so column C is left out of the result.
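The two cases can be reproduced in plain Python, without pandas, as a rough illustration. The names values, mean, variance and error_message are illustrative only.

```python
# A bool behaves like an integer in arithmetic: True counts as 1.
print(True + 5)  # 6

# Sample variance of column B's values, computed by hand (ddof=1).
values = [True, 5]
mean = sum(values) / len(values)                                   # (1 + 5) / 2 = 3.0
variance = sum((v - mean) ** 2 for v in values) / (len(values) - 1)
print(variance)  # 8.0

# Adding a string and an integer is undefined, so Python raises TypeError.
try:
    "x" + 7
except TypeError as exc:
    error_message = str(exc)
    print("TypeError:", error_message)
```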

False

numeric_only=False means that the rows/columns of mixed type will also be considered, but an error will be raised if the variance is not computable:

df.var(numeric_only=False)

TypeError: could not convert string to float: 'x'
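If the error should not stop the program, the call can be wrapped in a try/except block. This is a minimal sketch: the exact exception message may differ between pandas versions, and ValueError is caught alongside TypeError only as a precaution. The name outcome is illustrative.

```python
import pandas as pd

df = pd.DataFrame({"A": [3, 5], "B": [True, 5], "C": ["x", 7]})

try:
    result = df.var(numeric_only=False)
    outcome = "computed"
except (TypeError, ValueError) as exc:
    # Column C contains "x", which cannot take part in the arithmetic.
    outcome = type(exc).__name__
    print(f"{outcome}: {exc}")
```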

True

To compute the variance of numeric rows/columns only:

df.var(numeric_only=True)

A    2.0
dtype: float64
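A similar result can be obtained by selecting the numeric columns explicitly before calling var(), which may be handy when the filtering step should be visible in the code. This sketch uses select_dtypes; the names numeric_part and result are illustrative.

```python
import pandas as pd

df = pd.DataFrame({"A": [3, 5], "B": [True, 5], "C": ["x", 7]})

# Columns B and C have dtype object because they hold mixed types,
# so only column A counts as numeric.
numeric_part = df.select_dtypes(include="number")
result = numeric_part.var()

print(list(numeric_part.columns))  # ['A']
print(result)
```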

Published by Isshin Inada

Official Pandas Documentation

https://pandas.pydata.org/pandas-docs/dev/reference/api/pandas.DataFrame.var.html
