NumPy | digitize method

Last updated: Aug 10, 2023


NumPy's digitize(~) method returns a NumPy array that holds, for each value in the input array, the index of the bin to which that value belongs. This is hard to explain in plain words, so please look at the examples for clarification.

Parameters

1. a | array-like

The array of values.

2. bins | array-like

The array of bin edges, which must be one-dimensional and monotonic, that is, sorted in either ascending or descending order.

3. right | boolean | optional

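Whether each bin includes its right edge. If False, then a value equal to a bin edge will be placed in the next bin (bins[i-1] <= value < bins[i] for ascending bins). If True, then it will be placed in the previous bin (bins[i-1] < value <= bins[i]). By default, right=False.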

Return value

A NumPy array of integer indices with the same shape as a.
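As a rough sketch of what the return value looks like, the snippet below passes a small 2-D array to digitize. The input values and bin edges are made up for illustration. The result has one index per input value, and every index lies between 0 and len(bins) inclusive.

```python
import numpy as np

# Illustrative 2-D input; these values and bin edges are assumptions for the demo
a = np.array([[3, 6.5], [9, 5]])
bins = [5, 6, 7, 8]

idx = np.digitize(a, bins)

print(idx.shape)   # (2, 2), the same shape as a
print(idx)         # [[0 2]
                   #  [4 1]]
```

An index of 0 means the value lies below the first bin edge, and an index of len(bins) means it lies at or above the last one.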

Examples

Basic usage

Consider the following code snippet:

import numpy as np

# Our array of values
a = [3, 6.5, 9]

# Our bins
bins = [5, 6, 7, 8]

np.digitize(a, bins)

array([0, 2, 4])

Let's understand the output here.

The first value 3 is smaller than 5 (the 1st bin edge), so the returned integer index is 0.
The second value 6.5 is between 6 and 7 (the 2nd and 3rd bin edges), so the returned integer index is 2.
The third value 9 is larger than 8 (the 4th bin edge), so the returned integer index is 4.

A nice way of wrapping your head around this is to think of the index of the value if it were to be inserted into the bins array. For instance, the value 3 will be inserted into index 0, so 0 is returned. 6.5 will be inserted into index 2, so 2 is returned, and so on.
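The insertion-index intuition can be checked against np.searchsorted, which returns exactly such insertion positions. The sketch below assumes ascending bins and the default right=False, in which case searchsorted with side="right" should give the same indices. Note that the argument order is swapped between the two functions.

```python
import numpy as np

a = [3, 6.5, 9]
bins = [5, 6, 7, 8]

# digitize takes (values, bins); right=False by default
via_digitize = np.digitize(a, bins)

# searchsorted takes (sorted array, values); side="right" matches right=False
via_searchsorted = np.searchsorted(bins, a, side="right")

print(via_digitize)       # [0 2 4]
print(via_searchsorted)   # [0 2 4]
```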

Handling the endpoints

By default, when checking which bin to place a value in, NumPy compares the value against each bin edge using the < comparison. For instance,

a = [5]
bins = [5, 6]
np.digitize(a, bins)   # same as right=False

array([1])

The reason we get an integer index of 1 is that the first comparison we perform is 5 < 5, which evaluates to False.

Instead of a < comparison, we can perform a <= comparison as follows:

a = [5]
bins = [5, 6]
np.digitize(a, bins, right=True)

array([0])

Here, we get an integer index of 0 because the first comparison 5 <= 5 evaluates to True.
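To see both endpoint behaviours side by side, the following sketch runs the same input through both settings. The input values are chosen for illustration so that two of them sit exactly on a bin edge. Only those two change index. The value strictly between the edges is unaffected.

```python
import numpy as np

# 5 and 6 sit exactly on the bin edges; 5.5 lies strictly between them
a = [5, 5.5, 6]
bins = [5, 6]

left_closed = np.digitize(a, bins)               # bins[i-1] <= value < bins[i]
right_closed = np.digitize(a, bins, right=True)  # bins[i-1] < value <= bins[i]

print(left_closed)    # [1 1 2]
print(right_closed)   # [0 1 1]
```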


Published by Isshin Inada
