Beautiful Soup Tag | contents property

Last updated: Aug 10, 2023


The Tag.contents property in Beautiful Soup returns a list of the tag's immediate children: both child elements (Tag objects) and text nodes (NavigableString objects).

Examples

Consider the following HTML document:

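from bs4 import BeautifulSoup

my_html = """
<div id="names">
<p>Alex</p>
<p>Bob</p>
<p>Cathy</p>
</div>
"""
# Name the parser explicitly so the result does not depend on which parsers are installed
soup = BeautifulSoup(my_html, "html.parser")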

To get all the direct child elements and text nodes as a list:

soup.find("div").contents

['\n', <p>Alex</p>, '\n', <p>Bob</p>, '\n', <p>Cathy</p>, '\n']

Here, the text nodes are "\n", the newline characters that separate the tags in the HTML source.
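To make the two node types concrete, here is a minimal sketch that splits the list returned by contents by type. It assumes the same HTML as above, parsed with "html.parser"; the variable names tags and texts are illustrative only.

```python
from bs4 import BeautifulSoup, NavigableString, Tag

my_html = """
<div id="names">
<p>Alex</p>
<p>Bob</p>
<p>Cathy</p>
</div>
"""
soup = BeautifulSoup(my_html, "html.parser")
contents = soup.find("div").contents

# Child elements are Tag objects; text nodes are NavigableString objects
tags = [node for node in contents if isinstance(node, Tag)]
texts = [node for node in contents if isinstance(node, NavigableString)]

print(len(contents))                      # 7
print([tag.get_text() for tag in tags])   # ['Alex', 'Bob', 'Cathy']
print(len(texts))                         # 4
```

Filtering on Tag in this way keeps only the direct child elements and leaves the whitespace-only text nodes out.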

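Most of the time, you want only the elements, without the text nodes. One way to get them is the find_all() method, which returns Tag objects only. Note that find_all() searches all descendants by default; pass recursive=False to restrict it to direct children.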

soup.find("div").find_all()

[<p>Alex</p>, <p>Bob</p>, <p>Cathy</p>]

Notice how text nodes are excluded.
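In the example above every element is a direct child of the div, so find_all() and contents cover the same elements. With nested markup the two differ, because find_all() descends into children by default. The following sketch uses a hypothetical nested_html document (not part of the original example) to show the difference and the effect of recursive=False.

```python
from bs4 import BeautifulSoup

# Hypothetical document with a tag nested inside the first <p>
nested_html = """
<div id="names">
<p>Alex <b>Smith</b></p>
<p>Bob</p>
</div>
"""
soup = BeautifulSoup(nested_html, "html.parser")
div = soup.find("div")

# Default: every descendant element, in document order
all_descendants = div.find_all()
# recursive=False: direct child elements only, like contents minus text nodes
direct_children = div.find_all(recursive=False)

print([tag.name for tag in all_descendants])   # ['p', 'b', 'p']
print([tag.name for tag in direct_children])   # ['p', 'p']
```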

Published by Isshin Inada


Official Beautiful Soup Documentation

https://www.crummy.com/software/BeautifulSoup/bs4/doc/#contents-and-children
