chevron_left
Documentation
Method findMethod find_parentMethod find_parentsMethod insertMethod insert_afterMethod insert_beforeMethod new_tagMethod prettifyMethod extendMethod appendMethod has_attrMethod find_allMethod select_oneMethod selectMethod unwrapMethod wrapMethod replace_withProperty previous_elementsProperty next_elementsProperty parentProperty previous_elementProperty next_siblingsProperty previous_siblingsProperty stringsProperty stripped_stringsProperty contentsProperty childrenProperty next_elementMethod extractMethod find_all_nextProperty previous_siblingMethod find_previous_siblingMethod find_next_siblingProperty next_siblingProperty stringMethod find_nextMethod clearProperty descendantsProperty parentsMethod get_textMethod decompose
0
0
0
new
Beautiful Soup | extract method
Programming
chevron_rightPython
chevron_rightBeautiful Soup
chevron_rightDocumentation
schedule Jul 1, 2022
Last updated Python●Beautiful Soup
Tags tocTable of Contents
expand_more In Beautiful Soup, the extract
method removes either:
an element and its string
a string
The method returns whatever it removed.
Examples
Consider the following HTML document:
my_html = """ <div> <p>Bob is great</p> </div>"""soup = BeautifulSoup(my_html)
Removing a tag and its content
To remove the p
tag as well as its inner content:
Here, we see <p>Bob is great</p>
as the output because the extract()
method returns whatever you've removed.
After calling extract()
on the p
tag, it is no longer part of our parse tree:
print(div_tag)
<div></div>
However, the catch is that, although the p
tag has been detached from our parse tree, it still exists in memory, and we can still access it:
print(p_tag)
<p>Bob is great</p>
If you want to completely remove the p_tag
, then call the decompose()
method instead of extract()
.
Removing a string
To remove the inner text of p
:
Published by Isshin Inada
Edited by 0 others
Did you find this page useful?
Ask a question or leave a feedback...
Official Beautiful Soup Documentation
https://www.crummy.com/software/BeautifulSoup/bs4/doc/#extract
0
0
0
Enjoy our search