What is "beautifulsoup"

Beautiful Soup is a library for obtaining document data HTML and XML written in Python. Currently in its version 4, it presents a range of functionalities that facilitate the process of Web Scraping, technique consisting of retrieving data on web pages.

Functioning

According to the documentation itself (complete in English):

Beautiful Soup transforms a complex HTML document into a complex Python object tree. But you only need to worry about four types of objects: Tag, NavigableString, BeautifulSoup, andComment.

That is, using the library, programmers are free from all the work of reinventing the wheel or external concerns whether their Parser code will work or not, since it synthesizes all the complex information. Also, it’s faster than several other ways you can imagine.

When to use tag?

  • When the question is related to problems, doubts or possible errors strictly linked to Beautifulsoup.
  • When the code, if you have it, is written in Python and focused on Webscraping.

When not using tag?

  • Do not use the tag when the question relates to web requests, but not to reading the data from it.
  • Do not use this tag when it is related only to doubts about web data scraping.
  • Do not use when the question is not within the scope of the Python language.