Web Scraping in Python

FOR ADVANCED USERS

Web Scraping in Python

Training description

Data is an essential fuel for Data Science products. The more we have it, the greater the chance, that valuable patterns and signals are discovered. The internet is a great source to be used to obtain data.

“Web scraping w Python” is a training for everybody who wants to learn how to extract data directly from the internet.

Duration: 2 days 8 hours each (including an hour for breaks)

Requirements: knowledge of Python programming language at an intermediate level which can be acquired during our training “Introduction to Python”.

Training agenda

Part one: Let’s get it started

  • Ethical aspect
  • Introduction to HTML (tags, ids, classes)
  • Using Chrome DevTools
  • Identifying objects HTML/XML with BeautifulSoup
  • Data extraction from scripts HTML/XML with BeautifulSoup

Part two: The more you get into it…

  • Browsing dynamic web pages with Selenium
  • Extracting nested data

Part three: Squeezing the lemon – Scrapy framework

  • Architecture
  • How to create a crawler
  • How to extract data

Contact us about closed training

This website uses cookies to ensure you get the best experience on our website.
Ok, got it. More about cookies