A WEALTH OF DATA
Web scraping in R
Training description
The training covers a broad range of topics on acquiring data from Internet pages using R.
Duration: 2 days 8 hours each (including an hour lunch break)
Requirements: knowledge of R programming language at an intermediate level which can be acquired during our training “Introduction to R.”
Training agenda
Part one: Let’s get it started
- Introduction to HTML (tags, ids, classes)
- Using Chrome DevTools
- Identifying objects HTML/XML with rvest package
- Data extraction from scripts HTML/XML with rvest
- Iterative browsing HTML/XML scripts with purrr package
Part two: The more you get into it…
- Browsing web pages with RSelenium package
- Extracting nested data
Part three: Squeezing the lemon
- Browsing dynamic web pages with splashR
- Extracting dynamic data
Contact us about closed training