Introduction
  • Intro to Web Scraping & Scrapy
  • Setting up the Scrapy Development Environment (Updated)
  • Add VS Code to PATH (Mac users)
  • Udemy 101 (Please don't skip*)
  • Asking questions
Scrapy Fundamentals
  • Scrapy fundamentals PART 1
  • Scrapy fundamentals PART 2
  • Scrapy fundamentals PART 3
  • Scrapy fundamentals PART 4
  • Scrapy fundamentals PART 5
XPath expressions & CSS Selectors
  • Downloadable files
  • XPath & CSS Selectors
  • CSS Selectors fundamentals
  • CSS selectors in theory
  • XPath fundamentals
  • Navigating using XPath (Going UP)
  • Navigating using XPath (Going DOWN)
  • XPath in theory
Project 1 Spiders from A to Z
  • Worldometers PART 1
  • Worldometers PART 2
  • Worldometers PART 3
  • Worldometers PART 4
  • Project source code
  • Exercise
Building Datasets
  • Building datasets
Project 2 Dealing with Multiple pages
  • Website URL (Please do not skip)
  • Setting up the project
  • Setting up the project (Code update)
  • Building the spider
  • Dealing with pagination
  • Spoofing request headers
  • TinyDeal project source code
  • Exercise 2
Debugging spiders
  • What is debugging?
  • Debugging spiders PART 1
  • Debugging spiders PART 2
Let's take a break!
  • The "whys" & "whens" of web scraping
  • Web scraping challenges
Project 3 Build Crawlers using Scrapy
  • Crawl spider structure
  • The Rule object
  • Following links in pagination
  • Spoofing request headers
  • Project source code
  • Exercise
Splash crash course
  • The dilemma Splash came to solve
  • Setting up Splash (Windows Pro/Enterprise edition & macOS)
  • Setting up Splash (Windows Home Edition)
  • Setting up Splash (Linux)
  • Introduction to Splash
  • Working with elements
  • Spoofing request headers
Project 4 Scraping JavaScript websites using Splash
  • Website URL update
  • Splash incognito mode
  • Using Splash with Scrapy
  • Parsing (BAD HTML MARKUP)
  • Project source code
  • Exercise
Project 5 Scraping JavaScript websites using Selenium
  • Selenium basics
  • ElementNotInteractable Exception
  • Selenium with Scrapy
  • Selenium Middleware PART 1 (NEW)
  • Selenium Middleware PART 2 (NEW)
  • Project source code
Working with Pipelines
  • Pipelines
  • Storing data in MongoDB
  • Storing data in SQLite3
  • Project source code
Scraping APIs (NEW)
  • Scraping APIs PART 1
  • Scraping APIs PART 2
  • Scraping APIs PART 3
  • Scraping APIs PART 4
  • Scraping APIs PART 5
  • Project source code
Log in to websites (NEW)
  • Log in to websites PART 1
  • Log in to websites PART 2
  • Log in to websites PART 3 (JavaScript required)
  • Project source code
Project 6 Bypass Cloudflare
  • Website URL update
  • Bypass Cloudflare PART 1
  • Bypass Cloudflare PART 2
  • Project source code
APPENDIX (OLDER SCRAPY 1.5 CONTENT)
  • *IMPORTANT*
  • Avoid getting banned PART 1