Skip to content

Lucas382/SeleniumGridScrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Selenium grid scraper

Instructions to run:

  1. Install the dependencies with the pip install -r requirements.txt command.
  2. Create the selenium grid container with the docker compose up command.
  3. Run the book-scrapper.py file.

How it works?

This script's purpose is to scrape book data from https://books.toscrape.com using Selenium Grid to create concurrent sessions for scraping data from multiple pages and store it in a CSV file.

Note: The website being scraped is open for scraping, and Selenium is not necessary. However, it's worth noting that this script was created as a training exercise, hence the use of Selenium despite it not being necessary.

Releases

No releases published

Packages

No packages published

Languages