Using the BeautifulSoup Python library, this project scrapes information from books.toscrape.com, including book titles, prices, ratings, descriptions and more.
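As a rough illustration of the kind of extraction described above, here is a minimal sketch that parses one product card with BeautifulSoup. It is not the project's actual code: the inline `SAMPLE_HTML`, the `parse_books` function and the `RATING_WORDS` mapping are hypothetical, and the tag and class names are assumptions about the site's markup that may not match the live pages.

```python
from bs4 import BeautifulSoup

# Inline sample imitating a single product card (assumed markup, for illustration only).
SAMPLE_HTML = """
<article class="product_pod">
  <p class="star-rating Three"></p>
  <h3><a href="#" title="A Light in the Attic">A Light in the ...</a></h3>
  <p class="price_color">£51.77</p>
</article>
"""

# Hypothetical mapping from the rating word in the CSS class to a number.
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def parse_books(html):
    """Return a list of dicts with title, price and rating for each product card."""
    soup = BeautifulSoup(html, "html.parser")
    books = []
    for card in soup.find_all("article", class_="product_pod"):
        link = card.find("h3").find("a")
        # The "class" attribute is returned as a list, e.g. ["star-rating", "Three"].
        rating_classes = card.find("p", class_="star-rating")["class"]
        rating = next(
            (RATING_WORDS[c] for c in rating_classes if c in RATING_WORDS), None
        )
        books.append(
            {
                "title": link["title"],
                "price": card.find("p", class_="price_color").get_text(strip=True),
                "rating": rating,
            }
        )
    return books


if __name__ == "__main__":
    print(parse_books(SAMPLE_HTML))
```

Reading the title from the link's title attribute rather than its text avoids the truncated display title in this sample.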
When launched, the script saves all data locally in an "Outputs" folder, with each book category written to a separate CSV file. The scraping date is included in the folder name to make backups easier to manage. Finally, you can choose whether to download the corresponding image files for the books.
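The output layout described above could be sketched roughly as follows, using only the standard library. The function name `write_category_csvs`, the `scraping_<date>` folder name format and the column names are assumptions made for illustration; the project's real naming may differ.

```python
import csv
from datetime import date
from pathlib import Path


def write_category_csvs(books_by_category, root="Outputs", scrape_date=None):
    """Write one CSV per category into a dated folder under `root`.

    `books_by_category` maps a category name to a list of row dicts.
    """
    scrape_date = scrape_date or date.today()
    # Assumed folder name format: the README only says the date is included.
    folder = Path(root) / f"scraping_{scrape_date.isoformat()}"
    folder.mkdir(parents=True, exist_ok=True)

    written = []
    for category, rows in books_by_category.items():
        path = folder / f"{category}.csv"
        # newline="" lets the csv module control line endings itself.
        with path.open("w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=["title", "price", "rating"])
            writer.writeheader()
            writer.writerows(rows)
        written.append(path)
    return written
```

Putting the date in the folder name means a later run creates a new folder instead of overwriting an earlier one, which is what makes the dated folders usable as backups.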
- Python 3.6 or later
Once the code has been downloaded, go to the project directory and enter the following commands in a terminal:
python -m venv env
create a new virtual environment
env\Scripts\activate
activate the environment
pip install -r requirements.txt
install all the dependencies
python main.py
run the code
deactivate
when finished, deactivate the environment
Note
The commands above are for Windows. On macOS or Unix, the activation step differs (typically source env/bin/activate); see the official Python documentation for details.
Feel free to email me with any questions, comments, or suggestions.