This parser collects information about films (title, genre, director, country, year of release) from Wikipedia. The start page is ru.wikipedia.org/wiki/Category:Film_alphabitu. The collected data is saved to a CSV file.
Requirements:
- Python 3.10
- Scrapy
Clone the repository and navigate to the project directory:
git clone https://github.com/vlada-pv/Scrapy-Parser-Wiki-Movies.git
cd films_parser
To start the parser, run the following command from the project directory:
scrapy crawl films_spider
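The README does not show how the CSV output is configured. One common way to do this in Scrapy is the `FEEDS` setting in the project's `settings.py`. The fragment below is only a sketch under assumptions: the output path `results/films.csv` and the field names are hypothetical and may not match what this project actually uses.

```python
# settings.py (fragment): a hedged sketch, not the project's actual configuration.
# The output path and the field names below are assumptions for illustration.
FEEDS = {
    "results/films.csv": {
        "format": "csv",       # export scraped items as CSV
        "encoding": "utf8",    # keep Cyrillic titles readable
        "fields": ["title", "genre", "director", "country", "year"],  # column order
        "overwrite": True,     # replace the file on each run instead of appending
    },
}
```

With a configuration like this, `scrapy crawl films_spider` would write one row per scraped film, with columns in the order listed under `fields`.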