Simply, The script scrapes all the jobs from all the pages (first to final available page) located on https://careers.google.com/jobs and return's the result as a JSON string, Then you will have a JSON file containing all scraped data.
- Download and Install Python 3
- Install requirements
pip install requests beautifulsoup4 selenium
- Download the latest release of Chrome Driver for your OS
- Extract chromedriver and move it to the same directory of
scrape_google.py
file - Finally, Run
scrape_google.py
python scrape_google.py
Note: For Windows users, Please check this video
{
"total": "total_count",
"jobs": [
{
"job_id": "id1",
"title": "title1",
"location": "location1",
"intro": "introduction1",
"resps": "responsibilities1",
"quals": "qualifications1"
},
{
"job_id": "id2",
"title": "title2",
"location": "location2",
"intro": "introduction2",
"resps": "responsibilities2",
"quals": "qualifications2"
},
...
]
}