Skip to content

abhi777/Cloud-based-Web-Scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

39 Commits
 
 
 
 
 
 

Repository files navigation

Scrapify v1.0.10

This is a Web Application developed using Python which can scrap Data from the web and save the extracted Data in Google Cloud Storage. It's also very convenient to use, even for a Non-Technical Person as it provides a really nice UI.

To check out the live version of my project, Click Here...

image

First, We need to # by clicking on the # section present in the side bar and Login using the newly created credentials.

image image

It needs various inputs like:

  • Name of the GCS Bucket
  • A text file containing a list of links of webpages line by line
  • A json file containing the credentials of the GCS(Google Cloud Storage)
  • Types of Data to be scraped

Executing the script after providing the inputs would start saving the required data into the GCS Bucket.

image

image

image