DNL_Backend_Challenge


This project involves scraping data from a website using both Beautiful Soup and Scrapy, storing the scraped data in a SQLite database, and providing an API built with FastAPI to access and query this data.

Table of Contents

  • Getting Started
  • Components
  • API Usage
  • Structure
  • Logger
  • Contributing

Getting Started

Setup

  • Ensure Docker and Docker Compose are installed on your machine.
  • Clone the repository to your local system:
    git clone https://github.com/ajits-github/DNL_Backend_Challenge.git

Configuration

Configuration values, such as the path for the SQLite database, are maintained in config.yml in the project repository. Check these values and modify them if necessary before running the services.
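For reference, the services can load these values with PyYAML. A minimal sketch, assuming a hypothetical database_path key (check config.yml for the actual key names):

    import yaml

    # Load the project configuration. "database_path" is a hypothetical
    # key used for illustration -- see config.yml for the real names.
    with open("config.yml") as f:
        config = yaml.safe_load(f)

    db_path = config["database_path"]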

Running the Services

  • From the project's root directory, start the services with:

    docker-compose up --build

    The scraper services run first, scraping the required data and populating the SQLite database. Once they have finished, the API service becomes accessible.

Accessing the API

  • With the services running, access the FastAPI Swagger UI at http://127.0.0.1:8000/docs.
  • Here, you can test and interact with the available API endpoints.

Components

Database

  • SQLite, a file-based database system, serves as the project's database solution. This eliminates the need for separate database services. The SQLite database file is created and populated when the scraper runs.
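To verify what the scrapers produced, the file can be inspected with Python's built-in sqlite3 module. A minimal sketch, assuming a hypothetical file path (the real one is configured in config.yml):

    import sqlite3

    # List the tables in the generated database. The path below is an
    # assumption -- use the one set in config.yml.
    conn = sqlite3.connect("database/parts.db")
    tables = conn.execute(
        "SELECT name FROM sqlite_master WHERE type='table'"
    ).fetchall()
    print(tables)
    conn.close()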

Scrapers

Beautiful Soup Scraper
  • Located in the scraper/ directory, this scraper executes once upon initiation. It fetches the required data using Beautiful Soup and stores it in the SQLite database (see the sketch below).
  • To run it locally and create the database file, so that it can be mounted as a volume in docker-compose, execute the following command in a terminal:
    python ./scraper/main.py
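The scraper follows a fetch-parse-store pattern. A minimal sketch of that pattern, where the URL, CSS selector, and table schema are illustrative assumptions rather than the repository's actual values:

    import sqlite3

    import requests
    from bs4 import BeautifulSoup

    # Fetch and parse the page; URL and selector are hypothetical.
    resp = requests.get("https://example.com/parts")
    soup = BeautifulSoup(resp.text, "html.parser")
    rows = [(tag.get_text(strip=True),) for tag in soup.select(".part-name")]

    # Store the results in SQLite; the schema is illustrative only.
    conn = sqlite3.connect("parts.db")
    conn.execute("CREATE TABLE IF NOT EXISTS parts (name TEXT)")
    conn.executemany("INSERT INTO parts (name) VALUES (?)", rows)
    conn.commit()
    conn.close()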
Scrapy Spider
  • Found within the scraper_spider/ directory, this scraper uses Scrapy to fetch the necessary data and stores it in the SQLite database (a sketch follows).
  • To run it locally and create the database:
    python ./scraper_spider/main.py
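A minimal sketch of a Scrapy spider following the same idea; the URL and selector are illustrative assumptions, and in the real project the yielded items are persisted to SQLite (for example via an item pipeline):

    import scrapy

    class PartsSpider(scrapy.Spider):
        name = "parts"
        start_urls = ["https://example.com/parts"]  # hypothetical URL

        def parse(self, response):
            # Yield one item per matched element; the selector is a
            # placeholder, not the one used in scraper_spider/.
            for name in response.css(".part-name::text").getall():
                yield {"name": name}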

API

  • Hosted in the api/ directory, the API taps into the populated SQLite database to deliver data through its endpoints. The FastAPI Swagger UI allows direct interaction and testing of the API.
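A minimal sketch of how such an endpoint can be wired to SQLite. The table name, column name, and database path here are assumptions for illustration, not necessarily what api/ uses:

    import sqlite3
    from typing import Optional

    from fastapi import FastAPI

    app = FastAPI()

    @app.get("/parts")
    def read_parts(manufacturer: Optional[str] = None):
        # Path and schema are assumed; the real path comes from config.yml.
        conn = sqlite3.connect("database/parts.db")
        conn.row_factory = sqlite3.Row
        query = "SELECT * FROM parts"
        params = ()
        if manufacturer is not None:
            query += " WHERE manufacturer = ?"
            params = (manufacturer,)
        rows = conn.execute(query, params).fetchall()
        conn.close()
        return [dict(row) for row in rows]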

API Usage

Fetching the Parts_Data Table

  • Endpoint: /parts
  • Refine results using query parameters, e.g., ?manufacturer=Ammann.
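The same query can be issued programmatically; a short sketch with the requests library, reusing the manufacturer example above:

    import requests

    # Query the running API for parts from a specific manufacturer.
    resp = requests.get(
        "http://127.0.0.1:8000/parts",
        params={"manufacturer": "Ammann"},
    )
    print(resp.json())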

Swagger UI

  • Test the API endpoints by accessing the FastAPI Swagger UI at http://127.0.0.1:8000/docs.

Structure

  • scraper/: Houses the Beautiful Soup scraping logic.
  • scraper_spider/: Contains the Scrapy logic responsible for web scraping.
  • api/: Contains the FastAPI server and API logic.
  • database/: Manages database operations and holds the SQLite file.
  • docker/: Keeps the Dockerfile and relevant configurations for containerization.

Logger

Logging is integrated into the application, helping in tracking and debugging activities. You can modify the logging level and format in the Scrapy settings to filter the type of information captured and displayed. This can be especially helpful in identifying issues or optimizing scraper performance.
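For example, Scrapy's standard LOG_LEVEL and LOG_FORMAT settings control what gets logged and how it is rendered; the values below are only examples:

    # In the Scrapy project's settings.py. LOG_LEVEL and LOG_FORMAT are
    # standard Scrapy settings; the values shown are illustrative.
    LOG_LEVEL = "INFO"
    LOG_FORMAT = "%(asctime)s [%(name)s] %(levelname)s: %(message)s"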


Contributing

To contribute to this project, please fork the repository and submit a pull request.
