- Project Overview
- Technologies Used
- Data Pipeline
- ETL Process
- Repository Structure
- How to Run
- Dashboard
- Acknowledgments
- Conclusion
- Contacts
This ETL (Extract, Transform, Load) project aims to extract human resources data, clean it using PL/SQL and SQL, integrate it into a Snowflake data warehouse on Azure Cloud using Informatica, and visualize the insights in Power BI.
- Oracle Database
- PL/SQL
- SQL
- Informatica
- Snowflake Data Warehouse
- Azure Cloud
- Power BI
Here is the HR data pipeline :
- Data Extraction: Human resource data was extracted from the csv file (HR.csv).
- Data Loading: The data was loaded into the Oracle Database.
- Data Cleaning: Extensive data cleaning and transformation were performed using PL/SQL and SQL scripts.
- Data Integration: Cleaned data was integrated into Snowflake Data Warehouse using Informatica.
- Visualization: Snowflake Data Warehouse was connected to Power BI for data visualization.
Human-Resources-ETL-Project
│ README.md
│
└── Main
│ data_cleaning.sql
│ HR_Dashboard.pbix
│
├── database for PowerBI
│ age_group.csv
│ age_group_gender.csv
│ avg_length_employment.csv
│ avg_tenure.csv
│ data_generation.sql
│ employee_change.csv
│ gender.csv
│ gender_department.csv
│ HR.csv
│ jobtitle.csv
│ location.csv
│ race.csv
│ state.csv
│ turnover_rate.csv
│
├── dataset
│ HR.csv
│
└── images
dashboard.png
hr_data_pipeline.png
To run this project, follow these steps:
- Set up an Oracle Database and Snowflake Data Warehouse.
- Execute
data_cleaning.sql
anddata_generation.sql
in the Oracle Database to perform data cleaning and generate additional data. - Configure Informatica to connect to your Oracle Database and Snowflake Data Warehouse.
- Use Informatica to create ETL workflows to load and transform the data.
- Import
HR_Dashboard.pbix
into Power BI to visualize the data.
Here is the HR Dashboard created in Power BI:
I would like to express our gratitude to the following resources for their valuable contributions to this project:
The official documentation of oracle database
.
The documentation provided by informatica
for data integration
The snowflake
documentation, which aided in loaded the data into data warehouse.
This project demonstrates the end-to-end ETL process for human resources data, from extraction and cleaning to integration and visualization. Feel free to explore the provided resources and code to gain insights from your HR data. :)
For any questions or inquiries related to this project, please feel free to contact me :)
- linkedin : Aymane Maghouti
- Email:
aymanemaghouti@gmail.com