Customer-360-Pipeline

🦖PROBLEM :-->

==> The Customer 360 team needs a fault-tolerant way to consolidate and keep all customer information up to date:

• Closed orders must be processed against the cluster. The "ORDERS" data is delivered by a third party between 5 PM and 6 PM: Orders_file.csv -> S3 bucket.
• Customer-related information ("CUSTOMERS-INFORMATION") lives in a relational database: the CRM team keeps all customer records in MySQL (or an Oracle DB).
• Sources: S3 ("Orders") and MySQL ("Customer_Info").
• Filter the orders down to the "CLOSED" ones.
• Load both datasets into Hive.
• Load the data into HBase and send a notification.

🐝SOLUTION :-->

The S3 files are reachable over an HTTPS connection. In Airflow, an HTTP Sensor connection is configured with a name, host, port, username, password, and schema; the sensor waits for the orders file to appear (a sketch follows). Once it does, Airflow SSHes into the edge node and runs the steps below.
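
A minimal sketch of that sensor, assuming the apache-airflow-providers-http package, an Airflow HTTP connection named s3_conn, and a hypothetical object key orders/Orders_file.csv (all names are illustrative, and the task is assumed to sit inside a DAG definition):

```python
from airflow.providers.http.sensors.http import HttpSensor

# Poke the S3 HTTPS endpoint until Orders_file.csv appears (delivered 5-6 PM).
# "s3_conn" is an Airflow connection holding host/port/login/password/schema;
# the endpoint path is hypothetical.
is_orders_file_available = HttpSensor(
    task_id="is_orders_file_available",
    http_conn_id="s3_conn",
    endpoint="orders/Orders_file.csv",
    response_check=lambda response: response.status_code == 200,
    poke_interval=60,    # re-check every minute
    timeout=60 * 60,     # give up after the one-hour delivery window
)
```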

  1. Download the files from S3 onto the local filesystem of the edge node (steps 1 and 3 are sketched together after this list).
  2. Sqoop fetches Customers_Info from MySQL and dumps it into Hive (sketch below).
  3. Upload the S3 orders file to an HDFS location.
  4. Submit the Spark job that filters the orders down to CLOSED status (sketch below).
  5. Create a Hive table over the output path produced in step 4.
  6. Load it into HBase via the Hive-HBase connector (steps 5 and 6 are sketched together below).
  7. Post the pipeline's success or failure to a Slack channel (sketch below).
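
Steps 1 and 3 can be wired up with Airflow's SSHOperator against the edge node; a sketch assuming an SSH connection named edge_node_conn and an illustrative bucket URL and HDFS path:

```python
from airflow.providers.ssh.operators.ssh import SSHOperator

# Step 1: pull Orders_file.csv from S3 onto the edge node over HTTPS.
# Connection id, URL, and paths are illustrative.
download_orders = SSHOperator(
    task_id="download_orders",
    ssh_conn_id="edge_node_conn",
    command="curl -fSs -o /tmp/Orders_file.csv "
            "https://example-bucket.s3.amazonaws.com/orders/Orders_file.csv",
)

# Step 3: push the downloaded file into HDFS for the Spark job to read.
upload_orders_to_hdfs = SSHOperator(
    task_id="upload_orders_to_hdfs",
    ssh_conn_id="edge_node_conn",
    command="hdfs dfs -mkdir -p /user/airflow/orders && "
            "hdfs dfs -put -f /tmp/Orders_file.csv /user/airflow/orders/",
)
```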
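
Step 2 boils down to a Sqoop import with --hive-import; a sketch run over the same SSH connection, with the JDBC URL, credentials, and table names all assumed:

```python
from airflow.providers.ssh.operators.ssh import SSHOperator

# Import the customer table from MySQL straight into a Hive table.
# Host, database, credentials, and table names are illustrative.
sqoop_customers_to_hive = SSHOperator(
    task_id="sqoop_customers_to_hive",
    ssh_conn_id="edge_node_conn",
    command=(
        "sqoop import "
        "--connect jdbc:mysql://mysql-host:3306/crm "
        "--username crm_user --password-file /user/airflow/.mysql_pass "
        "--table customers_info "
        "--hive-import --hive-table customer360.customers_info "
        "-m 1"
    ),
)
```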
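
The Spark job in step 4 is essentially a filter on order status; a minimal PySpark sketch, with the HDFS paths and the status column name (order_status) assumed:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("Customer360ClosedOrders").getOrCreate()

# Read the raw orders uploaded to HDFS in step 3 (path is illustrative).
orders = spark.read.csv("hdfs:///user/airflow/orders/Orders_file.csv",
                        header=True, inferSchema=True)

# Keep only CLOSED orders and write them to the path step 5's table will cover.
closed_orders = orders.filter(orders.order_status == "CLOSED")
closed_orders.write.mode("overwrite").csv("hdfs:///user/airflow/closed_orders")
```

In the DAG this would typically be launched with spark-submit from the edge node, e.g. through another SSHOperator.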
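
Steps 5 and 6 can be expressed as Hive DDL: an external table over the Spark output, then a table backed by the Hive-HBase storage handler. A sketch using Airflow's HiveOperator, with the schema and all table/column names assumed:

```python
from airflow.providers.apache.hive.operators.hive import HiveOperator

# External table over the Spark output, then an HBase-backed table kept in
# sync by the Hive-HBase connector. Schema and names are illustrative.
create_hive_and_hbase_tables = HiveOperator(
    task_id="create_hive_and_hbase_tables",
    hql="""
        CREATE EXTERNAL TABLE IF NOT EXISTS customer360.closed_orders (
            order_id STRING, customer_id STRING, order_status STRING)
        ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
        LOCATION 'hdfs:///user/airflow/closed_orders';

        CREATE TABLE IF NOT EXISTS customer360.closed_orders_hbase (
            order_id STRING, customer_id STRING, order_status STRING)
        STORED BY 'org.apache.hadoop.hive.hbase.HBaseStorageHandler'
        WITH SERDEPROPERTIES (
            'hbase.columns.mapping' = ':key,o:customer_id,o:order_status')
        TBLPROPERTIES ('hbase.table.name' = 'closed_orders');

        INSERT OVERWRITE TABLE customer360.closed_orders_hbase
        SELECT order_id, customer_id, order_status
        FROM customer360.closed_orders;
    """,
)
```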
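
Step 7 maps to Airflow's Slack integration. A sketch using SlackWebhookOperator (parameter names vary across provider versions), with an assumed incoming-webhook connection named slack_conn and a trigger rule so the failure task fires if any upstream task fails:

```python
from airflow.providers.slack.operators.slack_webhook import SlackWebhookOperator
from airflow.utils.trigger_rule import TriggerRule

# Posts to the team channel via the "slack_conn" webhook connection.
notify_success = SlackWebhookOperator(
    task_id="notify_success",
    slack_webhook_conn_id="slack_conn",
    message=":white_check_mark: Customer 360 pipeline succeeded.",
)
notify_failure = SlackWebhookOperator(
    task_id="notify_failure",
    slack_webhook_conn_id="slack_conn",
    message=":x: Customer 360 pipeline failed.",
    trigger_rule=TriggerRule.ONE_FAILED,  # fire if any upstream task failed
)
```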
