
Scala code to convert CSV files stored in Azure Blob Storage to Parquet and store them back in Azure Storage, using a Databricks notebook and an ARM template to run the notebook as an Azure Data Factory job


siddashok/CSV2Parquet-using-Scala-ADFJob


CSVtoParquet using Databricks (Scala), running the notebook as an Azure Data Factory job

Converting CSV files to Parquet

  • Azure Storage containers for the CSV files and the Parquet files are mounted in the notebook (a mount sketch follows this list)

  • Read the CSV files from the source using a defined schema

  • Convert the CSV files to Parquet format

  • Store the Parquet files in the destination folder

  • Parameters select which folder and subfolder of the container hold the CSV files to be converted (see the notebook sketch after this list)

  • The notebook can be run from Azure Data Factory.

  • Create an Azure Data Factory and, within it, create a linked service for Databricks

  • Add parameters to the pipeline

  • Add a Databricks Notebook activity to the pipeline and provide its details (the linked service and the parameters to pass to the notebook); see the activity JSON sketch after this list

  • The JSON file in this repo contains the ADF (ARM) template
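
A minimal sketch of how the source and destination containers might be mounted in the notebook. The storage account name, container names, secret scope/key, and mount points below are placeholder assumptions for illustration, not values taken from this repo.

```scala
// Sketch only: storage account, container names, secret scope/key and mount
// points are placeholder assumptions, not values from this repository.
val storageAccount = "mystorageaccount"
val accessKey = dbutils.secrets.get(scope = "storage-secrets", key = "storage-account-key")

def mountContainer(container: String, mountPoint: String): Unit = {
  // Mount the blob container only if it is not already mounted
  if (!dbutils.fs.mounts().exists(_.mountPoint == mountPoint)) {
    dbutils.fs.mount(
      source = s"wasbs://$container@$storageAccount.blob.core.windows.net/",
      mountPoint = mountPoint,
      extraConfigs = Map(
        s"fs.azure.account.key.$storageAccount.blob.core.windows.net" -> accessKey
      )
    )
  }
}

mountContainer("csv-source", "/mnt/csv-source")     // CSV files are read from here
mountContainer("parquet-dest", "/mnt/parquet-dest") // Parquet output is written here
```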
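The core conversion could look roughly like the sketch below: widget parameters choose the folder and subfolder, the CSV files are read with an explicit schema, and the result is written out as Parquet. The widget names, example schema, and mount paths are assumptions, not the repo's actual definitions.

```scala
import org.apache.spark.sql.types._

// Notebook (widget) parameters selecting which folder/subfolder of the source
// container to convert. The names "folder" and "subfolder" are assumptions.
dbutils.widgets.text("folder", "")
dbutils.widgets.text("subfolder", "")
val folder = dbutils.widgets.get("folder")
val subfolder = dbutils.widgets.get("subfolder")

// Example schema only; the real notebook defines a schema matching its CSV columns.
val schema = StructType(Seq(
  StructField("id", IntegerType, nullable = false),
  StructField("name", StringType, nullable = true),
  StructField("amount", DoubleType, nullable = true)
))

// Read the CSV files from the mounted source path using the defined schema
val df = spark.read
  .option("header", "true")
  .schema(schema)
  .csv(s"/mnt/csv-source/$folder/$subfolder")

// Convert to Parquet and store in the mounted destination folder
df.write
  .mode("overwrite")
  .parquet(s"/mnt/parquet-dest/$folder/$subfolder")
```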
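On the ADF side, the Databricks Notebook activity in the pipeline definition typically looks something like the sketch below: it references the Databricks linked service and passes the pipeline parameters through as notebook base parameters. The activity name, linked service name, and notebook path here are assumptions; the actual definitions are in the ARM template in this repo.

```json
{
  "name": "RunCsvToParquetNotebook",
  "type": "DatabricksNotebook",
  "linkedServiceName": {
    "referenceName": "AzureDatabricksLinkedService",
    "type": "LinkedServiceReference"
  },
  "typeProperties": {
    "notebookPath": "/Shared/CSVtoParquet",
    "baseParameters": {
      "folder": { "value": "@pipeline().parameters.folder", "type": "Expression" },
      "subfolder": { "value": "@pipeline().parameters.subfolder", "type": "Expression" }
    }
  }
}
```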
