Data cleaning, pre-processing, and analytics on a million movies using Spark and Scala.
Updated May 19, 2021 - Scala
Various stream and batch processing demos with Apache Spark in Scala 🚀
I have implemented sample programs using Apache Spark. The programs are built on the concepts of Spark RDD and Spark SQL DataFrame.
Demonstration of basic data transformations using Spark RDD and Spark DataFrame in Scala
This program processes legal reports with Stanford CoreNLP and indexes them in Elasticsearch.