Distributed Deep Learning with Apache Spark and Keras
This is my personal research on distributed LSTM computing with Apache Spark with Keras / Tensorflow or scala with Deep4j. The data is time series of Dow Jones stock market price.
Target: 30 stock symbols from Dow Jones. Time frame: Sep. 29, 2016 to Sep. 29 2017
- Long Short Tmer Memory (LSTM)
- Gated Recurrent Unit (GRU)
- Data split into 2 partions
- Partition by Symbols. (Every 15 symbols will be trained on each network)
- Partition by Time. (Every 6 months of data will be trained on each network)
Reference:
- https://db-blog.web.cern.ch/blog/joeri-hermans/2017-01-distributed-deep-learning-apache-spark-and-keras
- https://github.com/cerndb/dist-keras
- https://github.com/cerndb/dist-keras/tree/master/examples
- https://www.slideshare.net/emanueldinardo/distributed-implementation-of-a-lstm-on-spark-and-tensorflow-69787635
Stock price reference:
Scala related reference: