Hadoop-streaming with Python for DC Python
Using Python with Hadoop Streaming
##Pre-requisites
- Java
- Hadoop
##Setup:
- Run ./get_data.sh to download and unpack the DC Payment Card transaction data
- Modify HADOOP_HOME in ./mr_dc_payment_cards.sh
- Run ./mr_dc_payment_cards.sh