The project started as worked examples from the book "Hadoop in Action" (hence the name). Since then it has morphed into a container for multiple projects related to Hadoop and its ecosystem.
A selection of solutions to problems from the "Hadoop in Action" book.
An automated newspaper clipping service using Cascading.
Cascading implementation of the Degrees of Separation from Kevin Bacon problem.
Scalding version of Parts 1-5 of Paco Nathan's "Cascading for the Impatient".
Data analysis with Scalding on Medicare Claims data from cms.gov, for example to find code outliers.
Data analysis with Apache Spark to compute a disease graph for diseases identified as comorbidities among members. Uses Medicare Claims data from cms.gov.
Pig script examples from "Hadoop in Action".
Associated Pig UDFs from "Hadoop in Action".
Hive script examples from "Hadoop in Action".