RABBDA (Reduce Access Barriers to Big Data Analytics) was created by the Centre of Parallel Computing at the University of Westminster.
The project's objective is to give students and practitioners access to Big Data technologies and learning material.
To that end, the Centre of Parallel Computing is developing a Big Data environment based on Hadoop services that students and researchers can access.
Additionally, the Centre provides the learning material needed to support the learning and development process, including tutorials on Hadoop services and demo applications.
The services and learning material used are open source, so that anyone can learn and understand how to build their own Big Data applications.
RABBDA is the first attempt to merge a Science Gateway with a KREL (knowledge repository and learning environment), called SMARTEST, to facilitate the comprehension of the various aspects of a portal, in this case a Big Data portal.
For more information, please review RABBDA here.
The University portal is a proof-of-concept application that demonstrates how Big Data technologies can be used to create complex Big Data solutions. Additionally, by implementing various releases, we show how a project evolves through multiple iterations.
The application exports relational data from an RDBMS and analyses it with Big Data technologies. More specifically, it analyses student data to answer complex research questions related to student performance; a sketch of the export-and-analyse step follows.
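As a minimal sketch of that export step, assuming Spark's JDBC data source is used (the connection URL, table, columns, and credentials below are hypothetical placeholders, and the real portal may use a different export tool entirely):

```python
from pyspark.sql import SparkSession

# Minimal sketch: read a relational table into Spark over JDBC and analyse it there.
# Requires the matching JDBC driver jar on the Spark classpath.
spark = SparkSession.builder.appName("StudentsExport").getOrCreate()

students = (spark.read.format("jdbc")
            .option("url", "jdbc:mysql://db-host:3306/university")  # placeholder URL
            .option("dbtable", "students")                          # placeholder table
            .option("user", "reader")
            .option("password", "secret")
            .load())

# Example analysis: average mark per course, computed in Spark rather than the RDBMS.
students.groupBy("course").avg("mark").show()
```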
The Earthquakes portal is a proof-of-concept application that demonstrates how Big Data technologies can be used to create complex Big Data solutions. Additionally, by implementing various releases, we show how a project evolves through multiple iterations.
This demonstration uses earthquake data sourced from the USGS (science for a changing world).
More specifically, static data for cities and seismograph stations is associated with earthquake data acquired from the REST API. This process produces information such as the cities and seismographic stations closest to each earthquake, together with links to seismographs; a sketch of the association step follows.
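A minimal sketch of that association step, using the public USGS FDSN event endpoint; the short city list, date, and magnitude threshold are illustrative placeholders for the portal's full static datasets:

```python
import math
import requests

# Tiny illustrative city list (lat, lon); the portal uses full static datasets
# for cities and seismograph stations.
CITIES = {"Los Angeles": (34.05, -118.24), "Tokyo": (35.68, 139.69), "Athens": (37.98, 23.73)}

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two points in kilometres."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp, dl = math.radians(lat2 - lat1), math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * 6371.0 * math.asin(math.sqrt(a))

# Query the USGS event service (GeoJSON) for recent significant earthquakes.
resp = requests.get(
    "https://earthquake.usgs.gov/fdsnws/event/1/query",
    params={"format": "geojson", "starttime": "2024-01-01", "minmagnitude": 6},
    timeout=30,
)
for feature in resp.json()["features"]:
    lon, lat, _depth = feature["geometry"]["coordinates"]
    nearest = min(CITIES, key=lambda c: haversine_km(lat, lon, *CITIES[c]))
    dist = haversine_km(lat, lon, *CITIES[nearest])
    print(f'{feature["properties"]["place"]}: closest listed city is {nearest} ({dist:.0f} km)')
```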
- Real-time streaming of Twitter data to HDFS with Apache Kafka, review the solution here (see the producer sketch after this list).
- An Extract-Transform-Load (ETL) pipeline with Apache Hive, review the solution here (see the ETL sketch after this list).
- Real-time data ingestion to HDFS with Apache Flume, review the solution here.
- Spark and MapReduce jobs with Python, review the examples here (see the word-count sketch after this list).
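For the Kafka item, here is a minimal producer sketch, assuming the kafka-python package; the broker address and the "tweets" topic name are illustrative, and the linked solution may wire the pipeline differently:

```python
import json
from kafka import KafkaProducer  # kafka-python package

# Minimal sketch: publish tweet-like JSON records to a Kafka topic.
# Broker address and topic name are illustrative placeholders.
producer = KafkaProducer(
    bootstrap_servers="localhost:9092",
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
)

tweet = {"user": "example_user", "text": "Hello Big Data!", "ts": "2024-01-01T00:00:00Z"}
producer.send("tweets", tweet)
producer.flush()  # block until the record is actually delivered
```

On the HDFS side, a consumer (for example Kafka Connect's HDFS sink) would then write the topic out to files.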
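For the Hive ETL item, a minimal sketch using the PyHive package against HiveServer2; the host, database, tables, and schema are hypothetical placeholders rather than the linked solution's actual pipeline:

```python
from pyhive import hive  # PyHive package

# Connect to HiveServer2; host, port, and database are placeholders.
conn = hive.Connection(host="localhost", port=10000, database="default")
cur = conn.cursor()

# Extract/Load: expose raw CSV files already in HDFS as an external table.
cur.execute("""
    CREATE EXTERNAL TABLE IF NOT EXISTS raw_events (id INT, category STRING, amount DOUBLE)
    ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
    LOCATION '/data/raw_events'
""")

# Transform: aggregate the raw data into a managed table ready for analysis.
cur.execute("""
    CREATE TABLE IF NOT EXISTS events_by_category AS
    SELECT category, SUM(amount) AS total FROM raw_events GROUP BY category
""")
```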
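For the Spark and MapReduce item, the classic word count serves as a minimal Spark-with-Python sketch; the HDFS paths are placeholders:

```python
from pyspark.sql import SparkSession

# Classic word count expressed as Spark RDD transformations.
spark = SparkSession.builder.appName("WordCount").getOrCreate()

counts = (spark.sparkContext.textFile("hdfs:///data/input.txt")   # placeholder path
          .flatMap(lambda line: line.split())                     # line -> words
          .map(lambda word: (word, 1))                            # word -> (word, 1)
          .reduceByKey(lambda a, b: a + b))                       # sum counts per word

counts.saveAsTextFile("hdfs:///data/word_counts")                 # placeholder path
```

Submit it with spark-submit on a cluster with HDFS access.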