hadoop-filesystem
Here are 16 public repositories matching this topic...
Data Engineering Project with Hadoop HDFS and Kafka
-
Updated
Nov 4, 2023 - Python
This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.
-
Updated
Jun 10, 2018 - Python
Python wrapper to access Hadoop HDFS REST API
-
Updated
Oct 26, 2016 - Python
Data pipeline to process and analyse Twitter data in a distributed fashion using Apache Spark and Airflow in AWS environment
-
Updated
May 6, 2021 - Python
Ingestion pipeline to analyze soccer tweets
-
Updated
May 1, 2017 - Python
Category: Cloud Computing and Machine Learning Application - Subject: A cloud platform to make data processing with machine learning algorithms, built on Openstack, using Spark for data distribution and Hadoop Filesystem for data storage
-
Updated
Aug 9, 2018 - Python
Setup hadoop cluster manually and automatically
-
Updated
Jul 17, 2017 - Python
This is a TF-IDF calculator for shakespearean play dataset
-
Updated
Nov 15, 2017 - Python
Collection of assignments offered under COL733 - Cloud Computing by Prof. Suresh Chand Gupta
-
Updated
Jan 12, 2020 - Python
Big Data project. Web client for HDFS. Working in the terminal. Has ability to manipulate local and Hadoop storage
-
Updated
Nov 29, 2021 - Python
Worked on Hadoop file streaming
-
Updated
Jun 19, 2023 - Python
Distributed and Parallel Database Tasks
-
Updated
Feb 27, 2019 - Python
Bulk I/O Dispatch, i.e. BID Schemes. We have designed and developed two contention avoidance storage solutions, collectively known as BID: Bulk I/O Dispatch, for big data environment. BID-HDD is a disk scheduling scheme. BID-Hybrid is another contention avoidance scheme using hybrid tiers of storage for improving HDD performance using SSDs. In t…
-
Updated
Jul 6, 2017 - Python
When dealing with huge datasets, it is quite impossible that the code successfully executes on your personal desktop. You either need a locally installed clustered environment i.e. Hadoop Map-Reduce or a Cloud such as AWS. Here's an example of running such Job on AWS cloud.
-
Updated
Jul 20, 2019 - Python
Hadoop-Cluster
-
Updated
Sep 23, 2017 - Python
Improve this page
Add a description, image, and links to the hadoop-filesystem topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop-filesystem topic, visit your repo's landing page and select "manage topics."