SeaweedFS is a distributed storage system for object storage (S3), file systems, and Iceberg tables, designed to handle billions of files with O(1) disk access and effortless horizontal scaling.
Updated Apr 15, 2026 - Go
More than 2,000 data engineer interview questions.
MorphL Community Edition uses big data and machine learning to predict user behaviors in digital products and services, with the end goal of increasing KPIs (click-through rates, conversion rates, etc.) through personalization.
A tool for scale and performance testing of HDFS with a specific focus on the NameNode.
Data Engineering Project with Hadoop HDFS and Kafka
Big Data essentials: Hadoop, MapReduce, Spark. Explore tutorials and demos in Jupyter notebooks—most are self-contained and live, ready to run with a click.
Learn how to use Spark SQL and the HSpark connector package to create and query data tables that reside in HBase region servers.
Simplified ETL process in Hadoop using Apache Spark. Includes a complete ETL pipeline for a data lake: SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations.
Big data projects implemented by Maniram Yadav
Big data analysis of a travel website (partial Ctrip data): a Hadoop course project (undergraduate coursework level)
By Smart Shaped s.r.l. (https://www.smartshaped.com/)
HokStack - Run Hadoop Stack on Kubernetes
Ansible playbook for setting up Hadoop HDFS
A fully functional Hadoop YARN cluster as a docker-compose deployment.
Open source data infrastructure platform. Designed for developers, built for speed.
Helm chart for Apache Hadoop using multi-arch docker images
λFS: an elastic, high-performance, serverless-function-based metadata service for large-scale distributed file systems (ACM ASPLOS'23)
Twitter + Flume + Hadoop (HDFS, MapReduce) + Neo4j + Python
Toy Hadoop cluster combining various SQL-on-Hadoop variants