chord-dfs is a decentralized distributed file system (DFS) built on the Chord Distributed Hash Table (DHT) protocol. Files are hashed and distributed across multiple nodes connected in a Chord ring. The goal is to provide a simple, fault-tolerant, and scalable DFS. This project is implemented using Python, Flask, and Docker.
- Organize nodes into a Chord ring using consistent hashing (see the sketch after this list)
- Upload and store files across distributed nodes
- Run each node in an isolated Docker container
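To give a feel for the consistent hashing that places both nodes and files on the ring, here is a minimal sketch assuming SHA-1 (the hash Chord uses) and a 16-bit identifier space. The ring size, node names, and helper names are illustrative assumptions, not the project's actual code:

```python
# Illustrative sketch of Chord-style consistent hashing (not the project's
# actual code). The 16-bit ring and node naming are assumptions.
import hashlib

M = 16              # number of identifier bits
RING = 2 ** M       # size of the identifier ring

def chord_hash(key: str) -> int:
    """Map a key (node address or file name) onto the identifier ring."""
    digest = hashlib.sha1(key.encode()).digest()
    return int.from_bytes(digest, "big") % RING

def successor(file_id: int, node_ids: list[int]) -> int:
    """Return the first node ID clockwise from file_id (wrapping around)."""
    for node_id in sorted(node_ids):
        if node_id >= file_id:
            return node_id
    return min(node_ids)  # wrap past the top of the ring

nodes = [chord_hash(f"node{i}") for i in range(5)]
file_id = chord_hash("report.pdf")
print(f"file {file_id} -> stored on node {successor(file_id, nodes)}")
```

Because node addresses and file names are hashed onto the same ring, locating a file reduces to finding the successor of its hash.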
- Docker installed and running
git clone [email protected]:goncalooliveirasilva/chord-dfs.git
cd chord-dfs
./run.sh
The run.sh script builds the Docker image and starts the system with 5 nodes by default. To run more nodes, update docker-compose.yml or modify the run.sh script.
The DFS exposes a REST API for interaction.
You can use curl to upload, download, and delete files. A UI is planned for the future (see Future Plans to learn more about what's coming up!).
Upload a File
curl -X POST -F "file=@<filename>" http://127.0.0.1:5000/files
Delete a File
# delete a specific file
curl -X DELETE http://127.0.0.1:5000/files/<filename>
# delete all files (not fully implemented yet)
curl -X DELETE http://127.0.0.1:5000/files
Download a File
# only see the content
curl -X GET http://127.0.0.1:5000/files/<filename>
# actually download
curl -O http://127.0.0.1:5000/files/<filename>
List Files
# get all file names stored (not fully implemented yet)
curl -X GET http://127.0.0.1:5000/files
These examples use port 5000, which corresponds to node0. Other nodes are accessible via ports 5001 to 5004. You can upload, download, or delete files from any node. Don't forget to replace <filename> with the name of an existing file!
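If you prefer scripting over raw curl, the same endpoints can be driven from Python. Below is a minimal sketch using the requests library; the endpoints match the curl examples above, and example.txt is a placeholder file name:

```python
# Minimal sketch of driving the documented endpoints with Python's
# requests library. "example.txt" is a placeholder file name.
import requests

BASE = "http://127.0.0.1:5000"  # node0; use ports 5001-5004 for other nodes

# upload a file (same as curl -F "file=@example.txt")
with open("example.txt", "rb") as f:
    resp = requests.post(f"{BASE}/files", files={"file": f})
print(resp.status_code)

# download it back
resp = requests.get(f"{BASE}/files/example.txt")
with open("example_copy.txt", "wb") as out:
    out.write(resp.content)

# delete it
requests.delete(f"{BASE}/files/example.txt")
```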
- Data persistence: Stored files are lost when Docker containers are stopped. A persistence mechanism is planned.
- Incomplete endpoints: DELETE all files and list files are not fully implemented yet.
- Files API - Endpoints for file upload/download/delete (a route sketch follows below).
- System API - Endpoints related to Chord ring operations.
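For orientation, here is a hedged Flask sketch of what the Files API surface could look like, inferred from the curl examples above. The handler bodies are placeholders, not the project's actual implementation:

```python
# Sketch of the Files API routes, inferred from the documented curl
# examples. Handler bodies are placeholders, not the real implementation.
from flask import Flask, request

app = Flask(__name__)

@app.route("/files", methods=["POST"])
def upload_file():
    uploaded = request.files["file"]  # multipart form field named "file"
    # ... hash the file name and forward it to the responsible node ...
    return {"stored": uploaded.filename}, 201

@app.route("/files/<filename>", methods=["GET"])
def download_file(filename):
    # ... locate the node holding the file and return its contents ...
    return f"contents of {filename}"

@app.route("/files/<filename>", methods=["DELETE"])
def delete_file(filename):
    # ... remove the file from the node that stores it ...
    return "", 204

if __name__ == "__main__":
    app.run(port=5000)
```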
Some ideas to make this project more robust and user-friendly:
- Custom node scaling: Allow launching a dynamic number of nodes via script or CLI argument
- Data persistence: Use Docker volumes to retain files across container restarts
- Simple web interface: Upload and download files and monitor nodes via a UI
- Dynamic node joins/leaves: Add and remove nodes without affecting the system
- File chunking: Split large files across nodes (one possible approach is sketched after this list)
- Fault tolerance: Replicate data for reliability
- User accounts: Support for authentication and user-specific storage
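For the file chunking idea, one purely illustrative approach is to split a file into fixed-size chunks and name each chunk so it hashes to its own position on the ring, letting large files spread across nodes. The 1 MiB chunk size and naming scheme are assumptions:

```python
# Purely illustrative sketch of the planned file-chunking feature.
# The 1 MiB chunk size and chunk-naming scheme are assumptions.
CHUNK_SIZE = 1024 * 1024  # 1 MiB

def chunk_file(path: str):
    """Yield (chunk_name, data) pairs; each name hashes to its own ring slot."""
    with open(path, "rb") as f:
        index = 0
        while True:
            data = f.read(CHUNK_SIZE)
            if not data:
                break
            yield f"{path}.chunk{index}", data
            index += 1
```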
This project is based on this Chord implementation and on this paper.