Hades: A Scalable Job Scheduler for Container Workloads

Hades is a job scheduler for containerized workloads, designed with scalability in mind. It aims to provide a simple, scalable, and adaptable way to execute containerized jobs in a range of environments, from educational programming courses to research computing clusters.

Design Goals

Hades embodies several core design principles:

Simplicity: Hades focuses on delivering just the essentials required to execute containerized jobs efficiently, without unnecessary complexity.
Scalability: Hades is built to queue and execute a large number of jobs in parallel, which makes it suitable for large-scale operations.
Container-Based: Hades executes jobs within containers, ensuring a high level of isolation and security between workloads.
Kubernetes Native: Hades uses Kubernetes as its primary execution platform for production workloads.
Extensibility: Hades is designed to be highly extensible, allowing for easy integration with other execution platforms and workflow systems as needed.

Architecture

Hades is built upon the following key components:

API: Serving as the main entry point, the API handles all incoming job requests and provides status information.
Queue: A NATS-backed message queue holds submitted jobs and delivers them to the scheduler.
Scheduler: The scheduler orchestrates the execution of jobs, coordinating with the executor components to run each job step in the appropriate environment.
- Docker Executor: Designed for local development, the Docker executor is responsible for running jobs within Docker containers on a single host.
- Kubernetes Executor: Intended for production use, the Kubernetes executor executes jobs within a Kubernetes cluster, providing improved scalability, reliability, and resource utilization.
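
The split between the scheduler and its two executors can be pictured as one interface with two implementations. The following is a minimal sketch under stated assumptions, not the actual Hades code: the `Executor` interface, the `Step` type, and `NewExecutor` are hypothetical names chosen for illustration, and the executors only print what they would do.

```go
package main

import (
	"errors"
	"fmt"
)

// Step is a hypothetical, trimmed-down job step (name and container image only).
type Step struct {
	Name  string
	Image string
}

// Executor is an assumed abstraction over the platforms a step can run on.
type Executor interface {
	Name() string
	Run(step Step) error
}

type dockerExecutor struct{}

func (dockerExecutor) Name() string { return "docker" }

func (dockerExecutor) Run(s Step) error {
	// A real executor would start a container on the local Docker host here.
	fmt.Printf("[docker] would run %q in image %s\n", s.Name, s.Image)
	return nil
}

type kubernetesExecutor struct{}

func (kubernetesExecutor) Name() string { return "kubernetes" }

func (kubernetesExecutor) Run(s Step) error {
	// A real executor would create the workload in a Kubernetes cluster here.
	fmt.Printf("[kubernetes] would run %q in image %s\n", s.Name, s.Image)
	return nil
}

// NewExecutor maps an executor kind ("docker" or "kubernetes") to an implementation.
func NewExecutor(kind string) (Executor, error) {
	switch kind {
	case "docker":
		return dockerExecutor{}, nil
	case "kubernetes":
		return kubernetesExecutor{}, nil
	default:
		return nil, errors.New("unknown executor: " + kind)
	}
}

func main() {
	ex, err := NewExecutor("docker")
	if err != nil {
		panic(err)
	}
	_ = ex.Run(Step{Name: "Hello World", Image: "alpine:latest"})
}
```

Keeping the scheduler dependent only on an interface like this is one way the extensibility goal could be met: a further platform would be another implementation plus one more `case`.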

How It Works

Hades processes jobs through a sequence of well-defined steps:

Job Submission: A job, defined as a series of steps to execute, is submitted to the API.
Queuing: The job is queued in NATS for asynchronous processing.
Scheduling: The scheduler picks up the job and schedules it on the appropriate executor.
Execution: Each step of the job runs in its own container, with steps sharing data through a common volume.
Completion: Upon completion, results are stored and made available through the API.

Getting Started

Prerequisites

Docker and Docker Compose for local development
kubectl and a Kubernetes cluster for production deployment
Minikube for local Kubernetes testing (optional)

Running in Docker Mode

To run Hades in Docker mode for local development:

Clone the repository:

git clone https://github.com/ls1intum/hades.git
cd hades

Copy the .env.example file to .env:
```
cp .env.example .env
```
The default configuration uses Docker as the executor, so no changes are necessary for local testing.
Start the Hades services:
```
docker compose up -d
```

Running in Kubernetes Mode

For production deployments or testing with Kubernetes:

Ensure you have a running Kubernetes cluster and a valid kubeconfig file.
Copy the .env.example file to .env and update the configuration:
```
cp .env.example .env
```
Change the HADES_EXECUTOR variable to kubernetes in your .env file.
Adjust the kubeconfig volume mount in docker-compose.k8s.yml so that it points to your kubeconfig file.

Start Hades in Kubernetes mode:

docker compose -f docker-compose.yml -f docker-compose.k8s.yml up -d

Usage Examples

Creating a Simple Job

Here's an example of submitting a basic job to Hades:

{
  "name": "Example Job",
  "metadata": {
    "GLOBAL": "test"
  },
  "steps": [
    {
      "id": 1,
      "name": "Hello World",
      "image": "alpine:latest",
      "script": "echo 'Hello, Hades!'"
    }
  ]
}

Submit this job using:

curl -X POST -H "Content-Type: application/json" -d @job.json http://localhost:8080/build

Multi-Step Job Example

For more complex workflows, you can define multi-step jobs where each step runs in a different container:

{
  "name": "Multi-Step Example",
  "steps": [
    {
      "id": 1,
      "name": "Step 1",
      "image": "alpine:latest",
      "script": "echo 'Setting up environment...' > /shared/output.txt"
    },
    {
      "id": 2,
      "name": "Step 2",
      "image": "ubuntu:latest",
      "script": "cat /shared/output.txt && echo 'Processing data...' >> /shared/output.txt"
    },
    {
      "id": 3,
      "name": "Step 3",
      "image": "python:3.9-alpine",
      "script": "cat /shared/output.txt && echo 'Finalizing...' >> /shared/output.txt && cat /shared/output.txt"
    }
  ]
}

Configuration Options

Hades can be configured through environment variables or a .env file:

| Variable | Description | Default |
| --- | --- | --- |
| `HADES_EXECUTOR` | Execution platform: `docker` or `kubernetes` | `docker` |
| `CONCURRENCY` | Number of jobs to process concurrently | `1` |
| `API_PORT` | Port for the Hades API | `8080` |

Deployment

Ansible Deployment

Hades includes Ansible playbooks for automated deployment. See the ansible/hades/README.md file for more details.

High-Level Architecture Diagram

┌─────────┐         ┌─────────┐          ┌───────────────┐
│         │         │         │          │               │
│  API    │────────▶│  NATS   │─────────▶│  Scheduler    │
│         │         │ Queue   │          │               │
└─────────┘         └─────────┘          └───────┬───────┘
                                                 │
                                                 ▼
                        ┌────────────────────────┴───────────────────────┐
                        │                                                │
                        ▼                                                ▼
                 ┌─────────────┐                               ┌─────────────────┐
                 │             │                               │                 │
                 │   Docker    │                               │   Kubernetes    │
                 │  Executor   │                               │    Executor     │
                 │             │                               │                 │
                 └─────────────┘                               └─────────────────┘

Acknowledgments

Special thanks to all contributors who have helped shape Hades
Inspired by the need for a lightweight, scalable job execution system in educational environments
Built with Go, Docker, Kubernetes, and NATS
