Orion is a fine-grained, interference-free scheduler for GPU sharing across ML workloads. We assume one of the clients is high-priority, while the rest of the clients are best-effort.
Orion intercepts CUDA, CUDNN, and CUBLAS calls and submits them into software queues. The Scheduler polls these queues and schedules operations based on their resource requirements and their priority. See ARCHITECTURE for more details on the system and the scheduling policy.
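For intuition, here is a minimal Python sketch of that producer/consumer pattern: intercepted operations are pushed into software queues, and a scheduler thread polls them, always preferring the high-priority client. This is only a conceptual illustration; the `Op` fields, queue layout, and SM-based admission check are assumptions, and Orion's real interception and scheduling logic live in `src/cuda_capture` and `src/scheduler`.

```python
# Conceptual sketch only: Orion's actual interception and scheduling are
# implemented in C++ (src/cuda_capture, src/scheduler). The Op fields, queue
# layout, and SM-based admission check below are illustrative assumptions.
import queue
import threading
import time


class Op:
    def __init__(self, name, sm_needed, profile):
        self.name = name            # e.g. an intercepted kernel launch
        self.sm_needed = sm_needed  # SMs the kernel is expected to occupy
        self.profile = profile      # e.g. "compute"- or "memory"-bound


hp_queue = queue.Queue()  # operations of the high-priority client
be_queue = queue.Queue()  # operations of the best-effort client(s)
stop = threading.Event()


def dispatch(op):
    # In Orion this would submit the real call to a CUDA stream; here we log.
    print(f"dispatching {op.name} ({op.profile}, {op.sm_needed} SMs)")


def scheduler_loop(total_sms=80):
    while not stop.is_set():
        # High-priority operations are always dispatched first.
        try:
            dispatch(hp_queue.get_nowait())
            continue
        except queue.Empty:
            pass
        # Best-effort operations are admitted only if they are unlikely to
        # interfere, e.g. if they leave enough SMs free (illustrative check).
        try:
            op = be_queue.get_nowait()
            if op.sm_needed <= total_sms // 2:
                dispatch(op)
            else:
                be_queue.put(op)  # defer for now
        except queue.Empty:
            time.sleep(0.001)    # nothing to do, back off briefly


if __name__ == "__main__":
    threading.Thread(target=scheduler_loop, daemon=True).start()
    hp_queue.put(Op("hp_conv2d", sm_needed=60, profile="compute"))
    be_queue.put(Op("be_gemm", sm_needed=30, profile="compute"))
    time.sleep(0.1)
```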
Orion expects each submitted job to provide a file listing all of its operations, along with their profiles and Streaming Multiprocessor (SM) requirements. See PROFILE for detailed instructions on how to profile a client application and how to generate the profile files.
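As a purely hypothetical illustration of the kind of per-operation information involved (operation name, execution profile, SM requirement, duration), such a file could resemble the CSV written below. The column names and values are assumptions, not Orion's actual schema; PROFILE documents the real workflow and format.

```python
# Hypothetical illustration only: the real file layout is described in PROFILE.
# Column names and values below are assumptions, not Orion's actual schema.
import csv

ops = [
    # (operation, profile, SMs needed, duration in microseconds)
    ("conv2d_fwd_kernel", "compute", 68, 420.0),
    ("batchnorm_fwd_kernel", "memory", 24, 95.0),
    ("relu_kernel", "memory", 16, 30.0),
]

with open("resnet50_profile.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["op_name", "profile", "sm_needed", "duration_us"])
    writer.writerows(ops)
```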
We provide a Docker image, fotstrt/orion-ae, with all packages pre-installed.
Alternatively, follow the instructions in the 'setup' directory and check INSTALL to install Orion and its dependencies.
See PROFILE to generate the profiling files for each workload. Then create a JSON file containing all the information for the workloads that will share the GPU; see the examples under 'eval'.
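The snippet below sketches, under assumed field names, what such a configuration might contain for one high-priority and one best-effort client. Every field name here is an assumption made for illustration; the examples under 'eval' define the actual schema.

```python
# Hypothetical sketch: field names are assumptions; consult the examples
# under 'eval' for the actual configuration schema Orion expects.
import json

config = [
    {
        "model": "resnet50_train",
        "kernel_file": "profiles/resnet50_profile.csv",  # per-op profile file
        "priority": "high",          # the single high-priority client
        "batch_size": 32,
    },
    {
        "model": "mobilenet_inference",
        "kernel_file": "profiles/mobilenet_profile.csv",
        "priority": "best_effort",
        "batch_size": 8,
    },
]

with open("config.json", "w") as f:
    json.dump(config, f, indent=2)
```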
The file 'launch_jobs.py' is responsible for spawning the scheduler and the application thread(s).
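As a rough sketch of that launch pattern (the actual command-line interface of launch_jobs.py and the API in scheduler_frontend.py may differ; run_scheduler and run_client are hypothetical placeholders):

```python
# Hypothetical sketch of the launch pattern: one scheduler thread plus one
# thread per client. run_scheduler and run_client are placeholders; the real
# entry points are launch_jobs.py and src/scheduler_frontend.py.
import json
import threading


def run_scheduler(workloads):
    print(f"scheduler started for {len(workloads)} workloads")  # placeholder


def run_client(workload):
    print(f"client started: {workload['model']}")  # placeholder DNN workload


with open("config.json") as f:
    workloads = json.load(f)

threads = [threading.Thread(target=run_scheduler, args=(workloads,))]
threads += [threading.Thread(target=run_client, args=(w,)) for w in workloads]

for t in threads:
    t.start()
for t in threads:
    t.join()
```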
```
> tree .
├── profiling                 # Scripts and instructions for profiling
│   ├── benchmarks            # Scripts of DNN models for profiling
│   └── postprocessing        # Scripts for processing of profile files
├── src                       # Source code
│   ├── cuda_capture          # Code to intercept CUDA/CUDNN/CUBLAS calls
│   ├── scheduler             # Implementation of the scheduling policy
│   └── scheduler_frontend.py # Python interface for the Orion scheduler
├── benchmarking              # Scripts and configuration files for benchmarking
├── related                   # Some of the related baselines: MPS, Streams, Tick-Tock
├── artifact_evaluation       # Scripts and instructions for artifact evaluation
└── setup                     # Instructions and scripts to install Orion's prerequisites
```
Orion currently supports NVIDIA GPUs.
See INSTALL.
See DEBUGGING.
If you use Orion, please cite our paper: (TODO)