Import Beam modules

You’ll start by importing Beam’s App and Runtime:

  • App is the namespace for a project. You’ll give it a unique name as an identifier.
  • Inside the App is a Runtime. The Runtime is a definition of the hardware your container will run on.
from beam import App, Runtime

app = App(name="hello-beam", runtime=Runtime())

Multiply some numbers

This function multiplies two numbers.

To run it on Beam, add the @app.run() decorator to the function.

@app.run()
def multiply_numbers():
    print("This is running remotely on Beam!")
    x = 43
    y = 177
    print(f"🔮 {x} * {y} is {x * y}")

Run it on Beam

To run the function, run this command, substituting the name of your file:

beam run your_file.py:multiply_numbers

When you run this command, the function runs in the cloud instead of on your laptop. It will be packaged into a container, shipped onto an instance with the compute requirements you’ve specified in Runtime(), and the logs will be streamed to your terminal.

In addition, your files in the working directory will be recursively synced to the remote environment, so you can access them while running your function.

Feel free to close your terminal window after running the command. The function will continue running asynchronously on Beam, and you can leave and return later to retrieve the task results.

Customize the container image

Now we’re going to build a more interesting example that scrapes Wikipedia, and saves the page links to a text file.

This example requires some extra Python libraries, so we’ll customize our container image with beautifulsoup4 and requests.

Inside your app’s Runtime() is where you’ll add an Image. An Image is used to customize the container image for your function.

We’ll use the Image to define the Python libraries we need, using the python_packages argument. You can also add shell commands, using the commands field, but we’ll get to that in another section.

Beam containers have two defaults to be aware of:

  • Default container OS: Ubuntu 20.04
  • Default CUDA version: CUDA 12.2

from beam import App, Runtime, Image, Output

app = App(
    name="web_scraper",
    runtime=Runtime(
        image=Image(
            python_packages=["requests", "beautifulsoup4"],
        ),
    ),
)

Save file outputs

This function will save the scraped Wikipedia links to a text file, so we need to save that file somewhere.

We’re now going to introduce another Beam concept: task Outputs. Outputs let you save any files created while running your function.

For this example, we’ll save our output file as results.txt. A new file will be created each time this function is run.

@app.run(
    outputs=[Output(path="results.txt")],
)

Scraping logic

Here’s the actual application code that will scrape Wikipedia:

"""
These packages don't necessarily need to be installed locally.
They will be added in the container image defined below.
"""
from beam import App, Runtime, Image, Output

import requests
from bs4 import BeautifulSoup
from urllib.parse import urljoin

app = App(
    name="web_scraper",
    runtime=Runtime(
        image=Image(
            python_packages=["requests", "beautifulsoup4"],
        ),
    ),
)


@app.run(outputs=[Output(path="results.txt")])
def scrape_wikipedia():
    url = "https://en.wikipedia.org/wiki/Main_Page"

    response = requests.get(url)
    soup = BeautifulSoup(response.text, "html.parser")

    # Open the output file once, then append each link as it's found
    with open("results.txt", "a") as file:
        for link in soup.find_all("a", href=True):
            absolute_link = urljoin(url, link["href"])
            print(f"Found link: {absolute_link}")
            file.write(absolute_link + "\n")

To run the function, run this command, replacing your_file.py with the name of your file:

beam run your_file.py:scrape_wikipedia

You’ll see the scraped page links printed to your terminal, but remember - you can close your laptop and return to this later. Beam has a web dashboard which you can use to view the logs and retrieve outputs from asynchronous functions.

Run it on a schedule

You might want to run this function on a schedule instead. Let’s replace the run() decorator with a schedule() decorator, which will run this function every hour.

@app.schedule(
    when="every 1h",
    outputs=[Output(path="results.txt")],
)

To deploy the scheduled job, enter your shell and run:

beam deploy your_file.py:scrape_wikipedia 

This function will now run once an hour.

Retrieve task outputs

This task produces Outputs which we’ll want to retrieve when the task has finished running.

For this example, we’ll grab the outputs using the /v1/task/{task_id}/status/ API below. Make sure to replace the TASK_ID variable in the request URL with the ID created by your task.

You can find the Task ID in the shell after running your task, or in the web dashboard by clicking App -> Runs -> a specific run.

This request returns a URL to the generated text file in the outputs object.
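As a sketch, here’s how you might call the status endpoint and pull the output file URL out of the response. The host, auth scheme, and exact response shape are assumptions — inspect a real response from your dashboard to confirm the field names.

```python
import json
import urllib.request


def extract_output_urls(status_payload: dict) -> list:
    """Pull output-file URLs out of a task status response.

    The `outputs` shape is an assumption -- check a real response to confirm.
    """
    return [output["url"] for output in status_payload.get("outputs", [])]


def get_task_status(task_id: str, token: str) -> dict:
    """Call the status endpoint for a task (host and auth header are placeholders)."""
    request = urllib.request.Request(
        f"https://api.beam.cloud/v1/task/{task_id}/status/",
        headers={"Authorization": f"Basic {token}"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())


# Example payload, roughly the shape the endpoint might return:
sample = {
    "task_id": "abc123",
    "status": "COMPLETE",
    "outputs": [{"name": "results.txt", "url": "https://example.com/results.txt"}],
}
print(extract_output_urls(sample))  # ['https://example.com/results.txt']
```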

Setup task callbacks

You can also add a callback_url argument to receive notifications when your tasks finish running. Each time a task runs, a POST request will be fired to the URL provided.

@app.schedule(
    when="every 1h",
    outputs=[Output(path="results.txt")],
    callback_url="https://your-server.io/beam-task-complete"
)
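A minimal sketch of a server that could receive these callbacks, using Python’s standard library. The payload field names (task_id, status) are assumptions — log whatever arrives and inspect it before relying on specific keys.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def summarize_callback(payload: dict) -> str:
    """Turn a callback payload into a one-line log message.

    Field names here are assumptions; inspect a real payload to confirm.
    """
    return f"Task {payload.get('task_id', '?')} finished with status {payload.get('status', '?')}"


class BeamCallbackHandler(BaseHTTPRequestHandler):
    """Accept the POST Beam fires when a task finishes."""

    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        payload = json.loads(self.rfile.read(length) or b"{}")
        print(summarize_callback(payload))
        self.send_response(200)
        self.end_headers()


# To serve on port 8000 and point callback_url at this host:
# HTTPServer(("", 8000), BeamCallbackHandler).serve_forever()
```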
