Run This Ai
EN DE

SGLang

High-performance serving framework for LLMs with RadixAttention

★ 30,000 GitHub Apache-2.0 sglang LLM & Chat

Overview

High-performance serving framework for LLMs with RadixAttention

Requirements

Min vCPU
1
Min RAM
1024 MB
Min Disk
10 GB
Rec vCPU
2
Rec RAM
2048 MB
Rec Disk
20 GB

Recommended VPS

Hetzner · CX22

2 vCPU · 4096 MB · 40 GB

$3.79
View plan

Hetzner · CX22

2 vCPU · 4096 MB · 40 GB

$3.79
View plan

Hetzner · CX22

2 vCPU · 4096 MB · 40 GB

$3.79
View plan

Affiliate disclosure

Docker Compose

# Generated by Run This Ai — docker-compose.yml
services:
  sglang:
    image: lmsysorg/sglang:latest
    restart: unless-stopped
    ports:
      - 8080:8080
    volumes:
      - ./data/sglang:/data

Best VPS for SGLang →

SGLang: install

SGLang

High-performance serving framework for LLMs with RadixAttention

Prerequisites

  • Docker installed (version 24.0+)
  • Docker Compose (version 2.20+)
  • At least 1GB RAM (2GB recommended)

Quick start with Docker

# Pull the image
docker pull lmsysorg/sglang:latest

# Run the container
docker run -d --name sglang -p 8080:8080 lmsysorg/sglang:latest

Key features

SGLang: overview

SGLang

High-performance serving framework for LLMs with RadixAttention

Key features

What it's good for

SGLang runs entirely on your own infrastructure — your data never leaves your server.

Related tools