AMD ROCm Setup Guide
Note: Tested on RDNA 4 in Ubuntu 24.04. Other setups may require different configuration.
Quick Start
Prerequisites
1. ROCm Drivers
Verify ROCm is installed and sees your GPU:
2. Docker Setup
# Add yourself to required groups
sudo usermod -aG docker,video,render $USER
# Activate
newgrp docker
3. HuggingFace Cache
How It Works
CoTLab uses Docker with the official AMD vLLM image:
Host System (ROCm drivers)
│
└── Docker Container
├── rocm/vllm-dev:rocm7.1.1_navi_...
├── vLLM pre-compiled
└── GPU access via /dev/kfd, /dev/dri
Troubleshooting
Permission Denied
GPU Stuck at 100%
Known Limitations
vLLM does not support activation extraction (required for logit_lens and similar mechanistic experiments).