Brave Search

What’s the best way to learn how to use and tools/features for ROCm?

reddit.com › r › ROCm › comments › 1gjzx5m › whats_the_best_way_to_learn_how_to_use_and

ROCm isn't so much a tool you use directly as an enabling software stack that other tools require in order to run on AMD hardware. A wide range of tools can use ROCm, because one of the things it does is provide an alternative code path for the CUDA API domain. So basically anything that can run on CUDA can be ported to use ROCm instead. Or, with runtime wrappers like ZLUDA, you can even run code that was written only for CUDA. You need to start by reading a lot. Watch some YouTube videos and do whatever you need to get a foothold in establishing an execution environment. ROCm is still not really a Windows-friendly thing (that should be coming soon) except via WSL2, so you need some experience with Linux and Python. But really, you can follow walkthroughs and just start learning and reading about whatever you find you don't understand. Mind you, it never really ends. You might start here for ROCm-specific documentation: https://rocm.docs.amd.com/en/latest/what-is-rocm.html But deciding what sort of AI projects you want to learn, and figuring out what those projects need in order to run, will give you a better starting point. For example, I've been playing with different versions of Stable Diffusion, using automatic1111 with ZLUDA directly in Win11 on one setup and SD.Next in WSL Ubuntu on another, and those work nicely. I had to find forks that people had made specifically to use ROCm instead of CUDA, but now more projects can be configured for either easily. And now I want to start playing with ComfyUI and get more involved with understanding pipelines and multi-step workflows with HuggingFace models. https://huggingface.co/learn Good luck! Answer from GanacheNegative1988 on reddit.com
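Since the answer describes ROCm as a layer underneath tools such as PyTorch rather than something you call directly, one quick way to see that layering is to ask an installed PyTorch which backend it was built for. The following is a minimal sketch, assuming PyTorch may or may not be installed; the helper name `describe_torch_backend` is made up for this illustration. It relies on `torch.version.hip`, which ROCm builds of PyTorch set to a version string, and on the fact that ROCm builds reuse the `torch.cuda` namespace.

```python
import importlib


def describe_torch_backend():
    """Report whether the installed PyTorch build targets ROCm (HIP), CUDA, or CPU only.

    Hypothetical helper for illustration; it returns a plain dict and never raises
    if PyTorch is simply missing.
    """
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        return {"torch_installed": False, "backend": None, "gpu_available": False}

    # ROCm builds set torch.version.hip to a version string; other builds leave it None.
    hip_version = getattr(torch.version, "hip", None)
    cuda_version = getattr(torch.version, "cuda", None)
    if hip_version:
        backend = "rocm"
    elif cuda_version:
        backend = "cuda"
    else:
        backend = "cpu"

    return {
        "torch_installed": True,
        "backend": backend,
        # ROCm builds expose the GPU through the torch.cuda namespace as well.
        "gpu_available": bool(torch.cuda.is_available()),
    }


if __name__ == "__main__":
    print(describe_torch_backend())
```

On a working ROCm setup the `backend` field would be expected to read `rocm`; a `cuda` or `cpu` result usually means a different PyTorch wheel was installed.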

GitHub

github.com › ROCm › rocm-examples

GitHub - ROCm/rocm-examples: A collection of examples for the ROCm software stack · GitHub

rocprof-systems: Demonstrates how to use the ROCm Systems Profiler. rocprofv3: Illustrates how to use the rocprofv3 profiler. Tutorials: Showcases HIP Documentation Tutorials.

Starred by 293 users

Forked by 91 users

GitHub

github.com › ROCm › rocm

GitHub - ROCm/ROCm: AMD ROCm™ Software - GitHub Home

If you’re using AMD Radeon GPUs or Ryzen APUs in a workstation setting with a display connected, see the ROCm on Radeon and Ryzen documentation for operating system/framework support and step-by-step installation instructions.

Starred by 6.7K users

Forked by 575 users

Languages Shell 84.7% | Python 11.4% | Makefile 2.4%

Discussions

Getting started with rocM

I use Fedora 38 with ROCm. I target the RHEL 9.2 repo that AMD hosts for the ROCm bits. Make sure to exclude dkms during install if you're using a GUI Linux, as you'll already have the driver loaded. Unfortunately it is extremely niche: AMD users who also use Linux and also want to play with AI/ML are a tiny subset of people capable of even reaching the starting line. The good news is it works great. I've run native Stable Diffusion with all kinds of models and model add-ons via ROCm on my 6800 XT. Just recently got mlc-llm fully working with the new Llama 2 models too. I'm getting 70+ tokens per second, amazing performance. Really cool stuff going on; the people who say AMD can't do ML/AI are just lazy. And with all the VRAM, even a 6800 can run huge 13B models with crazy performance. Here's the article that got me going with the Llama 2 stuff. I had to compile some stuff to get the 6800 XT working, but if you have the 7000 series it's all precompiled. https://blog.mlc.ai/2023/08/09/Making-AMD-GPUs-competitive-for-LLM-inference
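The advice to skip dkms rests on the kernel's `amdgpu` driver already being loaded. A rough way to confirm that on Linux is to look for `amdgpu` in `/proc/modules`. This is only a sketch: the function names are invented for the example, and a driver compiled directly into the kernel (rather than loaded as a module) would not show up in that file.

```python
from pathlib import Path


def loaded_modules(proc_modules_text):
    """Return the set of module names found in the text of /proc/modules.

    Each line of /proc/modules starts with the module name followed by a space.
    """
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def amdgpu_loaded(proc_modules_text):
    """True if the amdgpu kernel module appears in the given /proc/modules text."""
    return "amdgpu" in loaded_modules(proc_modules_text)


if __name__ == "__main__":
    path = Path("/proc/modules")
    if path.exists():
        print("amdgpu loaded:", amdgpu_loaded(path.read_text()))
    else:
        print("/proc/modules not found (probably not a Linux system)")
```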

r/Amd

27

20

August 28, 2023

ROCm 7.2 official installation instructions

Holy shit, AMD crew. Either I am doing something wrong or there is a RADICAL performance boost for diffusion model inference. I just ran a 5-second Wan generation with 3 samplers (6 steps total). It normally takes me 10 minutes. It just ran in 190 seconds. FP16! I am on an R9700. Did we just more than double our speed? lol, holy smokes, the 2nd generation of 5 s is down to 172 s. FP16, 6 steps, 3 samplers: this is nutty. [edit] Testing some other models and workflows for image gen in Comfy. 7.2 on Linux is a MASSIVE perf boost. Insane. I was coming off 6.4.1.
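To put the numbers in that post side by side, the speedup is just the old time divided by the new time. The sketch below treats "10 minutes" as exactly 600 seconds, which is an assumption, since the poster gives it only as a typical figure.

```python
def speedup(baseline_seconds, new_seconds):
    """Return how many times faster the new run is than the baseline."""
    return baseline_seconds / new_seconds


baseline = 10 * 60  # "normally takes me 10 minutes", taken as exactly 600 s

for label, seconds in [("first run", 190), ("second run", 172)]:
    print(f"{label}: {speedup(baseline, seconds):.2f}x faster")

# first run: 3.16x faster
# second run: 3.49x faster
```

Under that assumption the reported runs are a bit more than three times faster, not merely double.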

r/ROCm

42

54

January 22, 2026

Videos

youtube.com

AMD HIP Tutorial, 9-1, ROCm Libraries

06:01

YouTube

AMD HIP Tutorial, 1-4, What is ROCm - YouTube

April 19, 2024

15:54

YouTube

How To Install AMD ROCm, PyTorch, Stable Diffusion & YOLO - 2024 ...

March 16, 2024

08:01

YouTube

How to Install AMD ROCm on Linux - Updated Guide 10/2023 - YouTube

AMD GPU Linux ROCm installation - step by step guide - YouTube

June 6, 2023


AMD ROCm

rocm.docs.amd.com

AMD ROCm documentation — ROCm Documentation

Use ROCm for AI · AI tutorials · Use ROCm for HPC · System optimization · AMD Instinct MI300X performance validation and tuning · System debugging · Use advanced compiler features · Set the number of CUs · Troubleshoot BAR access limitation · ROCm examples

DEV Community

dev.to › digitalocean › gpu-programming-for-beginners-rocm-amd-setup-to-edge-detection-29bm

GPU Programming for Beginners: ROCm + AMD Setup to Edge Detection - DEV Community

March 10, 2026 - Understanding GPU programming is ... We'll use ROCm and HIP (AMD's version of CUDA) to take you from zero to running real GPU code, culminating in a computer vision edge detector that processes images in parallel...

reddit.com › r/rocm › what’s the best way to learn how to use and tools/features for rocm?

r/ROCm on Reddit: What’s the best way to learn how to use and tools/features for ROCm?

November 5, 2024

I'm a newbie at AI but want to learn the ROCm tool set. Any advice?


2 of 3

1

There are a lot of good books on CUDA. Learn CUDA, then use the ROCm documentation to port your CUDA code to ROCm.
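Much of a simple CUDA-to-ROCm port is mechanical, because HIP mirrors many CUDA runtime calls under a `hip` prefix. The toy sketch below shows the idea with a small rename table. It is not the real porting tool (AMD ships `hipify-perl` and `hipify-clang` for that), the function name `toy_hipify` is made up, and the table covers only a handful of well-known runtime names.

```python
import re

# A few CUDA runtime names and their HIP counterparts.
# Deliberately tiny: real ports should use AMD's hipify tools.
CUDA_TO_HIP = {
    "cuda_runtime.h": "hip/hip_runtime.h",
    "cudaMalloc": "hipMalloc",
    "cudaMemcpy": "hipMemcpy",
    "cudaMemcpyHostToDevice": "hipMemcpyHostToDevice",
    "cudaMemcpyDeviceToHost": "hipMemcpyDeviceToHost",
    "cudaFree": "hipFree",
    "cudaDeviceSynchronize": "hipDeviceSynchronize",
}

# Longest names first, and whole-word matches only, so that unknown names
# such as cudaFreeHost are left untouched instead of being half-renamed.
_NAMES = sorted(CUDA_TO_HIP, key=len, reverse=True)
_PATTERN = re.compile(r"\b(?:" + "|".join(re.escape(n) for n in _NAMES) + r")\b")


def toy_hipify(source):
    """Rename the CUDA identifiers listed in CUDA_TO_HIP to their HIP equivalents."""
    return _PATTERN.sub(lambda m: CUDA_TO_HIP[m.group(0)], source)


if __name__ == "__main__":
    cuda_snippet = (
        "#include <cuda_runtime.h>\n"
        "cudaMalloc(&d_x, n);\n"
        "cudaMemcpy(d_x, h_x, n, cudaMemcpyHostToDevice);\n"
        "cudaFree(d_x);\n"
    )
    print(toy_hipify(cuda_snippet))
```

Anything outside the table passes through unchanged, which makes it easy to spot what still needs manual attention.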

AMD

rocm-handbook.amd.com

AMD ROCm Programming Guide — AMD ROCm Programming Guide 7.2.4

Multi-kernel programming: breadth-first search tutorial · HIP compilers · Performance optimization techniques · Understanding GPU performance · Performance guidelines · Optimizing performance · Highly parallel workload: image gamma correction · Fixed-size kernels: image gamma correction · Reduction · Tiling and reuse: matrix multiplication · Tiling and coalescing: matrix transpose · Multi-GPU programming · ROCm platform

AMD

amd.com › developer central › rocm™ hub › training videos

AMD ROCm™ Platform Training Videos

June 24, 2024 - This presentation goes over the AMD Instinct™ architecture and the basics of developing applications within the AMD ROCm ecosystem.


AMD ROCm

rocm.docs.amd.com › projects › ai-developer-hub › en › latest

Tutorials for AI developers - ROCm Documentation

The AI Developer Hub contains AMD ROCm tutorials in Jupyter Notebook format for training, fine-tuning, and inference.

AMD ROCm

rocm.docs.amd.com › en › docs-5.0.2 › examples › all.html

All Tutorial Material — ROCm 5.0.2 Documentation Home

Detailed walkthroughs of specific use-cases driven by frameworks using ROCm acceleration.

AMD GPUOpen

gpuopen.com › learn › amd-lab-notes › amd-lab-notes-rocm-installation-readme

AMD ROCm™ installation - AMD GPUOpen

Installation of the AMD ROCm™ software package can be challenging. This introductory material shows how to install ROCm on a workstation with an AMD GPU card that supports the AMD GFX9 architecture.

reddit.com › r/amd › getting started with rocm

r/Amd on Reddit: Getting started with rocM

August 28, 2023

Hello,

I recently acquired a new PC with a 7900 XTX and a 7800X3D and wanted to try running ROCm. I already found the documentation:
https://rocm.docs.amd.com/en/latest/index.html

However, I wanted to ask whether there is an active community that discusses current progress, bugs, and so on. I honestly can't find much information online, though it is probably a relatively niche topic since most people use NVIDIA.

I will also probably have to install a supported OS and the respective kernel according to: https://rocm.docs.amd.com/en/latest/release/gpu_os_support.html

How accurate is this? I currently have Ubuntu 22.04.3 with kernel 6.2.0-26-generic, which is listed in neither the supported nor the unsupported tab.

Also, what about the disadvantages of dockerization? It sounds pretty good and useful, but will performance suffer? I'm somewhat familiar with Docker, but I'm skeptical about how passing the actual physical GPU to the container works.

I would be grateful for any information, sources, or advice you can give me.
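On the Docker question: ROCm containers generally reach the GPU through the host's device nodes, `/dev/kfd` (the compute interface) and `/dev/dri` (the render nodes), so the container talks to the same kernel driver as the host. The sketch below only assembles such a `docker run` command as a list. The helper name is invented, and the image tag `rocm/pytorch:latest` and the `rocminfo` command are example choices, not something taken from the thread.

```python
import shlex


def rocm_docker_command(image, command=("rocminfo",)):
    """Build a `docker run` argument list that exposes an AMD GPU to a container.

    Hypothetical helper: it only constructs the command, it does not run it.
    """
    return [
        "docker", "run", "--rm", "-it",
        "--device=/dev/kfd",      # ROCm compute interface
        "--device=/dev/dri",      # GPU render nodes
        "--group-add", "video",   # group that typically owns the device nodes
        image,
        *command,
    ]


if __name__ == "__main__":
    # Example image name; substitute whichever ROCm image you actually use.
    print(shlex.join(rocm_docker_command("rocm/pytorch:latest")))
```

Depending on the distribution, the device nodes may belong to the `render` group instead of `video`, so the `--group-add` value may need adjusting.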