Skip to content

Commit 464da86

Browse files
authored
Merge pull request #471 from Zeldhoron/rocm
Add ROCm overview and blog post
2 parents dfd346f + 51c5fce commit 464da86

5 files changed

Lines changed: 623 additions & 1 deletion

File tree

docs/blog/.authors.yml

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -63,4 +63,9 @@ authors:
6363
name: Elisabeth Ortega
6464
description: Do IT Now
6565
avatar: https://avatars.githubusercontent.com/u/16071227?v=4
66-
slug: https://github.com/draentropia
66+
slug: https://github.com/draentropia
67+
timvc:
68+
name: Tim Vancompernolle
69+
description: Inuits
70+
avatar: https://avatars.githubusercontent.com/u/78806270?v=4
71+
slug: https://github.com/Timvnc
Lines changed: 71 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,71 @@
1+
---
2+
authors: [toine, timvc]
3+
date: 2025-05-26
4+
slug: rocm
5+
---
6+
7+
# Mapping the AMD ROCm Ecosystem
8+
9+
Within the EESSI community and Inuits, we're excited to share our latest contribution to the scientific computing community: a high-level overview of AMD's ROCm ecosystem.
10+
This document is the result of our recent efforts to prepare for adding ROCm support to EESSI, and we believe it will serve as a valuable resource for anyone working with AMD GPUs in scientific computing environments.
11+
12+
The full overview document can be found at [**Overview of ROCm Ecosystem**](https://eessi.io/docs/rocm).
13+
14+
<!-- more -->
15+
16+
## Why we created this overview
17+
18+
When preparing to add ROCm support to EESSI, we needed a high-level overview of the ROCm ecosystem, but couldn't find one online.
19+
While AMD's official [documentation](https://rocm.docs.amd.com/en/latest/) covers individual components thoroughly, we required a holistic view to understand how everything interconnects - from hardware architectures to high-level libraries and the dependencies between them.
20+
This understanding was crucial for making informed decisions about which components to prioritize and how to handle compatibility across diverse scientific computing environments.
21+
22+
Rather than keep this knowledge internal, we decided to create the overview we wished had existed when we started.
23+
We hope this document will save others the research time we invested in piecing together information from scattered sources.
24+
Depending on community feedback and interest from AMD, we're also excited about the possibility of contributing portions or the entirety of this overview to the official ROCm documentation.
25+
26+
Once we are ready we will contribute it back to the ROCm documentation as described [here](https://rocm.docs.amd.com/en/latest/contribute/contributing.html).
27+
28+
## Scope
29+
30+
The document we've created offers a structured exploration of the ROCm ecosystem, including:
31+
32+
* AMD GPU microarchitectures (CDNA, RDNA, and earlier architectures)
33+
* Core components of the ROCm stack
34+
* Available programming models (HIP, OpenMP, OpenCL)
35+
* The compiler ecosystem (ROCm-LLVM, AOCC, AOMP, hipcc)
36+
* Developer tools for debugging and performance optimization
37+
* Libraries and frameworks for various computational domains
38+
* Compatibility policies
39+
* AMD GPUs available in Azure cloud environments
40+
41+
We've also included dependency diagrams to help visualize the relationships between different components, making it easier to understand how everything fits together.
42+
43+
## A living document for and by the community
44+
45+
This overview is not meant to be a static document, we view it as a living resource that will evolve along with the ROCm ecosystem itself.
46+
As AMD continues to develop and enhance ROCm, we hope to update this document with the latest information, dependency relationships, and best practices.
47+
48+
That's where you come in.
49+
While we've worked diligently to ensure accuracy and comprehensiveness, we recognize that the collective knowledge of the community far exceeds our own.
50+
We invite you to review the document, provide feedback, suggest additions, and point out any inaccuracies or omissions.
51+
52+
## How to contribute
53+
54+
We welcome contributions from the community to help improve and expand this overview.
55+
You can help by:
56+
57+
* Reviewing the document for accuracy and identifying any outdated or incorrect information
58+
* Suggesting missing components, libraries, or tools that should be included
59+
* Providing insights on important details we may have overlooked
60+
* Sharing your experiences with specific ROCm components or workflows
61+
* Pointing out unclear explanations or areas that need better documentation
62+
63+
Please submit your feedback, suggestions, and corrections via the [EESSI support portal](https://eessi.io/docs/support).
64+
We'll review all contributions and update the document accordingly to ensure it remains a valuable resource for the entire community.
65+
66+
## Looking forward
67+
68+
As we prepare to integrate ROCm support into EESSI, this overview serves as our roadmap.
69+
We hope it will also guide others in the scientific computing community who are exploring AMD GPUs for their computational needs.
70+
71+
We look forward to your feedback and to continuing this collaborative effort to better understand and utilize the ROCm ecosystem for scientific computing.

docs/site_specific_config/gpu.md

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -133,3 +133,39 @@ module load CUDA-Samples
133133
deviceQuery
134134
```
135135
If both are successful, you should see information about your GPU printed to your terminal.
136+
137+
## Support for using AMD GPUs {: #amd }
138+
139+
*(last update: May 2025)*
140+
141+
ROCm (Radeon Open Compute) support in EESSI is currently under development to enable GPU-accelerated computing on AMD hardware.
142+
143+
### Current status
144+
145+
ROCm support is not yet available in EESSI.
146+
We are actively working on integrating AMD's ROCm ecosystem to provide GPU computing capabilities for AMD GPUs alongside our existing CUDA support for NVIDIA hardware.
147+
148+
For the latest updates join the #amd-rocm channel in the [EESSI Slack](https://app.slack.com/client/TP0103C4C/C0780BEV9JN).
149+
150+
### Overview of ROCm ecosystem
151+
152+
To prepare for ROCm integration, we have created a comprehensive overview of the entire ROCm ecosystem.
153+
This document maps out AMD GPU architectures, core ROCm components, programming models, libraries, and the dependencies between them.
154+
155+
For a detailed understanding of the ROCm ecosystem that will inform our implementation, see [Overview of ROCm Ecosystem](rocm.md).
156+
157+
We also have a blog post about this [here](http://eessi.io/docs/blog/2025/05/26/rocm/).
158+
159+
### Future plans
160+
161+
Our ROCm integration will focus on providing:
162+
163+
* The ROCm core components
164+
* ROCm-enabled scientific computing libraries
165+
* Support for HIP, OpenMP, and OpenCL programming models
166+
* GPU-accelerated frameworks and applications
167+
168+
### Community input
169+
170+
We welcome feedback from the community on ROCm support priorities and use cases.
171+
If you have specific ROCm software requirements or insights to share, please [contact us](../support.md).

0 commit comments

Comments
 (0)