|
| 1 | +--- |
| 2 | +authors: [toine, timvc] |
| 3 | +date: 2025-05-26 |
| 4 | +slug: rocm |
| 5 | +--- |
| 6 | + |
| 7 | +# Mapping the AMD ROCm Ecosystem |
| 8 | + |
| 9 | +Within the EESSI community and Inuits, we're excited to share our latest contribution to the scientific computing community: a high-level overview of AMD's ROCm ecosystem. |
| 10 | +This document is the result of our recent efforts to prepare for adding ROCm support to EESSI, and we believe it will serve as a valuable resource for anyone working with AMD GPUs in scientific computing environments. |
| 11 | + |
| 12 | +The full overview document can be found at [**Overview of ROCm Ecosystem**](https://eessi.io/docs/rocm). |
| 13 | + |
| 14 | +<!-- more --> |
| 15 | + |
| 16 | +## Why we created this overview |
| 17 | + |
| 18 | +When preparing to add ROCm support to EESSI, we needed a high-level overview of the ROCm ecosystem, but couldn't find one online. |
| 19 | +While AMD's official [documentation](https://rocm.docs.amd.com/en/latest/) covers individual components thoroughly, we required a holistic view to understand how everything interconnects - from hardware architectures to high-level libraries and the dependencies between them. |
| 20 | +This understanding was crucial for making informed decisions about which components to prioritize and how to handle compatibility across diverse scientific computing environments. |
| 21 | + |
| 22 | +Rather than keep this knowledge internal, we decided to create the overview we wished had existed when we started. |
| 23 | +We hope this document will save others the research time we invested in piecing together information from scattered sources. |
| 24 | +Depending on community feedback and interest from AMD, we're also excited about the possibility of contributing portions or the entirety of this overview to the official ROCm documentation. |
| 25 | + |
| 26 | +Once we are ready we will contribute it back to the ROCm documentation as described [here](https://rocm.docs.amd.com/en/latest/contribute/contributing.html). |
| 27 | + |
| 28 | +## Scope |
| 29 | + |
| 30 | +The document we've created offers a structured exploration of the ROCm ecosystem, including: |
| 31 | + |
| 32 | +* AMD GPU microarchitectures (CDNA, RDNA, and earlier architectures) |
| 33 | +* Core components of the ROCm stack |
| 34 | +* Available programming models (HIP, OpenMP, OpenCL) |
| 35 | +* The compiler ecosystem (ROCm-LLVM, AOCC, AOMP, hipcc) |
| 36 | +* Developer tools for debugging and performance optimization |
| 37 | +* Libraries and frameworks for various computational domains |
| 38 | +* Compatibility policies |
| 39 | +* AMD GPUs available in Azure cloud environments |
| 40 | + |
| 41 | +We've also included dependency diagrams to help visualize the relationships between different components, making it easier to understand how everything fits together. |
| 42 | + |
| 43 | +## A living document for and by the community |
| 44 | + |
| 45 | +This overview is not meant to be a static document, we view it as a living resource that will evolve along with the ROCm ecosystem itself. |
| 46 | +As AMD continues to develop and enhance ROCm, we hope to update this document with the latest information, dependency relationships, and best practices. |
| 47 | + |
| 48 | +That's where you come in. |
| 49 | +While we've worked diligently to ensure accuracy and comprehensiveness, we recognize that the collective knowledge of the community far exceeds our own. |
| 50 | +We invite you to review the document, provide feedback, suggest additions, and point out any inaccuracies or omissions. |
| 51 | + |
| 52 | +## How to contribute |
| 53 | + |
| 54 | +We welcome contributions from the community to help improve and expand this overview. |
| 55 | +You can help by: |
| 56 | + |
| 57 | +* Reviewing the document for accuracy and identifying any outdated or incorrect information |
| 58 | +* Suggesting missing components, libraries, or tools that should be included |
| 59 | +* Providing insights on important details we may have overlooked |
| 60 | +* Sharing your experiences with specific ROCm components or workflows |
| 61 | +* Pointing out unclear explanations or areas that need better documentation |
| 62 | + |
| 63 | +Please submit your feedback, suggestions, and corrections via the [EESSI support portal](https://eessi.io/docs/support). |
| 64 | +We'll review all contributions and update the document accordingly to ensure it remains a valuable resource for the entire community. |
| 65 | + |
| 66 | +## Looking forward |
| 67 | + |
| 68 | +As we prepare to integrate ROCm support into EESSI, this overview serves as our roadmap. |
| 69 | +We hope it will also guide others in the scientific computing community who are exploring AMD GPUs for their computational needs. |
| 70 | + |
| 71 | +We look forward to your feedback and to continuing this collaborative effort to better understand and utilize the ROCm ecosystem for scientific computing. |
0 commit comments