{perf}[gompi/2025b] HPCToolkit v2025.0.1 w/ CUDA 12.9.1 #23830
Micket merged 18 commits into easybuilders:develop
…25.0.0-gompi-2025b-CUDA-12.9.1.eb
Updated software
|
Fail tests due to incorrect values Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
May pick up system OpenCL headers. Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
|
Test report by @Thyre |
The three mentioned subprojects are all part of the HPCToolkit sources. So it shouldn't try to download anything fortunately. |
|
Test report by @Thyre |
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
The tests failed here because I was building on a node; this was picked up, and the check does not handle it correctly. |
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
…ew_pr_HPCToolkit202500
|
@boegelbot please test @ jsc-zen3-a100 |
|
@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de PR test command '
Test results coming soon (I hope)...
- notification for comment with ID 3271028326 processed
Message to humans: this is just bookkeeping information for me, |
|
HPCToolkit-2025.0.0-gompi-2025b.eb Test report by @Thyre |
|
Test report by @Thyre |
|
HPCToolkit-2025.0.0-gompi-2025b-CUDA-12.9.1.eb Test report by @Thyre |
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
|
Test report by @boegelbot |
Yeah... that's HPCToolkit picking up that we're running on a compute node. The CUDA tests also fail with insufficient permissions. |
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
|
Test report by @Thyre |
stripped See https://gitlab.com/hpctoolkit/hpctoolkit/-/merge_requests/1310 for more information on why this is done. Signed-off-by: Jan André Reuter <j.reuter@fz-juelich.de>
|
Test report by @Thyre |
|
Test report by @Thyre |
|
@boegelbot please test @ jsc-zen3-a100 |
|
@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de PR test command '
Test results coming soon (I hope)...
- notification for comment with ID 3275022689 processed
Message to humans: this is just bookkeeping information for me, |
|
Test report by @boegelbot |
Co-authored-by: Simon Branford <4967+branfosj@users.noreply.github.com>
|
Test report by @branfosj |
|
I created easybuilders/easybuild-framework#4996 to keep track of how we could improve the rpath sanity check. |
|
Test report by @branfosj |
|
Recording the test failures when performance counters are not available here, so that others can see them without having to search the logs. jsc-zen3 has 6 failures:
I have 12 - the above 6 and also:
The ordering of the tests changes, but these pass on jsc-zen3 (tests 57-63). Not sure what difference causes the extra 6 to fail. I know my system has the relevant performance counters disabled. |
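For anyone comparing systems: on Linux, one common reason for unavailable performance counters is the kernel's `perf_event_paranoid` setting. Whether that is the setting responsible for the failures above is an assumption; this is only a rough sketch for checking it, and the helper names are made up for illustration.

```python
from pathlib import Path

# Standard Linux sysctl file; it may be missing on other systems or in containers.
PARANOID_PATH = Path("/proc/sys/kernel/perf_event_paranoid")


def parse_paranoid(text):
    """Parse the file content into an int, or return None if it is not a number."""
    try:
        return int(text.strip())
    except ValueError:
        return None


def read_paranoid(path=PARANOID_PATH):
    """Return the current perf_event_paranoid level, or None if it cannot be read."""
    try:
        return parse_paranoid(path.read_text())
    except OSError:
        return None


def kernel_profiling_allowed(level):
    """Levels >= 2 disallow kernel profiling for unprivileged users.

    Returns None when the level is unknown.
    """
    if level is None:
        return None
    return level < 2


if __name__ == "__main__":
    level = read_paranoid()
    print("perf_event_paranoid:", level)
    print("unprivileged kernel profiling allowed:", kernel_profiling_allowed(level))
```

Some distributions patch in stricter levels above 2, so a value higher than 2 may restrict more than the upstream kernel documentation describes.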
The other 6 are very likely caused by the CUDA driver. We also see that with 570 on JUPITER, but don't see it on our other GH200 nodes with 580. |
We have |
|
Test report by @Micket |
|
Test report by @Micket |
|
I'd like to get this merged. Does anyone strongly disagree? |
I think with the build message that now appears, users are informed enough to either work around the problem or install it with |
|
Going in, thanks @Thyre! |
(created using eb --new-pr)
Requires:
TODO:
aarch64
dependencies to builddependencies
HPCViewer, while also interesting to have, needs to wait until we have GTK3/GTK4. This is blocked by a few more missing dependencies for now.
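As a rough illustration of the "dependencies to builddependencies" item: easyconfigs use Python syntax, with `dependencies` and `builddependencies` given as lists of `(name, version)` tuples. The sketch below shows what moving entries between the two lists amounts to. The package names and versions are placeholders, not the ones used in this PR, and `move_to_builddeps` is a hypothetical helper, not part of EasyBuild.

```python
def move_to_builddeps(dependencies, builddependencies, names):
    """Return new (dependencies, builddependencies) lists with the named
    entries moved from the former to the latter. Inputs are not modified."""
    moved = [dep for dep in dependencies if dep[0] in names]
    kept = [dep for dep in dependencies if dep[0] not in names]
    return kept, builddependencies + moved


# Placeholder entries, for illustration only.
dependencies = [
    ('ExampleRuntimeLib', '2.0'),
    ('ExampleHeaderOnlyLib', '1.3'),
]
builddependencies = [
    ('ExampleBuildTool', '1.0'),
]

dependencies, builddependencies = move_to_builddeps(
    dependencies, builddependencies, {'ExampleHeaderOnlyLib'}
)
```

Entries listed under `builddependencies` are only loaded while building, so they do not end up as dependencies of the generated module.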