MPI support by TimThuering · Pull Request #7 · SC-SGS/hardware_sampling

TimThuering · 2026-06-12T11:57:58Z

Summary

This PR adds optional MPI support to hws, enabling MPI-aware hardware sampling across distributed multi-process jobs.

API extensions

system_hardware_sampler(MPI_Comm, mpi_sampling_mode, ...) : MPI-aware constructors that take a communicator and a sampling mode. Passing hws::detail::mpi_sampling_mode::whole_node ensures that every device visible to at least one sampler is sampled exactly once, even if multiple system_hardware_sampler instances from different ranks see the same device.
start_sampling(MPI_Comm) / stop_sampling(MPI_Comm): synchronized start/stop with MPI barriers to align sampling windows across ranks
dump_yaml_global(filename, MPI_Comm): gathers YAML output from all ranks on rank 0 and writes a single combined file; available on both hardware_sampler and system_hardware_sampler
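To illustrate the "sampled exactly once" guarantee described for whole_node, here is a minimal sketch of one possible deduplication rule: each device goes to the lowest rank that sees it. This is not taken from the hws sources. The function name `assign_devices`, the rank-to-devices mapping, and the device identifiers are all hypothetical, and the MPI communication that would collect the visibility information is left out.

```python
from collections import defaultdict


def assign_devices(visible):
    """Map each rank to the devices it should sample.

    `visible` maps rank -> iterable of device identifiers that the rank can
    see (hypothetical identifiers, assumed unique within a node). Each device
    is assigned to the lowest rank that sees it, so it is sampled only once.
    """
    owner = {}
    for rank in sorted(visible):
        for device in visible[rank]:
            # The first (lowest) rank to report a device becomes its owner.
            owner.setdefault(device, rank)

    assignment = defaultdict(list)
    for device, rank in owner.items():
        assignment[rank].append(device)
    # Ranks that own no device still get an (empty) entry.
    return {rank: sorted(assignment[rank]) for rank in visible}


if __name__ == "__main__":
    # Ranks 0 and 1 both see gpu0; rank 2 sees only gpu2.
    visible = {0: ["gpu0", "gpu1"], 1: ["gpu0"], 2: ["gpu2"]}
    print(assign_devices(visible))
```

With this rule, a rank whose devices are all owned by lower ranks samples nothing, which is why it ends up with an empty list.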

Python bindings

mpi4py support
all new constructors, MPISamplingMode enum, start/stop with communicator overloads, and dump_yaml_global are exposed

CMake

new HWS_ENABLE_MPI_SUPPORT=AUTO|ON|OFF option (default AUTO). When MPI is found and Python bindings are enabled, mpi4py must be importable in the configured Python environment.

vancraar

Thanks, Tim, for this comprehensive MPI addition! The implementation is well-structured overall. I commented on a few points that caught my eye. What's your opinion on them?

vancraar · 2026-06-15T07:24:45Z

+    # Get mpi4py's C header location, simultaneously checking if mpi4py is importable in the current Python environment
+    execute_process(
+            COMMAND "${Python_EXECUTABLE}" -c
+            "import mpi4py, sys; sys.stdout.write(mpi4py.get_include())"


Is it possible that mpi4py is installed but broken, i.e., without a usable include path? Should we additionally check for that?
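One way to picture such an additional check, written in Python rather than CMake, is to verify that the path returned by `mpi4py.get_include()` is an existing directory. This is only a sketch of the idea, not the check that was added to the PR. The helper name `find_mpi4py_include` is hypothetical.

```python
import importlib.util
import os


def find_mpi4py_include():
    """Return mpi4py's include directory, or None if it cannot be used."""
    if importlib.util.find_spec("mpi4py") is None:
        return None  # mpi4py is not installed in this environment
    try:
        import mpi4py  # importing the package alone does not initialize MPI

        include_dir = mpi4py.get_include()
    except Exception:
        return None  # installed, but the import or the lookup failed
    if not include_dir or not os.path.isdir(include_dir):
        return None  # a path was reported, but it does not exist
    return include_dir


if __name__ == "__main__":
    print(find_mpi4py_include())
```

A build script could treat a `None` result the same way as a missing mpi4py installation.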

vancraar · 2026-06-15T17:35:35Z

Why do we now have to link against CUDA as PUBLIC?

vancraar · 2026-06-15T17:36:05Z

Same as with CUDA: why PUBLIC now?

vancraar · 2026-06-15T17:37:05Z

 #include <utility>    // std::move

+#if defined(HWS_MPI_SUPPORT_ENABLED)
+#include <mpi.h>        // MPI_Comm


vancraar · 2026-06-16T08:02:55Z

 ]
+# optional dependencies
+[project.optional-dependencies]
+mpi = ["mpi4py>=4"]


Why check for version >=4 here but not in the CMakeLists.txt? Should we do this consistently on both sides?
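As a rough illustration of what a matching version check could look like on the Python side, the sketch below compares the installed mpi4py major version against the `>=4` requirement from pyproject.toml. It is an assumption about one way to do this, not the CMake check from the PR. The helper names `major_version` and `mpi4py_meets_minimum` are hypothetical.

```python
import re
from importlib import metadata


def major_version(version):
    """Extract the leading major number from a version string like '4.0.1'."""
    match = re.match(r"\s*(\d+)", version)
    if match is None:
        raise ValueError(f"cannot parse version string: {version!r}")
    return int(match.group(1))


def mpi4py_meets_minimum(min_major=4):
    """Return True/False for the version check, or None if mpi4py is absent."""
    try:
        installed = metadata.version("mpi4py")
    except metadata.PackageNotFoundError:
        return None
    return major_version(installed) >= min_major


if __name__ == "__main__":
    print(mpi4py_meets_minimum())
```

Only the major version is compared here because the requirement in pyproject.toml is stated as `>=4`.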

vancraar · 2026-06-16T08:08:56Z

Should we really include the backend includes in the main utility.hpp, or wouldn't it be better to modularize them?

vancraar · 2026-06-16T08:13:09Z

Lines 70-111: the block inside else if (mode == detail::mpi_sampling_mode::whole_node) looks like it needs one more level of indentation. The #if defined preprocessor directives and detail::free_hostname_comm(nc) should be indented consistently with the code inside the conditional block.

vancraar · 2026-06-16T10:42:26Z

 #include <vector>       // std::vector

+#if defined(HWS_MPI_SUPPORT_ENABLED)
+#include <mpi.h>        // MPI_Comm, MPI_Gatherv, MPI_Gather, MPI_Initialized, MPI_Comm_rank, MPI_Comm_size


vancraar

Thanks, Tim, for addressing all my review points! The additional CMake check for a broken mpi4py include path, the mpi4py version check, reverting the GPU library linkage back to PRIVATE, and the cleanup of the backend includes from utility.hpp all look good now. The PR is in great shape – happy to approve!

TimThuering added 13 commits May 28, 2026 13:25

add MPI to CMake configuration

e7f623c

add global yaml dump with data from all MPI ranks to system_hardware_sampler

673917c

update documentation

1f113ef

add global yaml output for individual hardware sampler on all MPI ranks

772d226

add mpi4py compatibility for python bindings

9ae4c80

add dump_yaml_global to system_hardware_sampler python bindings

b3912c2

add dump_yaml_global to hardware_sampler python bindings

59caf70

added system hardware sampler creation which avoids duplicates for MPI with NVIDIA and AMD GPUs

0cfe527

added synchronous start and stop sampling for MPI

bea8838

python bindings for MPI-aware constructor and functions

04210fb

fixes for mpi backend with intel GPUs

89116b4

fix for python bindings

a7c6543

fix for non-mpi mode

45d5099

TimThuering requested a review from vancraar June 12, 2026 11:57

TimThuering added the enhancement label (New feature or request) Jun 12, 2026

vancraar reviewed Jun 16, 2026

TimThuering added 4 commits June 17, 2026 08:45

restructure MPI related utility functions

5532369

consistent formatting

99d7f10

add additional cmake check for mpi4py include path

1d3accf

add mpi4py version check to cmake

9fab189

vancraar approved these changes Jun 17, 2026

TimThuering wants to merge 17 commits into develop from feature/mpi-support
