Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement cudax::async_buffer #3460

Merged
merged 28 commits into from
Feb 28, 2025
Merged

Implement cudax::async_buffer #3460

merged 28 commits into from
Feb 28, 2025

Conversation

miscco
Copy link
Contributor

@miscco miscco commented Jan 21, 2025

This implements an experimental buffer type

It is effectively a simpler vector that only handles allocation and transfer of memory.

it is tagged so that we can statically verify execution spaces.

Furthermore, it is inherently asynchronous so it requires a stream on construction.

All operations are done on that stream, but the stream can be changed if the user desires so.

All operation are in stream order except if the user explicitly requires it through a method annotated with _unsynchronized

@miscco miscco requested review from a team as code owners January 21, 2025 15:57
@miscco miscco requested a review from alliepiper January 21, 2025 15:57
Copy link
Contributor

🟨 CI finished in 33m 03s: Pass: 80%/20 | Total: 2h 35m | Avg: 7m 45s | Max: 27m 00s
  • 🟨 cudax: Pass: 80%/20 | Total: 2h 35m | Avg: 7m 45s | Max: 27m 00s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/16  | Total:  2h 18m | Avg:  8m 38s | Max: 27m 00s
      🟩 arm64              Pass: 100%/4   | Total: 16m 46s | Avg:  4m 11s | Max:  4m 47s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  77%/18  | Total:  1h 41m | Avg:  5m 39s | Max: 13m 00s
      🟩 Test               Pass: 100%/2   | Total: 53m 09s | Avg: 26m 34s | Max: 27m 00s
    🟨 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 28s | Avg:  4m 28s | Max:  4m 28s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 51s | Avg:  4m 51s | Max:  4m 51s
      🟩 Clang17            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang18            Pass: 100%/4   | Total: 40m 01s | Avg: 10m 00s | Max: 26m 09s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
      🟩 GCC12              Pass: 100%/2   | Total: 32m 19s | Avg: 16m 09s | Max: 27m 00s
      🟩 GCC13              Pass: 100%/4   | Total: 15m 19s | Avg:  3m 49s | Max:  4m 10s
      🟥 MSVC14.36          Pass:   0%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 13m 00s | Avg: 13m 00s | Max: 13m 00s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 48s
    🟨 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 59m 33s | Avg:  7m 26s | Max: 26m 09s
      🟩 GCC                Pass: 100%/8   | Total: 56m 53s | Avg:  7m 06s | Max: 27m 00s
      🟥 MSVC               Pass:   0%/2   | Total: 25m 36s | Avg: 12m 48s | Max: 13m 00s
      🟥 NVHPC              Pass:   0%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 48s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  80%/20  | Total:  2h 35m | Avg:  7m 45s | Max: 27m 00s
    🟨 gpu
      🟨 v100               Pass:  80%/20  | Total:  2h 35m | Avg:  7m 45s | Max: 27m 00s
    🟨 ctk
      🟥 12.0               Pass:   0%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟥 12.5               Pass:   0%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 48s
      🟨 12.6               Pass:  94%/17  | Total:  2h 09m | Avg:  7m 36s | Max: 27m 00s
    🟨 cudacxx
      🟥 nvcc12.0           Pass:   0%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 48s
      🟨 nvcc12.6           Pass:  94%/17  | Total:  2h 09m | Avg:  7m 36s | Max: 27m 00s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s
      🟩 90a                Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
    🟨 std
      🟨 17                 Pass:  75%/4   | Total: 19m 02s | Avg:  4m 45s | Max:  6m 48s
      🟨 20                 Pass:  81%/16  | Total:  2h 16m | Avg:  8m 30s | Max: 27m 00s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 20)

# Runner
12 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-v100-latest-1

@miscco miscco requested a review from a team as a code owner January 28, 2025 13:13
@miscco miscco requested a review from ericniebler January 28, 2025 13:13
Copy link
Contributor

🟨 CI finished in 1h 16m: Pass: 94%/157 | Total: 1d 00h | Avg: 9m 25s | Max: 45m 21s | Hits: 422%/10928
  • 🟨 libcudacxx: Pass: 90%/43 | Total: 6h 37m | Avg: 9m 14s | Max: 29m 47s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  90%/41  | Total:  6h 30m | Avg:  9m 31s | Max: 29m 47s
      🟩 arm64              Pass: 100%/2   | Total:  6m 56s | Avg:  3m 28s | Max:  3m 40s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 11m | Avg: 17m 46s | Max: 23m 33s
      🔍 nvcc               Pass:  89%/39  | Total:  5h 26m | Avg:  8m 21s | Max: 29m 47s
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/18  | Total:  2h 46m | Avg:  9m 13s | Max: 24m 32s
      🟩 GCC                Pass: 100%/19  | Total:  2h 41m | Avg:  8m 31s | Max: 29m 47s
      🔥 MSVC               Pass:   0%/4   | Total: 52m 22s | Avg: 13m 05s | Max: 15m 14s
      🟩 NVHPC              Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max:  8m 27s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  89%/38  | Total:  4h 50m | Avg:  7m 39s | Max: 23m 33s
      🟩 NVRTC              Pass: 100%/2   | Total: 57m 45s | Avg: 28m 52s | Max: 29m 47s
      🟩 Test               Pass: 100%/2   | Total: 46m 36s | Avg: 23m 18s | Max: 24m 32s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟨 ctk
      🟨 12.0               Pass:  80%/5   | Total: 25m 07s | Avg:  5m 01s | Max: 10m 21s
      🟩 12.5               Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max:  8m 27s
      🟨 12.6               Pass:  91%/36  | Total:  5h 55m | Avg:  9m 52s | Max: 29m 47s
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 11m | Avg: 17m 46s | Max: 23m 33s
      🟨 nvcc12.0           Pass:  80%/5   | Total: 25m 07s | Avg:  5m 01s | Max: 10m 21s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max:  8m 27s
      🟨 nvcc12.6           Pass:  90%/32  | Total:  4h 44m | Avg:  8m 52s | Max: 29m 47s
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 17m 02s | Avg:  4m 15s | Max:  4m 54s
      🟩 Clang15            Pass: 100%/2   | Total: 23m 07s | Avg: 11m 33s | Max: 18m 39s
      🟩 Clang16            Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 56s
      🟩 Clang17            Pass: 100%/2   | Total:  8m 57s | Avg:  4m 28s | Max:  4m 33s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 47m | Avg: 13m 28s | Max: 24m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 03s | Avg:  3m 31s | Max:  3m 36s
      🟩 GCC8               Pass: 100%/1   | Total: 14m 51s | Avg: 14m 51s | Max: 14m 51s
      🟩 GCC9               Pass: 100%/2   | Total: 20m 11s | Avg: 10m 05s | Max: 16m 34s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 46s | Avg:  3m 53s | Max:  3m 57s
      🟩 GCC11              Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 59s
      🟩 GCC12              Pass: 100%/2   | Total:  7m 50s | Avg:  3m 55s | Max:  3m 56s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 36m | Avg: 12m 05s | Max: 29m 47s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 23m 53s | Avg: 11m 56s | Max: 13m 32s
      🟥 MSVC14.39          Pass:   0%/2   | Total: 28m 29s | Avg: 14m 14s | Max: 15m 14s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max:  8m 27s
    🟨 gpu
      🟨 v100               Pass:  90%/43  | Total:  6h 37m | Avg:  9m 14s | Max: 29m 47s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 36s | Avg: 12m 36s | Max: 12m 36s
      🟩 90a                Pass: 100%/2   | Total: 17m 35s | Avg:  8m 47s | Max: 13m 48s
    🟨 std
      🟨 17                 Pass:  85%/21  | Total:  3h 01m | Avg:  8m 38s | Max: 29m 47s
      🟨 20                 Pass:  95%/21  | Total:  3h 33m | Avg: 10m 10s | Max: 27m 58s
    
  • 🟨 cudax: Pass: 80%/20 | Total: 1h 32m | Avg: 4m 37s | Max: 12m 52s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/16  | Total:  1h 20m | Avg:  5m 01s | Max: 12m 52s
      🟩 arm64              Pass: 100%/4   | Total: 12m 14s | Avg:  3m 03s | Max:  3m 12s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  77%/18  | Total:  1h 23m | Avg:  4m 38s | Max: 12m 52s
      🟩 Test               Pass: 100%/2   | Total:  8m 59s | Avg:  4m 29s | Max:  4m 33s
    🟨 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 Clang18            Pass: 100%/4   | Total: 14m 31s | Avg:  3m 37s | Max:  4m 26s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 GCC12              Pass: 100%/2   | Total:  8m 12s | Avg:  4m 06s | Max:  4m 33s
      🟩 GCC13              Pass: 100%/4   | Total: 11m 52s | Avg:  2m 58s | Max:  3m 07s
      🟥 MSVC14.36          Pass:   0%/1   | Total:  9m 40s | Avg:  9m 40s | Max:  9m 40s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 12m 52s | Avg: 12m 52s | Max: 12m 52s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 32s
    🟨 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 29m 49s | Avg:  3m 43s | Max:  4m 26s
      🟩 GCC                Pass: 100%/8   | Total: 27m 16s | Avg:  3m 24s | Max:  4m 33s
      🟥 MSVC               Pass:   0%/2   | Total: 22m 32s | Avg: 11m 16s | Max: 12m 52s
      🟥 NVHPC              Pass:   0%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 32s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  80%/20  | Total:  1h 32m | Avg:  4m 37s | Max: 12m 52s
    🟨 gpu
      🟨 v100               Pass:  80%/20  | Total:  1h 32m | Avg:  4m 37s | Max: 12m 52s
    🟨 ctk
      🟥 12.0               Pass:   0%/1   | Total:  9m 40s | Avg:  9m 40s | Max:  9m 40s
      🟥 12.5               Pass:   0%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 32s
      🟨 12.6               Pass:  94%/17  | Total:  1h 09m | Avg:  4m 06s | Max: 12m 52s
    🟨 cudacxx
      🟥 nvcc12.0           Pass:   0%/1   | Total:  9m 40s | Avg:  9m 40s | Max:  9m 40s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 32s
      🟨 nvcc12.6           Pass:  94%/17  | Total:  1h 09m | Avg:  4m 06s | Max: 12m 52s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 46s | Avg:  2m 46s | Max:  2m 46s
      🟩 90a                Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s
    🟨 std
      🟨 17                 Pass:  75%/4   | Total: 15m 08s | Avg:  3m 47s | Max:  6m 26s
      🟨 20                 Pass:  81%/16  | Total:  1h 17m | Avg:  4m 50s | Max: 12m 52s
    
  • 🟨 thrust: Pass: 97%/43 | Total: 7h 01m | Avg: 9m 48s | Max: 32m 41s | Hits: 365%/7376

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/41  | Total:  6h 52m | Avg: 10m 03s | Max: 32m 41s | Hits: 365%/7376  
      🟩 arm64              Pass: 100%/2   | Total:  9m 37s | Avg:  4m 48s | Max:  5m 00s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total: 43m 00s | Avg:  8m 36s | Max: 23m 23s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 29m 34s | Avg: 14m 47s | Max: 14m 57s
      🔍 12.6               Pass:  97%/36  | Total:  5h 49m | Avg:  9m 42s | Max: 32m 41s | Hits: 365%/5532  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 47s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 43m 00s | Avg:  8m 36s | Max: 23m 23s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 29m 34s | Avg: 14m 47s | Max: 14m 57s
      🔍 nvcc12.6           Pass:  97%/34  | Total:  5h 38m | Avg:  9m 56s | Max: 32m 41s | Hits: 365%/5532  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 47s
      🔍 nvcc               Pass:  97%/41  | Total:  6h 50m | Avg: 10m 00s | Max: 32m 41s | Hits: 365%/7376  
    🔍 cxx: MSVC14.39 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  5m 54s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 48s | Avg:  5m 54s | Max:  6m 02s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 51s | Avg:  5m 55s | Max:  5m 59s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 49s
      🟩 Clang18            Pass: 100%/7   | Total: 50m 00s | Avg:  7m 08s | Max: 14m 57s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 49s | Avg:  5m 24s | Max:  5m 50s
      🟩 GCC8               Pass: 100%/1   | Total:  6m 10s | Avg:  6m 10s | Max:  6m 10s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  6m 08s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 24s | Avg:  5m 42s | Max:  5m 49s
      🟩 GCC11              Pass: 100%/2   | Total: 12m 25s | Avg:  6m 12s | Max:  6m 13s
      🟩 GCC12              Pass: 100%/2   | Total: 11m 41s | Avg:  5m 50s | Max:  5m 56s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 15m | Avg:  9m 28s | Max: 24m 59s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 52m 17s | Avg: 26m 08s | Max: 28m 54s | Hits: 365%/3688  
      🔍 MSVC14.39          Pass:  66%/3   | Total:  1h 34m | Avg: 31m 23s | Max: 32m 41s | Hits: 365%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 29m 34s | Avg: 14m 47s | Max: 14m 57s
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/17  | Total:  1h 46m | Avg:  6m 15s | Max: 14m 57s
      🟩 GCC                Pass: 100%/19  | Total:  2h 19m | Avg:  7m 20s | Max: 24m 59s
      🔍 MSVC               Pass:  80%/5   | Total:  2h 26m | Avg: 29m 17s | Max: 32m 41s | Hits: 365%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total: 29m 34s | Avg: 14m 47s | Max: 14m 57s
    🔍 jobs: TestCPU 🔍
      🟩 Build              Pass: 100%/37  | Total:  5h 18m | Avg:  8m 36s | Max: 32m 41s | Hits: 365%/7376  
      🔍 TestCPU            Pass:  66%/3   | Total: 48m 14s | Avg: 16m 04s | Max: 32m 29s
      🟩 TestGPU            Pass: 100%/3   | Total: 54m 52s | Avg: 18m 17s | Max: 24m 59s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total:  3h 07m | Avg:  9m 21s | Max: 28m 59s | Hits: 365%/5532  
      🔍 20                 Pass:  95%/21  | Total:  3h 33m | Avg: 10m 10s | Max: 32m 41s | Hits: 365%/1844  
    🟨 gpu
      🟨 v100               Pass:  97%/43  | Total:  7h 01m | Avg:  9m 48s | Max: 32m 41s | Hits: 365%/7376  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 21m 03s | Avg: 10m 31s | Max: 14m 56s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 50s | Avg:  4m 50s | Max:  4m 50s
    
  • 🟩 cub: Pass: 100%/44 | Total: 8h 13m | Avg: 11m 12s | Max: 37m 19s | Hits: 540%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  8h 03m | Avg: 11m 30s | Max: 37m 19s | Hits: 540%/3552  
      🟩 arm64              Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 15s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 46m 46s | Avg:  9m 21s | Max: 25m 01s | Hits: 540%/888   
      🟩 12.5               Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 55s
      🟩 12.6               Pass: 100%/37  | Total:  7h 08m | Avg: 11m 35s | Max: 37m 19s | Hits: 540%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 43s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 46s | Avg:  9m 21s | Max: 25m 01s | Hits: 540%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 55s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 59m | Avg: 11m 59s | Max: 37m 19s | Hits: 540%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 53s | Avg:  4m 26s | Max:  4m 43s
      🟩 nvcc               Pass: 100%/42  | Total:  8h 04m | Avg: 11m 31s | Max: 37m 19s | Hits: 540%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 23s | Avg:  5m 20s | Max:  5m 29s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 04s | Avg:  5m 32s | Max:  5m 45s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 46s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 01s | Avg:  5m 30s | Max:  5m 39s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 32m | Avg: 13m 16s | Max: 37m 19s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 53s | Avg:  5m 26s | Max:  5m 38s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 07s | Avg:  5m 33s | Max:  5m 46s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 43s | Avg:  5m 51s | Max:  5m 52s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 43s | Avg:  5m 51s | Max:  5m 55s
      🟩 GCC12              Pass: 100%/4   | Total: 39m 11s | Avg:  9m 47s | Max: 22m 52s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 06m | Avg: 15m 50s | Max: 32m 05s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 52m 31s | Avg: 26m 15s | Max: 27m 30s | Hits: 540%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 58m 25s | Avg: 29m 12s | Max: 30m 42s | Hits: 540%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 27m | Avg:  8m 41s | Max: 37m 19s
      🟩 GCC                Pass: 100%/21  | Total:  3h 36m | Avg: 10m 19s | Max: 32m 05s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 50m | Avg: 27m 44s | Max: 30m 42s | Hits: 540%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 46s | Avg:  8m 53s | Max:  8m 55s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 23s | Avg: 13m 41s | Max: 22m 52s
      🟩 v100               Pass: 100%/42  | Total:  7h 45m | Avg: 11m 05s | Max: 37m 19s | Hits: 540%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 56m | Avg:  8m 01s | Max: 30m 42s | Hits: 540%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 32m 05s | Avg: 32m 05s | Max: 32m 05s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 05s | Avg: 19m 05s | Max: 19m 05s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 24m | Avg: 28m 19s | Max: 37m 19s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 04s | Max: 30m 31s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 23s | Avg: 13m 41s | Max: 22m 52s
      🟩 90a                Pass: 100%/1   | Total:  4m 17s | Avg:  4m 17s | Max:  4m 17s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 58m | Avg:  8m 54s | Max: 27m 43s | Hits: 540%/2664  
      🟩 20                 Pass: 100%/24  | Total:  5h 15m | Avg: 13m 07s | Max: 37m 19s | Hits: 540%/888   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 20m 04s | Avg: 5m 01s | Max: 5m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 20m 04s | Avg:  5m 01s | Max:  5m 35s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 01s
      🟩 12.6               Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 35s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 56s | Avg:  4m 58s | Max:  5m 01s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 08s | Avg:  5m 04s | Max:  5m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 20m 04s | Avg:  5m 01s | Max:  5m 35s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 55s | Avg:  4m 55s | Max:  4m 55s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 33s | Avg:  4m 33s | Max:  4m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 10m 36s | Avg:  5m 18s | Max:  5m 35s
      🟩 GCC                Pass: 100%/2   | Total:  9m 28s | Avg:  4m 44s | Max:  4m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 20m 04s | Avg:  5m 01s | Max:  5m 35s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 20m 04s | Avg:  5m 01s | Max:  5m 35s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 19s | Avg: 4m 39s | Max: 7m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 19s | Avg:  4m 39s | Max:  7m 16s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 03s | Avg:  2m 03s | Max:  2m 03s
      🟩 Test               Pass: 100%/1   | Total:  7m 16s | Avg:  7m 16s | Max:  7m 16s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 21s | Avg: 45m 21s | Max: 45m 21s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 157)

# Runner
110 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

github-actions bot commented Feb 3, 2025

🟩 CI finished in 26m 13s: Pass: 100%/20 | Total: 1h 41m | Avg: 5m 03s | Max: 12m 18s | Hits: 312%/552
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 41m | Avg: 5m 03s | Max: 12m 18s | Hits: 312%/552

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 30m | Avg:  5m 37s | Max: 12m 18s | Hits: 312%/552   
      🟩 arm64              Pass: 100%/4   | Total: 11m 00s | Avg:  2m 45s | Max:  2m 47s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 33s | Avg: 10m 33s | Max: 10m 33s | Hits: 312%/276   
      🟩 12.5               Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 33s
      🟩 12.8               Pass: 100%/17  | Total:  1h 19m | Avg:  4m 41s | Max: 12m 18s | Hits: 312%/276   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 33s | Avg: 10m 33s | Max: 10m 33s | Hits: 312%/276   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 33s
      🟩 nvcc12.8           Pass: 100%/17  | Total:  1h 19m | Avg:  4m 41s | Max: 12m 18s | Hits: 312%/276   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 41m | Avg:  5m 03s | Max: 12m 18s | Hits: 312%/552   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 26s | Avg:  3m 26s | Max:  3m 26s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 Clang18            Pass: 100%/4   | Total: 21m 08s | Avg:  5m 17s | Max: 12m 18s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s
      🟩 GCC12              Pass: 100%/2   | Total: 15m 46s | Avg:  7m 53s | Max: 12m 17s
      🟩 GCC13              Pass: 100%/4   | Total: 11m 27s | Avg:  2m 51s | Max:  3m 02s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 33s | Avg: 10m 33s | Max: 10m 33s | Hits: 312%/276   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 20s | Avg: 11m 20s | Max: 11m 20s | Hits: 312%/276   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 34m 46s | Avg:  4m 20s | Max: 12m 18s
      🟩 GCC                Pass: 100%/8   | Total: 33m 31s | Avg:  4m 11s | Max: 12m 17s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 53s | Avg: 10m 56s | Max: 11m 20s | Hits: 312%/552   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  5m 33s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 41m | Avg:  5m 03s | Max: 12m 18s | Hits: 312%/552   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 16m | Avg:  4m 15s | Max: 11m 20s | Hits: 312%/552   
      🟩 Test               Pass: 100%/2   | Total: 24m 35s | Avg: 12m 17s | Max: 12m 18s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 54s | Avg:  2m 54s | Max:  2m 54s
      🟩 90a                Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 58s | Avg:  3m 29s | Max:  5m 33s
      🟩 20                 Pass: 100%/16  | Total:  1h 27m | Avg:  5m 26s | Max: 12m 18s | Hits: 312%/552   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 20)

# Runner
12 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1

Copy link
Contributor

github-actions bot commented Feb 4, 2025

🟩 CI finished in 26m 06s: Pass: 100%/20 | Total: 1h 57m | Avg: 5m 52s | Max: 13m 15s | Hits: 314%/552
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 57m | Avg: 5m 52s | Max: 13m 15s | Hits: 314%/552

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 43m | Avg:  6m 29s | Max: 13m 15s | Hits: 314%/552   
      🟩 arm64              Pass: 100%/4   | Total: 13m 53s | Avg:  3m 28s | Max:  3m 32s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits: 314%/276   
      🟩 12.5               Pass: 100%/2   | Total: 14m 41s | Avg:  7m 20s | Max:  7m 33s
      🟩 12.8               Pass: 100%/17  | Total:  1h 30m | Avg:  5m 20s | Max: 13m 15s | Hits: 314%/276   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits: 314%/276   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 14m 41s | Avg:  7m 20s | Max:  7m 33s
      🟩 nvcc12.8           Pass: 100%/17  | Total:  1h 30m | Avg:  5m 20s | Max: 13m 15s | Hits: 314%/276   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 57m | Avg:  5m 52s | Max: 13m 15s | Hits: 314%/552   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 48s | Avg:  3m 48s | Max:  3m 48s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/4   | Total: 23m 04s | Avg:  5m 46s | Max: 11m 52s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 GCC12              Pass: 100%/2   | Total: 17m 14s | Avg:  8m 37s | Max: 13m 15s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 36s | Avg:  3m 24s | Max:  3m 32s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 12m 12s | Avg: 12m 12s | Max: 12m 12s | Hits: 314%/276   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 24s | Avg: 12m 24s | Max: 12m 24s | Hits: 314%/276   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 14m 41s | Avg:  7m 20s | Max:  7m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 39m 47s | Avg:  4m 58s | Max: 11m 52s
      🟩 GCC                Pass: 100%/8   | Total: 38m 33s | Avg:  4m 49s | Max: 13m 15s
      🟩 MSVC               Pass: 100%/2   | Total: 24m 36s | Avg: 12m 18s | Max: 12m 24s | Hits: 314%/552   
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 41s | Avg:  7m 20s | Max:  7m 33s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 57m | Avg:  5m 52s | Max: 13m 15s | Hits: 314%/552   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 32m | Avg:  5m 08s | Max: 12m 24s | Hits: 314%/552   
      🟩 Test               Pass: 100%/2   | Total: 25m 07s | Avg: 12m 33s | Max: 13m 15s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 90a                Pass: 100%/1   | Total:  3m 21s | Avg:  3m 21s | Max:  3m 21s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 23s | Avg:  4m 20s | Max:  7m 08s
      🟩 20                 Pass: 100%/16  | Total:  1h 40m | Avg:  6m 15s | Max: 13m 15s | Hits: 314%/552   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 20)

# Runner
12 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1

@miscco miscco force-pushed the async_buffer branch 2 times, most recently from b945b88 to b6bfe65 Compare February 10, 2025 10:25
Copy link
Contributor

🟩 CI finished in 24m 18s: Pass: 100%/20 | Total: 2h 01m | Avg: 6m 04s | Max: 13m 22s | Hits: 89%/10380
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 01m | Avg: 6m 04s | Max: 13m 22s | Hits: 89%/10380

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 45m | Avg:  6m 37s | Max: 13m 22s | Hits:  88%/8108  
      🟩 arm64              Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  4m 08s | Hits:  90%/2272  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 52s | Avg: 10m 52s | Max: 10m 52s | Hits:  55%/276   
      🟩 12.5               Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 34s | Hits:  80%/736   
      🟩 12.8               Pass: 100%/17  | Total:  1h 37m | Avg:  5m 43s | Max: 13m 22s | Hits:  90%/9368  
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 52s | Avg: 10m 52s | Max: 10m 52s | Hits:  55%/276   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 34s | Hits:  80%/736   
      🟩 nvcc12.8           Pass: 100%/17  | Total:  1h 37m | Avg:  5m 43s | Max: 13m 22s | Hits:  90%/9368  
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 01m | Avg:  6m 04s | Max: 13m 22s | Hits:  89%/10380 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s | Hits:  91%/570   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 48s | Avg:  4m 48s | Max:  4m 48s | Hits:  91%/568   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 59s | Avg:  4m 59s | Max:  4m 59s | Hits:  91%/568   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s | Hits:  91%/568   
      🟩 Clang18            Pass: 100%/4   | Total: 24m 20s | Avg:  6m 05s | Max: 12m 04s | Hits:  93%/2272  
      🟩 GCC10              Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s | Hits:  90%/570   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s | Hits:  90%/568   
      🟩 GCC12              Pass: 100%/2   | Total: 18m 23s | Avg:  9m 11s | Max: 13m 22s | Hits:  95%/1136  
      🟩 GCC13              Pass: 100%/4   | Total: 15m 52s | Avg:  3m 58s | Max:  4m 08s | Hits:  90%/2272  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 52s | Avg: 10m 52s | Max: 10m 52s | Hits:  55%/276   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 10m 34s | Avg: 10m 34s | Max: 10m 34s | Hits:  55%/276   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 34s | Hits:  80%/736   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 43m 09s | Avg:  5m 23s | Max: 12m 04s | Hits:  92%/4546  
      🟩 GCC                Pass: 100%/8   | Total: 43m 43s | Avg:  5m 27s | Max: 13m 22s | Hits:  91%/4546  
      🟩 MSVC               Pass: 100%/2   | Total: 21m 26s | Avg: 10m 43s | Max: 10m 52s | Hits:  55%/552   
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 34s | Hits:  80%/736   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 01m | Avg:  6m 04s | Max: 13m 22s | Hits:  89%/10380 
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 35m | Avg:  5m 19s | Max: 10m 52s | Hits:  87%/9244  
      🟩 Test               Pass: 100%/2   | Total: 25m 26s | Avg: 12m 43s | Max: 13m 22s | Hits:  99%/1136  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s | Hits:  90%/568   
      🟩 90a                Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s | Hits:  90%/568   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 06s | Avg:  4m 31s | Max:  6m 31s | Hits:  89%/2072  
      🟩 20                 Pass: 100%/16  | Total:  1h 43m | Avg:  6m 27s | Max: 13m 22s | Hits:  89%/8308  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 20)

# Runner
12 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1

Copy link
Contributor

🟩 CI finished in 1h 26m: Pass: 100%/20 | Total: 3h 36m | Avg: 10m 50s | Max: 13m 32s | Hits: 68%/10380
  • 🟩 cudax: Pass: 100%/20 | Total: 3h 36m | Avg: 10m 50s | Max: 13m 32s | Hits: 68%/10380

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  2h 57m | Avg: 11m 06s | Max: 13m 32s | Hits:  70%/8108  
      🟩 arm64              Pass: 100%/4   | Total: 38m 53s | Avg:  9m 43s | Max: 10m 31s | Hits:  64%/2272  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 12m 09s | Avg: 12m 09s | Max: 12m 09s | Hits:  55%/276   
      🟩 12.5               Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max:  8m 50s | Hits:  66%/736   
      🟩 12.8               Pass: 100%/17  | Total:  3h 07m | Avg: 11m 01s | Max: 13m 32s | Hits:  69%/9368  
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 12m 09s | Avg: 12m 09s | Max: 12m 09s | Hits:  55%/276   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max:  8m 50s | Hits:  66%/736   
      🟩 nvcc12.8           Pass: 100%/17  | Total:  3h 07m | Avg: 11m 01s | Max: 13m 32s | Hits:  69%/9368  
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  3h 36m | Avg: 10m 50s | Max: 13m 32s | Hits:  68%/10380 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 10m 39s | Avg: 10m 39s | Max: 10m 39s | Hits:  66%/570   
      🟩 Clang15            Pass: 100%/1   | Total: 12m 48s | Avg: 12m 48s | Max: 12m 48s | Hits:  59%/568   
      🟩 Clang16            Pass: 100%/1   | Total: 12m 52s | Avg: 12m 52s | Max: 12m 52s | Hits:  66%/568   
      🟩 Clang17            Pass: 100%/1   | Total: 11m 45s | Avg: 11m 45s | Max: 11m 45s | Hits:  70%/568   
      🟩 Clang18            Pass: 100%/4   | Total: 43m 14s | Avg: 10m 48s | Max: 12m 37s | Hits:  72%/2272  
      🟩 GCC10              Pass: 100%/1   | Total: 11m 24s | Avg: 11m 24s | Max: 11m 24s | Hits:  68%/570   
      🟩 GCC11              Pass: 100%/1   | Total: 12m 15s | Avg: 12m 15s | Max: 12m 15s | Hits:  58%/568   
      🟩 GCC12              Pass: 100%/2   | Total: 26m 05s | Avg: 13m 02s | Max: 13m 32s | Hits:  81%/1136  
      🟩 GCC13              Pass: 100%/4   | Total: 34m 22s | Avg:  8m 35s | Max: 10m 22s | Hits:  68%/2272  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 09s | Avg: 12m 09s | Max: 12m 09s | Hits:  55%/276   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 56s | Avg: 11m 56s | Max: 11m 56s | Hits:  55%/276   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max:  8m 50s | Hits:  66%/736   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  1h 31m | Avg: 11m 24s | Max: 12m 52s | Hits:  69%/4546  
      🟩 GCC                Pass: 100%/8   | Total:  1h 24m | Avg: 10m 30s | Max: 13m 32s | Hits:  70%/4546  
      🟩 MSVC               Pass: 100%/2   | Total: 24m 05s | Avg: 12m 02s | Max: 12m 09s | Hits:  55%/552   
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 12s | Avg:  8m 36s | Max:  8m 50s | Hits:  66%/736   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/20  | Total:  3h 36m | Avg: 10m 50s | Max: 13m 32s | Hits:  68%/10380 
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  3h 11m | Avg: 10m 38s | Max: 13m 32s | Hits:  65%/9244  
      🟩 Test               Pass: 100%/2   | Total: 25m 10s | Avg: 12m 35s | Max: 12m 37s | Hits:  99%/1136  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  6m 54s | Avg:  6m 54s | Max:  6m 54s | Hits:  71%/568   
      🟩 90a                Pass: 100%/1   | Total:  7m 54s | Avg:  7m 54s | Max:  7m 54s | Hits:  72%/568   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 33m 16s | Avg:  8m 19s | Max:  9m 12s | Hits:  67%/2072  
      🟩 20                 Pass: 100%/16  | Total:  3h 03m | Avg: 11m 27s | Max: 13m 32s | Hits:  69%/8308  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 20)

# Runner
12 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1

@jrhemstad jrhemstad requested review from pciolkosz and removed request for alliepiper February 12, 2025 17:35
Copy link
Contributor

🟩 CI finished in 24m 01s: Pass: 100%/22 | Total: 2h 29m | Avg: 6m 48s | Max: 17m 26s | Hits: 88%/11574
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 29m | Avg: 6m 48s | Max: 17m 26s | Hits: 88%/11574

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  2h 13m | Avg:  7m 25s | Max: 17m 26s | Hits:  88%/9290  
      🟩 arm64              Pass: 100%/4   | Total: 16m 08s | Avg:  4m 02s | Max:  4m 16s | Hits:  89%/2284  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 17m 26s | Avg: 17m 26s | Max: 17m 26s | Hits:  55%/277   
      🟩 12.5               Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 20s | Hits:  79%/738   
      🟩 12.8               Pass: 100%/19  | Total:  1h 57m | Avg:  6m 12s | Max: 13m 49s | Hits:  90%/10559 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 17m 26s | Avg: 17m 26s | Max: 17m 26s | Hits:  55%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 20s | Hits:  79%/738   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 57m | Avg:  6m 12s | Max: 13m 49s | Hits:  90%/10559 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 29m | Avg:  6m 48s | Max: 17m 26s | Hits:  88%/11574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 27s | Avg:  4m 27s | Max:  4m 27s | Hits:  89%/573   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s | Hits:  89%/571   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s | Hits:  89%/571   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s | Hits:  89%/571   
      🟩 Clang18            Pass: 100%/4   | Total: 25m 42s | Avg:  6m 25s | Max: 13m 15s | Hits:  92%/2284  
      🟩 GCC10              Pass: 100%/1   | Total:  5m 04s | Avg:  5m 04s | Max:  5m 04s | Hits:  89%/573   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 53s | Avg:  4m 53s | Max:  4m 53s | Hits:  89%/571   
      🟩 GCC12              Pass: 100%/2   | Total: 17m 08s | Avg:  8m 34s | Max: 12m 11s | Hits:  94%/1142  
      🟩 GCC13              Pass: 100%/6   | Total: 34m 00s | Avg:  5m 40s | Max: 13m 49s | Hits:  90%/3426  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 17m 26s | Avg: 17m 26s | Max: 17m 26s | Hits:  55%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 12m 39s | Avg: 12m 39s | Max: 12m 39s | Hits:  55%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 20s | Hits:  79%/738   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 44m 09s | Avg:  5m 31s | Max: 13m 15s | Hits:  90%/4570  
      🟩 GCC                Pass: 100%/10  | Total:  1h 01m | Avg:  6m 06s | Max: 13m 49s | Hits:  91%/5712  
      🟩 MSVC               Pass: 100%/2   | Total: 30m 05s | Avg: 15m 02s | Max: 17m 26s | Hits:  55%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 20s | Hits:  79%/738   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 56s | Avg:  8m 58s | Max: 13m 49s | Hits:  94%/1142  
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 11m | Avg:  6m 35s | Max: 17m 26s | Hits:  88%/10432 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 50m | Avg:  5m 48s | Max: 17m 26s | Hits:  86%/9861  
      🟩 Test               Pass: 100%/3   | Total: 39m 15s | Avg: 13m 05s | Max: 13m 49s | Hits:  99%/1713  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 21m 47s | Avg:  7m 15s | Max: 13m 49s | Hits:  92%/1713  
      🟩 90a                Pass: 100%/1   | Total:  3m 57s | Avg:  3m 57s | Max:  3m 57s | Hits:  89%/571   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 47s | Avg:  4m 41s | Max:  7m 02s | Hits:  87%/2082  
      🟩 20                 Pass: 100%/18  | Total:  2h 10m | Avg:  7m 16s | Max: 17m 26s | Hits:  88%/9492  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 22)

# Runner
13 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@miscco
Copy link
Contributor Author

miscco commented Feb 28, 2025

__ensure_current_device(stream)

addressed the remaining comments

Copy link
Contributor

🟩 CI finished in 22m 58s: Pass: 100%/22 | Total: 2h 15m | Avg: 6m 10s | Max: 16m 12s | Hits: 95%/11722
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 15m | Avg: 6m 10s | Max: 16m 12s | Hits: 95%/11722

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  2h 01m | Avg:  6m 43s | Max: 16m 12s | Hits:  94%/9406  
      🟩 arm64              Pass: 100%/4   | Total: 14m 40s | Avg:  3m 40s | Max:  3m 49s | Hits:  96%/2316  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 13s | Avg: 11m 13s | Max: 11m 13s | Hits:  57%/277   
      🟩 12.5               Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 12s | Hits:  91%/742   
      🟩 12.8               Pass: 100%/19  | Total:  1h 52m | Avg:  5m 54s | Max: 16m 12s | Hits:  96%/10703 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 13s | Avg: 11m 13s | Max: 11m 13s | Hits:  57%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 12s | Hits:  91%/742   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 52m | Avg:  5m 54s | Max: 16m 12s | Hits:  96%/10703 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 15m | Avg:  6m 10s | Max: 16m 12s | Hits:  95%/11722 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s | Hits:  97%/581   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s | Hits:  97%/579   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s | Hits:  97%/579   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s | Hits:  97%/579   
      🟩 Clang18            Pass: 100%/4   | Total: 27m 34s | Avg:  6m 53s | Max: 16m 12s | Hits:  97%/2316  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s | Hits:  96%/581   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s | Hits:  96%/579   
      🟩 GCC12              Pass: 100%/2   | Total: 17m 25s | Avg:  8m 42s | Max: 13m 11s | Hits:  98%/1158  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 55s | Avg:  5m 19s | Max: 13m 56s | Hits:  97%/3474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 13s | Avg: 11m 13s | Max: 11m 13s | Hits:  57%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 10m 32s | Avg: 10m 32s | Max: 10m 32s | Hits:  57%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 12s | Hits:  91%/742   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 44m 30s | Avg:  5m 33s | Max: 16m 12s | Hits:  97%/4634  
      🟩 GCC                Pass: 100%/10  | Total: 57m 13s | Avg:  5m 43s | Max: 13m 56s | Hits:  97%/5792  
      🟩 MSVC               Pass: 100%/2   | Total: 21m 45s | Avg: 10m 52s | Max: 11m 13s | Hits:  57%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 20s | Avg:  6m 10s | Max:  6m 12s | Hits:  91%/742   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max: 13m 56s | Hits:  98%/1158  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 58m | Avg:  5m 55s | Max: 16m 12s | Hits:  94%/10564 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 32m | Avg:  4m 52s | Max: 11m 13s | Hits:  94%/9985  
      🟩 Test               Pass: 100%/3   | Total: 43m 19s | Avg: 14m 26s | Max: 16m 12s | Hits:  99%/1737  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 20m 46s | Avg:  6m 55s | Max: 13m 56s | Hits:  97%/1737  
      🟩 90a                Pass: 100%/1   | Total:  3m 32s | Avg:  3m 32s | Max:  3m 32s | Hits:  96%/579   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 06s | Avg:  4m 16s | Max:  6m 12s | Hits:  95%/2108  
      🟩 20                 Pass: 100%/18  | Total:  1h 58m | Avg:  6m 35s | Max: 16m 12s | Hits:  94%/9614  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 22)

# Runner
13 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

🟩 CI finished in 27m 50s: Pass: 100%/22 | Total: 2h 11m | Avg: 5m 59s | Max: 13m 46s | Hits: 95%/11722
  • 🟩 cudax: Pass: 100%/22 | Total: 2h 11m | Avg: 5m 59s | Max: 13m 46s | Hits: 95%/11722

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  1h 57m | Avg:  6m 32s | Max: 13m 46s | Hits:  95%/9406  
      🟩 arm64              Pass: 100%/4   | Total: 14m 10s | Avg:  3m 32s | Max:  3m 37s | Hits:  97%/2316  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 12m 00s | Avg: 12m 00s | Max: 12m 00s | Hits:  59%/277   
      🟩 12.5               Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s | Hits:  92%/742   
      🟩 12.8               Pass: 100%/19  | Total:  1h 47m | Avg:  5m 39s | Max: 13m 46s | Hits:  96%/10703 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 12m 00s | Avg: 12m 00s | Max: 12m 00s | Hits:  59%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s | Hits:  92%/742   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 47m | Avg:  5m 39s | Max: 13m 46s | Hits:  96%/10703 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  2h 11m | Avg:  5m 59s | Max: 13m 46s | Hits:  95%/11722 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s | Hits:  97%/581   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s | Hits:  97%/579   
      🟩 Clang16            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s | Hits:  97%/579   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s | Hits:  97%/579   
      🟩 Clang18            Pass: 100%/4   | Total: 23m 24s | Avg:  5m 51s | Max: 12m 24s | Hits:  98%/2316  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s | Hits:  97%/581   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s | Hits:  97%/579   
      🟩 GCC12              Pass: 100%/2   | Total: 17m 20s | Avg:  8m 40s | Max: 13m 27s | Hits:  98%/1158  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 31s | Avg:  5m 15s | Max: 13m 46s | Hits:  97%/3474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 00s | Avg: 12m 00s | Max: 12m 00s | Hits:  59%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits:  58%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s | Hits:  92%/742   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 39m 11s | Avg:  4m 53s | Max: 12m 24s | Hits:  97%/4634  
      🟩 GCC                Pass: 100%/10  | Total: 56m 34s | Avg:  5m 39s | Max: 13m 46s | Hits:  97%/5792  
      🟩 MSVC               Pass: 100%/2   | Total: 23m 37s | Avg: 11m 48s | Max: 12m 00s | Hits:  59%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 34s | Avg:  6m 17s | Max:  6m 39s | Hits:  92%/742   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 17m 05s | Avg:  8m 32s | Max: 13m 46s | Hits:  98%/1158  
      🟩 rtx2080            Pass: 100%/20  | Total:  1h 54m | Avg:  5m 44s | Max: 13m 27s | Hits:  95%/10564 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  1h 32m | Avg:  4m 51s | Max: 12m 00s | Hits:  94%/9985  
      🟩 Test               Pass: 100%/3   | Total: 39m 37s | Avg: 13m 12s | Max: 13m 46s | Hits:  99%/1737  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 20m 38s | Avg:  6m 52s | Max: 13m 46s | Hits:  98%/1737  
      🟩 90a                Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s | Hits:  97%/579   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 16m 31s | Avg:  4m 07s | Max:  5m 55s | Hits:  96%/2108  
      🟩 20                 Pass: 100%/18  | Total:  1h 55m | Avg:  6m 24s | Max: 13m 46s | Hits:  95%/9614  
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 22)

# Runner
13 linux-amd64-cpu16
4 linux-arm64-cpu16
2 windows-amd64-cpu16
2 linux-amd64-gpu-rtx2080-latest-1
1 linux-amd64-gpu-h100-latest-1

@pciolkosz pciolkosz merged commit b048cb7 into NVIDIA:main Feb 28, 2025
34 of 37 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

5 participants