@bdice
Contributor

bdice commented Nov 21, 2025

Summary

Backporting changes from PRs #5252, #5440, and #6451. These are needed to compile xgboost with CCCL 3.1.x.

Test plan

  • Build RMM artifacts with these changes
  • Validate XGBoost compilation with CCCL 3.1.x and new RMM artifacts

@bernhardmgruber
Contributor

> Backporting changes from PRs #5252, #5440, and #6451. These are needed to compile xgboost with CCCL 3.1.x.

That's fine. What confuses me is that I would expect to see three commits cherry-picked from the main branch, and that is not what this PR shows. Can you please check the commits you are proposing? Something looks odd.

@bdice
Contributor Author

bdice commented Nov 21, 2025

I got these patches from @trxcllnt. @trxcllnt Can you answer @bernhardmgruber's question?

I believe the answer is that the commits on main don't apply cleanly, so Paul pieced this together from a partial rebase, but I'm not sure.

@github-actions
Contributor

🟩 CI finished in 2h 02m: Pass: 100%/157 | Total: 3d 19h | Avg: 35m 04s | Max: 1h 49m | Hits: 77%/173046
  • 🟩 cub: Pass: 100%/50 | Total: 2d 02h | Avg: 1h 00m | Max: 1h 49m | Hits: 68%/61654

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  2d 00h | Avg:  1h 00m | Max:  1h 49m | Hits:  68%/59140 
      🟩 arm64              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 09m | Hits:  62%/2514  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 47m | Avg:  1h 09m | Max:  1h 45m | Hits:  62%/6181  
      🟩 12.9               Pass: 100%/45  | Total:  1d 20h | Avg: 59m 36s | Max:  1h 49m | Hits:  69%/55473 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 32m 47s | Hits:  68%/2161  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 47m | Avg:  1h 09m | Max:  1h 45m | Hits:  62%/6181  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 19h | Avg:  1h 00m | Max:  1h 49m | Hits:  69%/53312 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 32m 47s | Hits:  68%/2161  
      🟩 nvcc               Pass: 100%/48  | Total:  2d 01h | Avg:  1h 01m | Max:  1h 49m | Hits:  68%/59493 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 57m | Avg: 59m 18s | Max:  1h 02m | Hits:  63%/5030  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  63%/2511  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  62%/2511  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 30s | Max:  1h 01m | Hits:  63%/2511  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 57m | Avg: 58m 49s | Max:  1h 00m | Hits:  63%/2511  
      🟩 Clang19            Pass: 100%/7   | Total:  5h 15m | Avg: 45m 00s | Max:  1h 01m | Hits:  75%/8440  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 15m | Hits:  62%/2514  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  62%/1257  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  62%/2514  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 08m | Hits:  62%/2515  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits:  62%/2511  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits:  62%/2511  
      🟩 GCC13              Pass: 100%/12  | Total:  8h 46m | Avg: 43m 54s | Max:  1h 09m | Hits:  81%/15093 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  3h 23m | Avg:  1h 41m | Max:  1h 45m | Hits:  62%/2304  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  6h 07m | Avg:  1h 31m | Max:  1h 49m | Hits:  62%/4608  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 15m | Hits:  56%/2313  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 17h 23m | Avg: 54m 55s | Max:  1h 05m | Hits:  67%/23514 
      🟩 GCC                Pass: 100%/23  | Total: 21h 08m | Avg: 55m 08s | Max:  1h 15m | Hits:  72%/28915 
      🟩 MSVC               Pass: 100%/6   | Total:  9h 31m | Avg:  1h 35m | Max:  1h 49m | Hits:  62%/6912  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 15m | Hits:  56%/2313  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 22m | Avg: 27m 26s | Max: 31m 03s | Hits:  87%/3774  
      🟩 rtx2080            Pass: 100%/39  | Total:  1d 19h | Avg:  1h 06m | Max:  1h 49m | Hits:  62%/47822 
      🟩 rtxa6000           Pass: 100%/8   | Total:  5h 43m | Avg: 42m 54s | Max:  1h 05m | Hits:  90%/10058 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  1d 21h | Avg:  1h 05m | Max:  1h 49m | Hits:  62%/51594 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 37m 51s | Avg: 37m 51s | Max: 37m 51s | Hits:  99%/1258  
      🟩 GraphCapture       Pass: 100%/1   | Total: 32m 33s | Avg: 32m 33s | Max: 32m 33s | Hits:  99%/1258  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 38m | Avg: 32m 41s | Max: 40m 17s | Hits:  99%/3772  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 44m | Avg: 34m 53s | Max: 42m 13s | Hits:  99%/3772  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 22m | Avg: 27m 26s | Max: 31m 03s | Hits:  87%/3774  
      🟩 90;90a             Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 21m | Hits:  62%/2410  
      🟩 100;120            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 15m | Hits:  62%/2410  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  1d 00h | Avg:  1h 09m | Max:  1h 45m | Hits:  62%/25788 
      🟩 20                 Pass: 100%/29  | Total:  1d 02h | Avg: 54m 21s | Max:  1h 49m | Hits:  73%/35866 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 1d 09h | Avg: 39m 46s | Max: 1h 18m | Hits: 80%/95771

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 51m 08s | Avg: 25m 34s | Max: 36m 22s | Hits:  90%/3834  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  1d 07h | Avg: 39m 49s | Max:  1h 18m | Hits:  80%/91938 
      🟩 arm64              Pass: 100%/2   | Total:  1h 17m | Avg: 38m 43s | Max: 41m 32s | Hits:  77%/3833  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 39m | Avg: 43m 58s | Max:  1h 14m | Hits:  77%/9575  
      🟩 12.9               Pass: 100%/45  | Total:  1d 05h | Avg: 39m 18s | Max:  1h 18m | Hits:  80%/86196 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 57m 33s | Avg: 28m 46s | Max: 29m 31s | Hits:  80%/3832  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 39m | Avg: 43m 58s | Max:  1h 14m | Hits:  77%/9575  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  1d 04h | Avg: 39m 48s | Max:  1h 18m | Hits:  80%/82364 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 57m 33s | Avg: 28m 46s | Max: 29m 31s | Hits:  80%/3832  
      🟩 nvcc               Pass: 100%/48  | Total:  1d 08h | Avg: 40m 14s | Max:  1h 18m | Hits:  80%/91939 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 21m | Avg: 35m 23s | Max: 37m 51s | Hits:  79%/7664  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 14m | Avg: 37m 28s | Max: 37m 37s | Hits:  74%/3832  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 59s | Max: 36m 27s | Hits:  74%/3832  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 15m | Avg: 37m 57s | Max: 40m 08s | Hits:  74%/3832  
      🟩 Clang18            Pass: 100%/2   | Total:  1h 19m | Avg: 39m 46s | Max: 41m 01s | Hits:  74%/3832  
      🟩 Clang19            Pass: 100%/7   | Total:  3h 08m | Avg: 26m 54s | Max: 39m 37s | Hits:  85%/13412 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 19m | Avg: 39m 37s | Max: 42m 22s | Hits:  80%/3834  
      🟩 GCC8               Pass: 100%/1   | Total: 42m 19s | Avg: 42m 19s | Max: 42m 19s | Hits:  74%/1917  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 21m | Avg: 40m 34s | Max: 40m 57s | Hits:  74%/3834  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 27m | Avg: 43m 39s | Max: 45m 20s | Hits:  74%/3834  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 21m | Avg: 40m 56s | Max: 41m 18s | Hits:  80%/3834  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 28m | Avg: 44m 25s | Max: 46m 06s | Hits:  80%/3834  
      🟩 GCC13              Pass: 100%/11  | Total:  4h 58m | Avg: 27m 08s | Max: 46m 14s | Hits:  87%/21087 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 14m | Hits:  75%/3818  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  5h 11m | Avg:  1h 02m | Max:  1h 18m | Hits:  79%/9545  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  71%/3830  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total: 10h 32m | Avg: 33m 16s | Max: 41m 01s | Hits:  79%/36404 
      🟩 GCC                Pass: 100%/22  | Total: 12h 39m | Avg: 34m 30s | Max: 46m 14s | Hits:  82%/42174 
      🟩 MSVC               Pass: 100%/7   | Total:  7h 39m | Avg:  1h 05m | Max:  1h 18m | Hits:  78%/13363 
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 09m | Hits:  71%/3830  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 33m 43s | Avg: 16m 51s | Max: 22m 34s | Hits:  90%/3834  
      🟩 rtx2080            Pass: 100%/38  | Total:  1d 03h | Avg: 43m 57s | Max:  1h 16m | Hits:  77%/72786 
      🟩 rtx4090            Pass: 100%/10  | Total:  4h 44m | Avg: 28m 27s | Max:  1h 18m | Hits:  91%/19151 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  1d 07h | Avg: 43m 49s | Max:  1h 18m | Hits:  77%/82362 
      🟩 TestCPU            Pass: 100%/3   | Total: 54m 50s | Avg: 18m 16s | Max: 37m 43s | Hits:  99%/5742  
      🟩 TestGPU            Pass: 100%/4   | Total: 49m 38s | Avg: 12m 24s | Max: 14m 46s | Hits:  99%/7667  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 33m 43s | Avg: 16m 51s | Max: 22m 34s | Hits:  90%/3834  
      🟩 90;90a             Pass: 100%/2   | Total:  1h 29m | Avg: 44m 49s | Max: 58m 38s | Hits:  77%/3826  
      🟩 100;120            Pass: 100%/2   | Total:  1h 33m | Avg: 46m 52s | Max:  1h 01m | Hits:  77%/3826  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total: 16h 10m | Avg: 46m 13s | Max:  1h 16m | Hits:  76%/40223 
      🟩 20                 Pass: 100%/27  | Total: 16h 06m | Avg: 35m 48s | Max:  1h 18m | Hits:  82%/51714 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 3h 53m | Avg: 8m 19s | Max: 18m 22s | Hits: 89%/15126

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  3h 29m | Avg:  8m 43s | Max: 18m 22s | Hits:  89%/12794 
      🟩 arm64              Pass: 100%/4   | Total: 23m 38s | Avg:  5m 54s | Max:  6m 44s | Hits:  90%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 27m 16s | Avg:  9m 05s | Max: 17m 30s | Hits:  87%/1448  
      🟩 12.9               Pass: 100%/25  | Total:  3h 25m | Avg:  8m 14s | Max: 18m 22s | Hits:  89%/13678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 27m 16s | Avg:  9m 05s | Max: 17m 30s | Hits:  87%/1448  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  3h 25m | Avg:  8m 14s | Max: 18m 22s | Hits:  89%/13678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  3h 53m | Avg:  8m 19s | Max: 18m 22s | Hits:  89%/15126 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  6m 00s | Hits:  90%/1168  
      🟩 Clang15            Pass: 100%/1   | Total:  5m 58s | Avg:  5m 58s | Max:  5m 58s | Hits:  90%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  6m 03s | Avg:  6m 03s | Max:  6m 03s | Hits:  90%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s | Hits:  90%/583   
      🟩 Clang18            Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s | Hits:  90%/583   
      🟩 Clang19            Pass: 100%/4   | Total: 25m 01s | Avg:  6m 15s | Max:  8m 06s | Hits:  92%/2332  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 58s | Hits:  90%/1168  
      🟩 GCC11              Pass: 100%/1   | Total:  6m 55s | Avg:  6m 55s | Max:  6m 55s | Hits:  89%/583   
      🟩 GCC12              Pass: 100%/1   | Total:  6m 34s | Avg:  6m 34s | Max:  6m 34s | Hits:  89%/583   
      🟩 GCC13              Pass: 100%/8   | Total: 58m 23s | Avg:  7m 17s | Max: 15m 52s | Hits:  92%/4664  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 17m 30s | Avg: 17m 30s | Max: 17m 30s | Hits:  77%/282   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 54m 18s | Avg: 18m 06s | Max: 18m 22s | Hits:  77%/852   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 19m 59s | Avg:  9m 59s | Max: 10m 24s | Hits:  76%/1162  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 58m 20s | Avg:  5m 50s | Max:  8m 06s | Hits:  91%/5832  
      🟩 GCC                Pass: 100%/12  | Total:  1h 23m | Avg:  6m 55s | Max: 15m 52s | Hits:  91%/6998  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 11m | Avg: 17m 57s | Max: 18m 22s | Hits:  77%/1134  
      🟩 NVHPC              Pass: 100%/2   | Total: 19m 59s | Avg:  9m 59s | Max: 10m 24s | Hits:  76%/1162  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  6m 34s | Hits:  94%/1166  
      🟩 rtx2080            Pass: 100%/26  | Total:  3h 41m | Avg:  8m 31s | Max: 18m 22s | Hits:  88%/13960 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  3h 22m | Avg:  8m 06s | Max: 18m 22s | Hits:  87%/13377 
      🟩 Test               Pass: 100%/3   | Total: 30m 32s | Avg: 10m 10s | Max: 15m 52s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  6m 34s | Hits:  94%/1166  
      🟩 90;90a             Pass: 100%/2   | Total: 23m 53s | Avg: 11m 56s | Max: 17m 52s | Hits:  85%/867   
      🟩 100;120            Pass: 100%/2   | Total: 24m 00s | Avg: 12m 00s | Max: 18m 04s | Hits:  85%/867   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 21m 51s | Avg:  7m 17s | Max: 10m 24s | Hits:  85%/1747  
      🟩 20                 Pass: 100%/25  | Total:  3h 31m | Avg:  8m 27s | Max: 18m 22s | Hits:  89%/13379 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 09m | Avg: 10m 32s | Max: 20m 35s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 09m | Avg: 10m 32s | Max: 20m 35s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 15m | Avg:  9m 29s | Max: 15m 30s
      🟩 rtxa6000           Pass: 100%/10  | Total:  1h 53m | Avg: 11m 22s | Max: 20m 35s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 17m 40s | Avg:  8m 50s | Max:  8m 50s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 02m | Avg: 15m 40s | Max: 20m 35s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 21m 40s | Avg:  5m 25s | Max:  5m 52s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  5m 34s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 06m | Avg: 16m 44s | Max: 18m 28s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  1h 35m | Avg: 10m 37s | Max: 20m 35s
      🟩 3.13               Pass: 100%/9   | Total:  1h 34m | Avg: 10m 27s | Max: 18m 58s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 20m 42s | Avg: 5m 10s | Max: 7m 27s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  7m 27s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 11m 53s | Avg:  5m 56s | Max:  7m 27s
      🟩 12.9               Pass: 100%/2   | Total:  8m 49s | Avg:  4m 24s | Max:  4m 56s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 11m 53s | Avg:  5m 56s | Max:  7m 27s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  8m 49s | Avg:  4m 24s | Max:  4m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  7m 27s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 26s | Avg:  4m 26s | Max:  4m 26s
      🟩 Clang19            Pass: 100%/1   | Total:  4m 56s | Avg:  4m 56s | Max:  4m 56s
      🟩 GCC12              Pass: 100%/1   | Total:  7m 27s | Avg:  7m 27s | Max:  7m 27s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 22s | Avg:  4m 41s | Max:  4m 56s
      🟩 GCC                Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  7m 27s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  7m 27s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 20m 42s | Avg:  5m 10s | Max:  7m 27s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 15m 25s | Avg: 3m 51s | Max: 4m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 53s | Avg:  3m 56s | Max:  4m 01s
      🟩 arm64              Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  3m 48s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 15m 25s | Avg:  3m 51s | Max:  4m 01s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 49s | Avg:  3m 54s | Max:  4m 01s
      🟩 20                 Pass: 100%/2   | Total:  7m 36s | Avg:  3m 48s | Max:  3m 52s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 28m 53s | Avg: 9m 37s | Max: 15m 00s | Hits: 97%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 28m 53s | Avg:  9m 37s | Max: 15m 00s | Hits:  97%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 10m 49s | Avg: 10m 49s | Max: 10m 49s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 18m 04s | Avg:  9m 02s | Max: 15m 00s | Hits:  97%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s | Hits:  96%/165   
      🟩 Test               Pass: 100%/2   | Total: 25m 49s | Avg: 12m 54s | Max: 15m 00s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 157)

# Runner
93 linux-amd64-cpu16
17 windows-amd64-cpu16
14 linux-amd64-gpu-rtxa6000-latest-1
13 linux-amd64-gpu-h100-latest-1
10 linux-arm64-cpu16
7 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
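
To compare two of these CI reports, the summary lines can be parsed into numbers. The sketch below is only an illustration: the field layout (`Pass`, `Total`, `Avg`, `Max`, `Hits`, separated by `|`) is inferred from the lines shown in this thread, not from any documented format, and the helper names `parse_duration` and `parse_summary` are hypothetical.

```python
import re

# Assumption: durations are written as space-separated parts such as
# "3d 19h", "35m 04s", or "1h 49m", as in the report lines in this thread.
_DURATION_PART = re.compile(r"(\d+)\s*([dhms])")
_UNIT_SECONDS = {"d": 86400, "h": 3600, "m": 60, "s": 1}


def parse_duration(text):
    """Convert a duration string such as '3d 19h' to seconds."""
    return sum(int(n) * _UNIT_SECONDS[u] for n, u in _DURATION_PART.findall(text))


def parse_summary(line):
    """Parse a 'Pass: ... | Total: ... | Avg: ... | Max: ...' summary line."""
    fields = {}
    for part in line.split("|"):
        key, _, value = part.partition(":")
        fields[key.strip()] = value.strip()

    pass_pct, job_count = fields["Pass"].split("/")
    result = {
        "pass_pct": int(pass_pct.rstrip("%")),
        "jobs": int(job_count),
        "total_s": parse_duration(fields["Total"]),
        "avg_s": parse_duration(fields["Avg"]),
        "max_s": parse_duration(fields["Max"]),
    }
    # Some project lines (for example python) carry no "Hits" field.
    if "Hits" in fields:
        hit_pct, _ = fields["Hits"].split("/")
        result["hit_pct"] = int(hit_pct.rstrip("%"))
    return result


# Overall summary lines of the two CI runs reported in this thread.
first = parse_summary(
    "Pass: 100%/157 | Total: 3d 19h | Avg: 35m 04s | Max: 1h 49m | Hits: 77%/173046"
)
second = parse_summary(
    "Pass: 100%/157 | Total: 1d 05h | Avg: 11m 22s | Max: 42m 16s | Hits: 99%/173046"
)
print(first["hit_pct"], second["hit_pct"])
print(first["total_s"] - second["total_s"])
```

Only the text after the `CI finished in ...:` prefix is passed in, so that the first `|`-separated field starts with `Pass`.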

elstehle and others added 3 commits November 21, 2025 15:18
…TrivialRuns` (NVIDIA#5252)

* streaming non trivial runs

* change global offset computation

* fixes style

* integrate latest bench and test changes

* addresses review comments

* replaces getters with member var
…A#5440)

* adds vsmem to reduce_by_key

* adds tests for vsmem

* fixes rle, which does not support vsmem yet

* addresses review comments
…ypes in `DeviceRunLengthEncode::Encode` (NVIDIA#6451)

* adds tests for non default constructible iterators

* fixes non default constructible iterators in rle

* Simplify generation of `streaming_context` for run_length_encode

* Reinstate regression test

---------

Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
@bdice
Contributor Author

bdice commented Nov 21, 2025

To answer the question above: the only difference is that the patches I received earlier did not include a few changes in cub/benchmarks or cub/tests. I think those changes should be included here, even though the original source of these patches left them out (tests and benchmarks weren't being run there, so a minimal patch was acceptable). I am going to force-push this PR with the original commits as cherry-picks.

@bdice force-pushed the xgboost-backport-fixes branch from bd34a2b to 1776c74 on November 21, 2025 21:21
@bdice
Contributor Author

bdice commented Nov 21, 2025

This is ready for review.

I validated downstream that this fixes xgboost builds.

I am still verifying that all of RAPIDS 25.12 builds and that cuDF passes its tests with these changes, but I expect that verification to finish shortly.

@github-actions
Contributor

🟩 CI finished in 1h 30m: Pass: 100%/157 | Total: 1d 05h | Avg: 11m 22s | Max: 42m 16s | Hits: 99%/173046
  • 🟩 cub: Pass: 100%/50 | Total: 12h 26m | Avg: 14m 55s | Max: 42m 16s | Hits: 99%/61654

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 12h 11m | Avg: 15m 14s | Max: 42m 16s | Hits:  99%/59140 
      🟩 arm64              Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  8m 40s | Hits:  99%/2514  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 03m | Avg: 12m 38s | Max: 34m 29s | Hits:  99%/6181  
      🟩 12.9               Pass: 100%/45  | Total: 11h 23m | Avg: 15m 10s | Max: 42m 16s | Hits:  99%/55473 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 32s | Avg:  5m 16s | Max:  5m 17s | Hits:  99%/2161  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 03m | Avg: 12m 38s | Max: 34m 29s | Hits:  99%/6181  
      🟩 nvcc12.9           Pass: 100%/43  | Total: 11h 12m | Avg: 15m 38s | Max: 42m 16s | Hits:  99%/53312 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 32s | Avg:  5m 16s | Max:  5m 17s | Hits:  99%/2161  
      🟩 nvcc               Pass: 100%/48  | Total: 12h 15m | Avg: 15m 19s | Max: 42m 16s | Hits:  99%/59493 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 25m 29s | Avg:  6m 22s | Max:  6m 52s | Hits:  99%/5030  
      🟩 Clang15            Pass: 100%/2   | Total: 13m 22s | Avg:  6m 41s | Max:  6m 48s | Hits:  99%/2511  
      🟩 Clang16            Pass: 100%/2   | Total: 13m 13s | Avg:  6m 36s | Max:  6m 37s | Hits:  99%/2511  
      🟩 Clang17            Pass: 100%/2   | Total: 13m 26s | Avg:  6m 43s | Max:  6m 51s | Hits:  99%/2511  
      🟩 Clang18            Pass: 100%/2   | Total: 13m 05s | Avg:  6m 32s | Max:  6m 40s | Hits:  99%/2511  
      🟩 Clang19            Pass: 100%/7   | Total:  1h 34m | Avg: 13m 26s | Max: 35m 18s | Hits:  99%/8440  
      🟩 GCC7               Pass: 100%/2   | Total: 16m 39s | Avg:  8m 19s | Max:  8m 39s | Hits:  99%/2514  
      🟩 GCC8               Pass: 100%/1   | Total:  8m 36s | Avg:  8m 36s | Max:  8m 36s | Hits:  99%/1257  
      🟩 GCC9               Pass: 100%/2   | Total: 17m 36s | Avg:  8m 48s | Max:  8m 57s | Hits:  99%/2514  
      🟩 GCC10              Pass: 100%/2   | Total: 17m 48s | Avg:  8m 54s | Max:  9m 10s | Hits:  99%/2515  
      🟩 GCC11              Pass: 100%/2   | Total: 18m 13s | Avg:  9m 06s | Max:  9m 10s | Hits:  99%/2511  
      🟩 GCC12              Pass: 100%/2   | Total: 17m 58s | Avg:  8m 59s | Max:  9m 01s | Hits:  99%/2511  
      🟩 GCC13              Pass: 100%/12  | Total:  4h 08m | Avg: 20m 41s | Max: 42m 16s | Hits:  99%/15093 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 38s | Max: 34m 47s | Hits:  99%/2304  
      🟩 MSVC14.43          Pass: 100%/4   | Total:  2h 15m | Avg: 33m 46s | Max: 36m 50s | Hits:  99%/4608  
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 12m 18s | Hits:  97%/2313  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  2h 52m | Avg:  9m 05s | Max: 35m 18s | Hits:  99%/23514 
      🟩 GCC                Pass: 100%/23  | Total:  5h 45m | Avg: 15m 00s | Max: 42m 16s | Hits:  99%/28915 
      🟩 MSVC               Pass: 100%/6   | Total:  3h 24m | Avg: 34m 03s | Max: 36m 50s | Hits:  99%/6912  
      🟩 NVHPC              Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 12m 18s | Hits:  97%/2313  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total: 58m 22s | Avg: 19m 27s | Max: 26m 14s | Hits:  99%/3774  
      🟩 rtx2080            Pass: 100%/39  | Total:  7h 41m | Avg: 11m 49s | Max: 36m 50s | Hits:  99%/47822 
      🟩 rtxa6000           Pass: 100%/8   | Total:  3h 46m | Avg: 28m 20s | Max: 42m 16s | Hits:  99%/10058 
    🟩 jobs
      🟩 Build              Pass: 100%/42  | Total:  8h 03m | Avg: 11m 30s | Max: 36m 50s | Hits:  99%/51594 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 35m 34s | Avg: 35m 34s | Max: 35m 34s | Hits:  99%/1258  
      🟩 GraphCapture       Pass: 100%/1   | Total: 31m 12s | Avg: 31m 12s | Max: 31m 12s | Hits:  99%/1258  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 43m | Avg: 34m 36s | Max: 42m 16s | Hits:  99%/3772  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 32m | Avg: 30m 43s | Max: 37m 45s | Hits:  99%/3772  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 58m 22s | Avg: 19m 27s | Max: 26m 14s | Hits:  99%/3774  
      🟩 90;90a             Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 32m 58s | Hits:  99%/2410  
      🟩 100;120            Pass: 100%/2   | Total: 38m 43s | Avg: 19m 21s | Max: 30m 20s | Hits:  99%/2410  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 08m | Avg: 11m 49s | Max: 36m 50s | Hits:  99%/25788 
      🟩 20                 Pass: 100%/29  | Total:  8h 18m | Avg: 17m 10s | Max: 42m 16s | Hits:  99%/35866 
    
  • 🟩 thrust: Pass: 100%/50 | Total: 9h 57m | Avg: 11m 57s | Max: 39m 01s | Hits: 99%/95771

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 22m 50s | Avg: 11m 25s | Max: 14m 44s | Hits:  99%/3834  
    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  9h 45m | Avg: 12m 11s | Max: 39m 01s | Hits:  99%/91938 
      🟩 arm64              Pass: 100%/2   | Total: 12m 28s | Avg:  6m 14s | Max:  7m 14s | Hits:  99%/3833  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 55m 26s | Avg: 11m 05s | Max: 30m 58s | Hits:  99%/9575  
      🟩 12.9               Pass: 100%/45  | Total:  9h 02m | Avg: 12m 03s | Max: 39m 01s | Hits:  99%/86196 
    🟩 cudacxx
      🟩 ClangCUDA19        Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 34s | Hits: 100%/3832  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 55m 26s | Avg: 11m 05s | Max: 30m 58s | Hits:  99%/9575  
      🟩 nvcc12.9           Pass: 100%/43  | Total:  8h 51m | Avg: 12m 21s | Max: 39m 01s | Hits:  99%/82364 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 34s | Hits: 100%/3832  
      🟩 nvcc               Pass: 100%/48  | Total:  9h 46m | Avg: 12m 13s | Max: 39m 01s | Hits:  99%/91939 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 31s | Avg:  5m 37s | Max:  5m 48s | Hits: 100%/7664  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 19s | Avg:  6m 09s | Max:  6m 16s | Hits: 100%/3832  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 40s | Hits: 100%/3832  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  6m 03s | Hits: 100%/3832  
      🟩 Clang18            Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits: 100%/3832  
      🟩 Clang19            Pass: 100%/7   | Total: 47m 08s | Avg:  6m 44s | Max: 11m 15s | Hits: 100%/13412 
      🟩 GCC7               Pass: 100%/2   | Total: 13m 36s | Avg:  6m 48s | Max:  6m 50s | Hits:  99%/3834  
      🟩 GCC8               Pass: 100%/1   | Total:  7m 05s | Avg:  7m 05s | Max:  7m 05s | Hits:  99%/1917  
      🟩 GCC9               Pass: 100%/2   | Total: 14m 37s | Avg:  7m 18s | Max:  7m 57s | Hits:  99%/3834  
      🟩 GCC10              Pass: 100%/2   | Total: 14m 50s | Avg:  7m 25s | Max:  7m 44s | Hits:  99%/3834  
      🟩 GCC11              Pass: 100%/2   | Total: 14m 22s | Avg:  7m 11s | Max:  7m 13s | Hits:  99%/3834  
      🟩 GCC12              Pass: 100%/2   | Total: 16m 19s | Avg:  8m 09s | Max:  8m 27s | Hits:  99%/3834  
      🟩 GCC13              Pass: 100%/11  | Total:  1h 44m | Avg:  9m 27s | Max: 14m 44s | Hits:  97%/21087 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 31s | Max: 32m 05s | Hits:  99%/3818  
      🟩 MSVC14.43          Pass: 100%/5   | Total:  2h 45m | Avg: 33m 04s | Max: 39m 01s | Hits:  99%/9545  
      🟩 NVHPC25.5          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 12s | Max: 37m 23s | Hits:  98%/3830  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/19  | Total:  1h 55m | Avg:  6m 06s | Max: 11m 15s | Hits: 100%/36404 
      🟩 GCC                Pass: 100%/22  | Total:  3h 04m | Avg:  8m 24s | Max: 14m 44s | Hits:  98%/42174 
      🟩 MSVC               Pass: 100%/7   | Total:  3h 48m | Avg: 32m 37s | Max: 39m 01s | Hits:  99%/13363 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 12s | Max: 37m 23s | Hits:  98%/3830  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 16m 34s | Avg:  8m 17s | Max: 11m 00s | Hits:  99%/3834  
      🟩 rtx2080            Pass: 100%/38  | Total:  7h 06m | Avg: 11m 14s | Max: 37m 23s | Hits:  99%/72786 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 34m | Avg: 15m 25s | Max: 39m 01s | Hits:  97%/19151 
    🟩 jobs
      🟩 Build              Pass: 100%/43  | Total:  8h 11m | Avg: 11m 26s | Max: 37m 23s | Hits:  99%/82362 
      🟩 TestCPU            Pass: 100%/3   | Total: 56m 23s | Avg: 18m 47s | Max: 39m 01s | Hits:  99%/5742  
      🟩 TestGPU            Pass: 100%/4   | Total: 49m 27s | Avg: 12m 21s | Max: 14m 44s | Hits:  99%/7667  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 16m 34s | Avg:  8m 17s | Max: 11m 00s | Hits:  99%/3834  
      🟩 90;90a             Pass: 100%/2   | Total: 36m 40s | Avg: 18m 20s | Max: 29m 25s | Hits:  99%/3826  
      🟩 100;120            Pass: 100%/2   | Total: 35m 38s | Avg: 17m 49s | Max: 28m 55s | Hits:  99%/3826  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 08m | Avg: 11m 49s | Max: 37m 23s | Hits:  99%/40223 
      🟩 20                 Pass: 100%/27  | Total:  5h 26m | Avg: 12m 05s | Max: 39m 01s | Hits:  98%/51714 
    
  • 🟩 cudax: Pass: 100%/28 | Total: 2h 45m | Avg: 5m 55s | Max: 17m 11s | Hits: 99%/15126

    🟩 cpu
      🟩 amd64              Pass: 100%/24  | Total:  2h 32m | Avg:  6m 22s | Max: 17m 11s | Hits:  99%/12794 
      🟩 arm64              Pass: 100%/4   | Total: 12m 54s | Avg:  3m 13s | Max:  3m 30s | Hits:  99%/2332  
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 21m 34s | Avg:  7m 11s | Max: 15m 37s | Hits:  98%/1448  
      🟩 12.9               Pass: 100%/25  | Total:  2h 24m | Avg:  5m 46s | Max: 17m 11s | Hits:  99%/13678 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 21m 34s | Avg:  7m 11s | Max: 15m 37s | Hits:  98%/1448  
      🟩 nvcc12.9           Pass: 100%/25  | Total:  2h 24m | Avg:  5m 46s | Max: 17m 11s | Hits:  99%/13678 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/28  | Total:  2h 45m | Avg:  5m 55s | Max: 17m 11s | Hits:  99%/15126 
    🟩 cxx
      🟩 Clang14            Pass: 100%/2   | Total:  5m 40s | Avg:  2m 50s | Max:  2m 57s | Hits: 100%/1168  
      🟩 Clang15            Pass: 100%/1   | Total:  2m 58s | Avg:  2m 58s | Max:  2m 58s | Hits: 100%/583   
      🟩 Clang16            Pass: 100%/1   | Total:  2m 54s | Avg:  2m 54s | Max:  2m 54s | Hits: 100%/583   
      🟩 Clang17            Pass: 100%/1   | Total:  3m 05s | Avg:  3m 05s | Max:  3m 05s | Hits: 100%/583   
      🟩 Clang18            Pass: 100%/1   | Total:  3m 02s | Avg:  3m 02s | Max:  3m 02s | Hits: 100%/583   
      🟩 Clang19            Pass: 100%/4   | Total: 17m 11s | Avg:  4m 17s | Max:  8m 03s | Hits: 100%/2332  
      🟩 GCC10              Pass: 100%/2   | Total:  6m 44s | Avg:  3m 22s | Max:  3m 30s | Hits:  99%/1168  
      🟩 GCC11              Pass: 100%/1   | Total:  3m 33s | Avg:  3m 33s | Max:  3m 33s | Hits:  99%/583   
      🟩 GCC12              Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s | Hits:  99%/583   
      🟩 GCC13              Pass: 100%/8   | Total: 39m 26s | Avg:  4m 55s | Max: 12m 00s | Hits:  99%/4664  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 37s | Avg: 15m 37s | Max: 15m 37s | Hits:  95%/282   
      🟩 MSVC14.43          Pass: 100%/3   | Total: 47m 36s | Avg: 15m 52s | Max: 17m 11s | Hits:  95%/852   
      🟩 NVHPC25.5          Pass: 100%/2   | Total: 14m 02s | Avg:  7m 01s | Max:  7m 01s | Hits:  96%/1162  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/10  | Total: 34m 50s | Avg:  3m 29s | Max:  8m 03s | Hits: 100%/5832  
      🟩 GCC                Pass: 100%/12  | Total: 53m 39s | Avg:  4m 28s | Max: 12m 00s | Hits:  99%/6998  
      🟩 MSVC               Pass: 100%/4   | Total:  1h 03m | Avg: 15m 48s | Max: 17m 11s | Hits:  95%/1134  
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 02s | Avg:  7m 01s | Max:  7m 01s | Hits:  96%/1162  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  6m 31s | Hits:  99%/1166  
      🟩 rtx2080            Pass: 100%/26  | Total:  2h 36m | Avg:  6m 00s | Max: 17m 11s | Hits:  99%/13960 
    🟩 jobs
      🟩 Build              Pass: 100%/25  | Total:  2h 19m | Avg:  5m 34s | Max: 17m 11s | Hits:  99%/13377 
      🟩 Test               Pass: 100%/3   | Total: 26m 34s | Avg:  8m 51s | Max: 12m 00s | Hits:  99%/1749  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  6m 31s | Hits:  99%/1166  
      🟩 90;90a             Pass: 100%/2   | Total: 20m 39s | Avg: 10m 19s | Max: 17m 11s | Hits:  98%/867   
      🟩 100;120            Pass: 100%/2   | Total: 18m 07s | Avg:  9m 03s | Max: 14m 44s | Hits:  98%/867   
    🟩 std
      🟩 17                 Pass: 100%/3   | Total: 13m 29s | Avg:  4m 29s | Max:  7m 01s | Hits:  98%/1747  
      🟩 20                 Pass: 100%/25  | Total:  2h 32m | Avg:  6m 05s | Max: 17m 11s | Hits:  99%/13379 
    
  • 🟩 python: Pass: 100%/18 | Total: 3h 41m | Avg: 12m 17s | Max: 34m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 ctk
      🟩 12.9               Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/18  | Total:  3h 41m | Avg: 12m 17s | Max: 34m 04s
    🟩 gpu
      🟩 h100               Pass: 100%/8   | Total:  1h 14m | Avg:  9m 19s | Max: 15m 40s
      🟩 rtxa6000           Pass: 100%/10  | Total:  2h 26m | Avg: 14m 38s | Max: 34m 04s
    🟩 jobs
      🟩 Build cuda.cccl    Pass: 100%/2   | Total: 16m 46s | Avg:  8m 23s | Max:  8m 25s
      🟩 Test cuda.cccl.cooperative Pass: 100%/4   | Total:  1h 06m | Avg: 16m 34s | Max: 22m 50s
      🟩 Test cuda.cccl.examples Pass: 100%/4   | Total: 22m 08s | Avg:  5m 32s | Max:  6m 09s
      🟩 Test cuda.cccl.headers Pass: 100%/4   | Total: 48m 26s | Avg: 12m 06s | Max: 34m 04s
      🟩 Test cuda.cccl.parallel Pass: 100%/4   | Total:  1h 07m | Avg: 16m 52s | Max: 18m 34s
    🟩 py_version
      🟩 3.10               Pass: 100%/9   | Total:  2h 04m | Avg: 13m 47s | Max: 34m 04s
      🟩 3.13               Pass: 100%/9   | Total:  1h 37m | Avg: 10m 47s | Max: 22m 50s
    
  • 🟩 packaging: Pass: 100%/4 | Total: 14m 55s | Avg: 3m 43s | Max: 3m 59s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  3m 59s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total:  7m 17s | Avg:  3m 38s | Max:  3m 43s
      🟩 12.9               Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 59s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total:  7m 17s | Avg:  3m 38s | Max:  3m 43s
      🟩 nvcc12.9           Pass: 100%/2   | Total:  7m 38s | Avg:  3m 49s | Max:  3m 59s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  3m 59s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟩 Clang19            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 43s | Avg:  3m 43s | Max:  3m 43s
      🟩 GCC13              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  7m 33s | Avg:  3m 46s | Max:  3m 59s
      🟩 GCC                Pass: 100%/2   | Total:  7m 22s | Avg:  3m 41s | Max:  3m 43s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  3m 59s
    🟩 jobs
      🟩 Test               Pass: 100%/4   | Total: 14m 55s | Avg:  3m 43s | Max:  3m 59s
    
  • 🟩 stdpar: Pass: 100%/4 | Total: 13m 59s | Avg: 3m 29s | Max: 3m 36s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  7m 02s | Avg:  3m 31s | Max:  3m 36s
      🟩 arm64              Pass: 100%/2   | Total:  6m 57s | Avg:  3m 28s | Max:  3m 29s
    🟩 ctk
      🟩 12.9               Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 cxx
      🟩 NVHPC25.5          Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 cxx_family
      🟩 NVHPC              Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 jobs
      🟩 Build              Pass: 100%/4   | Total: 13m 59s | Avg:  3m 29s | Max:  3m 36s
    🟩 std
      🟩 17                 Pass: 100%/2   | Total:  7m 04s | Avg:  3m 32s | Max:  3m 36s
      🟩 20                 Pass: 100%/2   | Total:  6m 55s | Avg:  3m 27s | Max:  3m 29s
    
  • 🟩 cccl_c_parallel: Pass: 100%/3 | Total: 27m 00s | Avg: 9m 00s | Max: 13m 49s | Hits: 98%/495

    🟩 cpu
      🟩 amd64              Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 ctk
      🟩 12.9               Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 cudacxx
      🟩 nvcc12.9           Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 cxx
      🟩 GCC13              Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/3   | Total: 27m 00s | Avg:  9m 00s | Max: 13m 49s | Hits:  98%/495   
    🟩 gpu
      🟩 h100               Pass: 100%/1   | Total: 11m 17s | Avg: 11m 17s | Max: 11m 17s | Hits:  98%/165   
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 43s | Avg:  7m 51s | Max: 13m 49s | Hits:  98%/330   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s | Hits:  98%/165   
      🟩 Test               Pass: 100%/2   | Total: 25m 06s | Avg: 12m 33s | Max: 13m 49s | Hits:  98%/330   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
CCCL Packaging
libcu++
+/- CUB
Thrust
CUDA Experimental
stdpar
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- CCCL Packaging
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- stdpar
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 157)

# Runner
93 linux-amd64-cpu16
17 windows-amd64-cpu16
14 linux-amd64-gpu-rtxa6000-latest-1
13 linux-amd64-gpu-h100-latest-1
10 linux-arm64-cpu16
7 linux-amd64-gpu-rtx2080-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@bernhardmgruber
Contributor

Thank you for changing the PR to use proper cherry-picks. May I ask why the last commit, f72b1c7, is necessary? It does not seem to come from a commit on main.

@bdice
Contributor Author

bdice commented Nov 23, 2025

@bernhardmgruber The cherry-picked commits are missing a number of other changes on the main branch that are needed to compile the tests and benchmarks added in the original PRs. I have added all the relevant library/header changes, but CI fails without commit f72b1c7. I do not know how to adapt those tests/benchmarks to compile properly, so I removed them. This PR’s total diff is exactly equivalent to what @trxcllnt has used in internal builds of XGBoost and other libraries, so we believe it is correct and safe to merge.

@bernhardmgruber bernhardmgruber merged commit d69eb55 into NVIDIA:branch/3.1.x Nov 23, 2025
168 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Nov 23, 2025
rockhowse pushed a commit to rapidsai/rapids-cmake that referenced this pull request Nov 24, 2025
Admin merging per approval of `build-eng`

Updates CCCL to get backports needed for XGBoost compatibility with CCCL
3.1.x.

Depends on NVIDIA/cccl#6727
