Separate XGBoost JLL package into CPU and GPU versions #214

BenCurran98 · 2025-09-10T06:37:56Z

This is to fix the issue mentioned here where XGBoost wasn't working when users had CUDA v13 present. Now the behaviour of the package is to attempt to load GPU-compatible artifacts for XGBoost via XGBoost_GPU_jll, but if it can't find a compatible artifact it falls back to the standard CPU build in XGBoost_jll

BenCurran98 · 2025-09-10T06:41:01Z

Also the underlying XGBoost library has been updated to v2.1.4

ExpandingMan · 2025-09-11T21:47:16Z

Thanks! A few minor concerns, see above.

BenCurran98 · 2025-09-12T01:30:10Z

@ExpandingMan I can't see any comments above? :)

src/Lib.jl

ExpandingMan · 2025-09-11T21:44:22Z

src/Lib.jl

+# only enable GPU support if there is a valid binary compatible with this system
+# should we place a warning here?
+if XGBoost_GPU_jll.is_available()
+    lib_xgboost = XGBoost_GPU_jll.libxgboost


Why change the symbol to lib_xgboost? Is the generator now generating that instead?

I wasn't sure if there would be any confusion if I used the same symbol as the libraries exported by the JLL's. I can change it back if you like though

I don't think it's even exported here, but if it were one could just use import to avoid masking.

Please change this back, it's more verbose and should be unnecessary.

ExpandingMan · 2025-09-11T21:46:54Z

test/runtests.jl


-has_cuda() && @testset "cuda" begin
+# only test GPU cababilities if an appropriate device + artifact is available
+has_cuda() && XGBoost_GPU_jll.is_available() && @testset "cuda" begin


Is there any scenario in which we would expect has_cuda() to be true but the xgboost cuda binaries not to be available? I would think that we'd want this to fail anyway if XGBoost_GPU_jll.is_available() fails here.

In the case where the user has an incompatible CUDA version with XGBoost_GPU_jll we won't want it to fail I think? Although that's more to stop it failing if e.g. a user has CUDA 13 which isn't supported yet by XGBoost.jl, rather than specifically for the test suite. I can condition this bit just on the artifact being available though

I haven't looked into the PR yet, just trying to see if I can help with the issue.

XGBoost has a build_info function that exposes build-time information like the CUDA version used for compiling XGBoost. If it's a CPU-only build, it should be reflected in the build_info as well. You can access this field without touching CTK13-specific functions (hence no runtime error).

I feel pretty comfortable saying that if has_cuda() is true but the library isn't available the tests should fail anyway. Pkg is supposed to handle ensuring it's installed and if it's missing something would have gone wrong.

I would say let's make this has_cuda() only.

I haven't looked into the PR yet, just trying to see if I can help with the issue.

XGBoost has a build_info function that exposes build-time information like the CUDA version used for compiling XGBoost. If it's a CPU-only build, it should be reflected in the build_info as well. You can access this field without touching CTK13-specific functions (hence no runtime error).

I can't quite see where this function is defined in here?

ExpandingMan · 2025-09-12T13:33:37Z

Man, this github review feature feels so broken. Ok can you see it now?

BenCurran98 · 2025-09-14T22:42:12Z

Man, this github review feature feels so broken. Ok can you see it now?

Lol yep, thanks :)

ExpandingMan · 2025-09-16T20:26:48Z

Alright, I think this is fine, but I will allow a chance for more comments before merging.

ExpandingMan · 2025-09-19T15:50:59Z

Alright, let's do this. Thanks @BenCurran98 .

Separate XGBoost JLL package into CPU and GPU versions

90fddb6

ExpandingMan reviewed Sep 12, 2025

View reviewed changes

Address PR feedback

2648ab0

BenCurran98 requested a review from ExpandingMan September 15, 2025 06:18

Trigger cuda tests if cuda is available

c72b515

ExpandingMan merged commit 00faf62 into dmlc:master Sep 19, 2025
5 of 6 checks passed

Separate XGBoost JLL package into CPU and GPU versions #214

Separate XGBoost JLL package into CPU and GPU versions #214

Uh oh!

Conversation

BenCurran98 commented Sep 10, 2025

Uh oh!

BenCurran98 commented Sep 10, 2025

Uh oh!

ExpandingMan commented Sep 11, 2025

Uh oh!

BenCurran98 commented Sep 12, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

ExpandingMan commented Sep 12, 2025

Uh oh!

BenCurran98 commented Sep 14, 2025

Uh oh!

ExpandingMan commented Sep 16, 2025

Uh oh!

ExpandingMan commented Sep 19, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants