Pull requests: openxla/tokamax
Switch to pyrefly for Python type checking for many Tokamax python libraries.
#912
opened May 18, 2026 by
copybara-service
Bot
[NFC] Replace no longer needed manual tiling hack.
#910
opened May 13, 2026 by
copybara-service
Bot
[pallas:triton] Rolling forward compilation to PTX
#909
opened May 13, 2026 by
copybara-service
Bot
Fix wrong rhs gradient for empty groups in Pallas-Triton ragged dot.
#889
opened Apr 28, 2026 by
renos
Perform bucketed compute in SM90 MGPU quant ragged dot.
#882
opened Apr 27, 2026 by
copybara-service
Bot
Tokamax autotuning cache update. Auto-generated by xmanager/247825154
#878
opened Apr 22, 2026 by
copybara-service
Bot
Change benchmarking timers to run multiple iterations together.
#841
opened Apr 8, 2026 by
stzeng
Collaborator
Triton and Mosaic for linear_softmax_cross_entropy_loss
#801
opened Mar 27, 2026 by
captainpete
[Mosaic GPU] Remove the partitioned from MGPU APIs
#777
opened Mar 13, 2026 by
copybara-service
Bot
Reduce the number of arg specs tested under the ragged_dot API test.
#770
opened Mar 11, 2026 by
copybara-service
Bot
Add missing tcgen05.fence instructions to SM100 MGPU attention.
#757
opened Mar 9, 2026 by
copybara-service
Bot
Add cuda-bench group of dependencies to pyproject.toml
#676
opened Feb 19, 2026 by
andportnoy
Member
Partition and parallelize numpy RNG when initializing large arrays.
#654
opened Feb 13, 2026 by
copybara-service
Bot
[pallas] Passing backend= in addition to compiler_params= is redundant
#643
opened Feb 12, 2026 by
copybara-service
Bot