Pull requests: antirez/ds4
Pull requests list
rocm: fix distributed inference on unified-memory APUs (strix halo / gfx1151)
#407 opened Jun 13, 2026 by kyuz0
[3/N] add prefetch support for CUDA backend : running ds4 for any GPU with cache (2.75 x faster!)
#402 opened Jun 12, 2026 by yiakwy-xpu-ml-framework-team
Add multi-column attn-out low projection kernel for small batches
#399 opened Jun 11, 2026 by rwl4
CUDA backend (DGX-Spark) — refactored into modular .cuh files mirroring ROCm structure
#398 opened Jun 11, 2026 by gundemirbas
ROCm runtime: configurable weight cache limit and arena chunk size via environment variables
#397 opened Jun 11, 2026 by gundemirbas
Add env-gated prompt-lookup speculative decoding for greedy generation
#396 opened Jun 11, 2026 by rwl4
fix(kv-cache): refresh cold anchor after partial prefix hits
#394 opened Jun 11, 2026 by TerryChengTW
3 tasks done
Add teaching mode to ds4-agent, with teach-bench benchmark
#391 opened Jun 11, 2026 by rowantrollope
Clamp MTP draft depth to the prefill capacity
#381 opened Jun 10, 2026 by pandysp (Contributor)
feat: add native Agent Skills support to ds4-agent
#380 opened Jun 10, 2026 by fry69 (Contributor)
Keep live KV reusable when clients strip transient metadata blocks
#378 opened Jun 10, 2026 by adv0r
[2/N] add cuda imatrix support for custom RL model
#377 opened Jun 10, 2026 by yiakwy-xpu-ml-framework-team
ds4_server: Add /health endpoint that returns HTTP 200 once model is fully loaded
#374 opened Jun 9, 2026 by mcmalayalam
Fix agent edit: accept [upto] markers indented or padded with blanks (+ golden cases)
#373 opened Jun 9, 2026 by rinaldofesta
Add continuous depth-1 MTP speculation (DS4_MTP_CONTINUOUS)
#371 opened Jun 9, 2026 by pandysp (Contributor)
[1/N] add fp8 fp32 scale support for custom RL model
#368 opened Jun 9, 2026 by yiakwy-xpu-ml-framework-team
make: consistent ROCm targets (rocm-strix-halo / rocm-generic) + portable lib paths (#357, #179)
#365 opened Jun 8, 2026 by jamesburton