Pull requests: facebookresearch/generative-recommenders
Pull requests list
update block size for standalone_cint_v4
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
#145
opened Nov 22, 2024 by
zhaozhul
standalone_cint_v4
CLA Signed
fb-exported
#144
opened Nov 22, 2024 by
zhaozhul
loop unroll for hstu attn bwd
CLA Signed
fb-exported
#143
opened Nov 21, 2024 by
LinjianMa
Convert directory fbcode/hammer to use the Ruff Formatter
CLA Signed
fb-exported
#142
opened Nov 20, 2024 by
tpolasek
Prepare for "Fix type-safety of torch.nn.Module instances": fbcode/h*
CLA Signed
fb-exported
#141
opened Nov 20, 2024 by
ezyang
Add complete_cumsum cpu and meta ops
CLA Signed
fb-exported
#140
opened Nov 20, 2024 by
jiyuanzFB
add pytorch implementations for jagged operations
CLA Signed
fb-exported
#137
opened Nov 19, 2024 by
zhaozhul
Replace Triton addmm with torch.addmm for AMD to achieve better training performance
CLA Signed
fb-exported
#133
opened Nov 19, 2024 by
yoyoyocmu
Change autotune key for ragged_hstu_attention to support dynamic batch size
CLA Signed
fb-exported
#132
opened Nov 18, 2024 by
AlbertDachiChen
Bug fix: detecting contextual length in triton hstu attn code
CLA Signed
fb-exported
#130
opened Nov 18, 2024 by
lic225
Fix type-safety of torch.nn.Module instances
CLA Signed
fb-exported
#129
opened Nov 18, 2024 by
ezyang
move sorted_kv_pairs from hammer/ops/cuda/ to hammer/ops/cpp/
CLA Signed
fb-exported
#128
opened Nov 18, 2024 by
jiyuanzFB
copy complete_cumsum kernel to ops/cpp/
CLA Signed
fb-exported
#127
opened Nov 15, 2024 by
jiyuanzFB
Redefine FBGEMM targets with gpu_cpp_library (Re-land attempt of D64863809)
CLA Signed
fb-exported
#126
opened Nov 15, 2024 by
q10
num_stages=0 becomes num_stages=2
CLA Signed
fb-exported
#125
opened Nov 14, 2024 by
nmacchioni
Fix max_attn_len numerical
CLA Signed
fb-exported
#124
opened Nov 14, 2024 by
hanli0612
Enable HSTU SMEM for TW and PW with autotuning for both SMEM preload and maxnregs
CLA Signed
fb-exported
#92
opened Oct 7, 2024 by
plotfi
[Triton SMEM] Add not-yet-landed usage of Triton SMEM feature with autotuning
CLA Signed
[WIP] TMA Version of HSTU (Autotuned)
CLA Signed
#71
opened Aug 21, 2024 by
plotfi
TMA version of hstu
CLA Signed
#57
opened Jul 24, 2024 by
manman-ren
Draft