Tags · sail-sg/oat

v0.2.4

chore: minor updates on logging and resource allocation (#73)

Dec 23, 2025
1b52eed

v0.2.3

feat: add fp16 training (#70)

Oct 31, 2025
c1a074c

v0.2.2

feat: support LoRA RL training (#64)

* feat: support LoRA RL training

* minor

Oct 2, 2025
e1164ac

v0.2.1

fix: truncated importance sampling to handle precision mismatch (#62)

* support tis

* make tis default

Aug 24, 2025
f9adda7

v0.2.0

update readme

Jul 24, 2025
7238daf

v0.1.4

feat: reduce vram footprint (#56)

* feat: add fused lm head to reduce vram usage

* fix sft slicing and add dry run

* minor

* bump version

Jul 9, 2025
e6fa2ec

v0.1.3.post2

Add one file for k8s job launcher

Jun 27, 2025
78d6f95

v0.1.2

feat: fix deps, refactor apis, allow resume training (#39)

* fix deps, refactor apis

* bump version

* updates

* actor identity

* fix ref offload

* training resume

* bump version

May 6, 2025
52ceaa7

v0.1.0

Upgrade to vllm V1 (0.8.4) and use actor api init() (#38)

* updates

* bump version

Apr 18, 2025
43532b3

v0.0.9

Upgrade vllm for more efficient collocation (#34)

* upgrade vllm & adopt collective_rpc

* use .float() for kl & increase timeout to 60m

* speed up minibatch training

* add constant lr scheduler

* update

* updates

* fix non_eos detection

* changes

* minor

* update

* ratio

* updates

Mar 21, 2025
59eb01b
