Skip to content

Tags: sail-sg/oat

Tags

v0.2.4

Toggle v0.2.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
chore: minor updates on logging and resource allocation (#73)

v0.2.3

Toggle v0.2.3's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: add fp16 training (#70)

v0.2.2

Toggle v0.2.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: support LoRA RL training (#64)

* feat: support LoRA RL training

* minor

v0.2.1

Toggle v0.2.1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: truncated importance sampling to handle precision mismatch (#62)

* support tis

* make tis default

v0.2.0

Toggle v0.2.0's commit message
update readme

v0.1.4

Toggle v0.1.4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: reduce vram footprint (#56)

* feat: add fused lm head to reduce vram usage

* fix sft slicing and add dry run

* minor

* bump version

v0.1.3.post2

Toggle v0.1.3.post2's commit message
Add one file for k8s job launcher

v0.1.2

Toggle v0.1.2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
feat: fix deps, refactor apis, allow resume training (#39)

* fix deps, refactor apis

* bump version

* updates

* actor identity

* fix ref offload

* training resume

* bump version

v0.1.0

Toggle v0.1.0's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Upgrade to vllm V1 (0.8.4) and use actor api init() (#38)

* updates

* bump version

v0.0.9

Toggle v0.0.9's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Upgrade vllm for more efficient collocation (#34)

* upgrade vllm & adopt collective_rpc

* use .float() for kl & increase timeout to 60m

* speed up minibatch training

* add constant lr scheduler

* update

* updates

* fix non_eos detection

* changes

* minor

* update

* ratio

* updates