Tags: sail-sg/oat
v0.2.4
chore: minor updates on logging and resource allocation (#73)
v0.2.3
feat: add fp16 training (#70)
v0.2.2
feat: support LoRA RL training (#64)
* feat: support LoRA RL training
* minor
v0.2.1
fix: truncated importance sampling to handle precision mismatch (#62)
* support tis
* make tis default
v0.2.0
v0.1.4
feat: reduce vram footprint (#56)
* feat: add fused lm head to reduce vram usage
* fix sft slicing and add dry run
* minor
* bump version
v0.1.3.post2
Add one file for k8s job launcher
v0.1.2
feat: fix deps, refactor apis, allow resume training (#39)
* fix deps, refactor apis
* bump version
* updates
* actor identity
* fix ref offload
* training resume
* bump version
v0.1.0
Upgrade to vllm V1 (0.8.4) and use actor api init() (#38)
* updates
* bump version
v0.0.9
Upgrade vllm for more efficient collocation (#34)
* upgrade vllm & adopt collective_rpc
* use .float() for kl & increase timeout to 60m
* speed up minibatch training
* add constant lr scheduler
* update
* updates
* fix non_eos detection
* changes
* minor
* update
* ratio
* updates