🚖
On The Journey to Neverland
Kaggle 3x Expert, Data Scientist focusing on deep sequence modeling for TimeSeries, Computer Vision and NLP
Highlights
- Pro
Stars
Large language models
3 repositories
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
🤗 smolagents: a barebones library for agents that think in code.