- GMU
- Fairfax, VA
- zye1996.github.io
Stars
A latent text-to-image diffusion model
Google Research
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A guidance language for controlling large language models.
Instruct-tune LLaMA on consumer hardware
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Free MLOps course from DataTalks.Club
Foundational Models for State-of-the-Art Speech and Text Translation
LAVIS - A One-stop Library for Language-Vision Intelligence
QLoRA: Efficient Finetuning of Quantized LLMs
Official inference library for Mistral models
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
My blogs and code for machine learning. http://cnblogs.com/pinard
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
Inpaint anything using Segment Anything and inpainting models.
YOLOv6: a single-stage object detection framework dedicated to industrial applications.
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
Language-Agnostic SEntence Representations
Luotuo (骆驼): Open-source Chinese language models. Developed by 陈启源 @ Central China Normal University, 李鲁鲁 @ SenseTime, and 冷子昂 @ SenseTime
Debugging, monitoring and visualization for Python Machine Learning and Data Science
We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use. We welcome open-source enthusiasts…
The hub for EleutherAI's work on interpretability and learning dynamics
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Easily compute CLIP embeddings and build a CLIP retrieval system with them
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜