Stars
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Official Implementation of the paper: "Two are better than one: Context window extension with multi-grained self-injection"
A library for advanced large language model reasoning
Codebase for Aria - an Open Multimodal Native MoE
800,000 step-level correctness labels on LLM solutions to MATH problems
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
VPTQ, A Flexible and Extreme low-bit quantization algorithm
[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models
Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"
The First Multimodal Seach Engine Pipeline and Benchmark for LMMs
RewardBench: the first evaluation tool for reward models.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
📋 A list of open LLMs available for commercial use.
OLMoE: Open Mixture-of-Experts Language Models
[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
🚀 Awesome System for Machine Learning AI System 🚀 Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys…