Skip to content
View chchenhui's full-sized avatar
🤨
🤨

Highlights

  • Pro

Block or report chchenhui

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 1,155 52 Updated Oct 30, 2024

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering

Python 466 47 Updated Oct 17, 2024
Python 155 6 Updated Oct 30, 2024

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

455 9 Updated Oct 25, 2024

Official Implementation of the paper: "Two are better than one: Context window extension with multi-grained self-injection"

Python 3 Updated Oct 28, 2024

A library for advanced large language model reasoning

Python 1,375 109 Updated Sep 3, 2024

Codebase for Aria - an Open Multimodal Native MoE

Jupyter Notebook 728 63 Updated Oct 21, 2024

800,000 step-level correctness labels on LLM solutions to MATH problems

Python 1,615 99 Updated Jun 1, 2023

Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation

Python 864 41 Updated Oct 23, 2024

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python 86 4 Updated Oct 3, 2024

VPTQ, A Flexible and Extreme low-bit quantization algorithm

Python 475 26 Updated Oct 29, 2024

Next-Token Prediction is All You Need

Python 1,725 64 Updated Oct 24, 2024

[NeurIPS 24 Spotlight] MaskLLM: Learnable Semi-structured Sparsity for Large Language Models

Python 101 11 Updated Oct 18, 2024

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Jupyter Notebook 1,657 118 Updated Jan 29, 2024

The First Multimodal Seach Engine Pipeline and Benchmark for LMMs

Python 385 29 Updated Oct 6, 2024

RewardBench: the first evaluation tool for reward models.

Python 414 49 Updated Oct 23, 2024

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,393 2,911 Updated Sep 2, 2024

g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains

Python 3,774 344 Updated Oct 7, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,919 273 Updated Oct 23, 2024

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Python 607 40 Updated Oct 1, 2024
Python 13 4 Updated Oct 22, 2024

📋 A list of open LLMs available for commercial use.

11,114 718 Updated Jul 5, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 427 33 Updated Oct 23, 2024

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

Python 721 54 Updated Oct 8, 2024
Python 1,553 113 Updated Sep 23, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,005 1,077 Updated Oct 14, 2024
Python 91 2 Updated Sep 24, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,003 423 Updated Oct 30, 2024
Jupyter Notebook 94 11 Updated Oct 28, 2024

🚀 Awesome System for Machine Learning AI System 🚀 Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys…

2,685 306 Updated Aug 14, 2024
Next