MIT
Cambridge, MA
https://hjbahng.github.io/
Stars
Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.
Our library for RL environments + evals
H-Net: Hierarchical Network with Dynamic Chunking
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Entropy Based Sampling and Parallel CoT Decoding
Let's make video diffusion practical!
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-R1, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …
Robust recipes to align language models with human and AI preferences
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
For optimization algorithm research and development.
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
Reaching LLaMA2 Performance with 0.1M Dollars
Official inference library for Mistral models
A high-throughput and memory-efficient inference and serving engine for LLMs
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
RUCAIBox / POPE
Forked from AoiDragon/POPE
The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models"
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
Instruct-tune LLaMA on consumer hardware
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
A latent text-to-image diffusion model
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Instant neural graphics primitives: lightning fast NeRF and more