Stars
[NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning methods with easy feature extensibility.
Existing Literature about Machine Unlearning
slime is an LLM post-training framework for RL Scaling.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
About Awesome things towards foundation agents. Papers / Repos / Blogs / ...
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
A Zotero plugin for syncing items and notes into Notion
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Offer!
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
A high-throughput and memory-efficient inference and serving engine for LLMs
verl: Volcano Engine Reinforcement Learning for LLMs
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Simple, safe way to store and distribute tensors
Inpaint anything using Segment Anything and inpainting models.
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels