Stars
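"A Guide to Consuming Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment.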
Official repository for "PosterO: Structuring Layout Trees to Enable Language Models in Generalized Content-Aware Layout Generation" (CVPR 2025).
Machine Learning Engineering Open Book
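🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.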
A PyTorch native platform for training generative AI models
Model Merging in LLMs, MLLMs, and Beyond: Methods, Theories, Applications and Opportunities. arXiv:2408.07666.
Tools for merging pretrained large language models.
Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
This list has grown very large and now differs from the original idea. It collects premium software in various categories.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…
Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.
Minimal reproduction of DeepSeek R1-Zero
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & TIS & vLLM & Ray & Dynamic Sampling & Async Agentic RL)
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
[TMLR 2025🔥] A survey of autoregressive models in vision.
A PyTorch implementation of Model Agnostic Meta-Learning (MAML) that faithfully reproduces the results from the original paper.
[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Curated list of datasets and tools for post-training.
Official inference repo for FLUX.1 models
Run PyTorch LLMs locally on servers, desktop and mobile