-
Beijing University of Posts and Telecommunications
- Beijing
-
10:58
(UTC +08:00)
Stars
RISE: Reliable Improvement in Self-Evolving Vision-Language Models
TransitLM: A Large-Scale Dataset and Benchmark for Map-Free Transit Route Generation
[SIGGRAPH 2026] MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
This is the official repository of "LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics".
[2026 CVPR]Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation
[ICLR2026] Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
DreamX-World: A General-Purpose Interactive World Model
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
Let Skills Evolve Collectively with Agentic Evolver
This repo has scripts to compare various powerful RL methods
A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.
Official implementation of the CVPR 2026 paper "Adapting In-context Generation for Enhanced Composed Image Retrieval"
Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
[ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
Offitial implementation of the ICASSP 2026 paper "Relational Dual-Granularity Distillation for Text-Based Person Retrieval"
Official implementation of the AAAI 2026 paper "Modality and Task Adaptation for Enhanced Zero-shot Composed Image Retrieval"
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Official implementation of the ICME 2025 paper "Slot Inversion for Asymmetric Composed Image Retrieval"
Offitial implementation of the ICASSP 2025 paper "Object-Centric Discriminative Learning for Text-Based Person Retrieval"
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.