Skip to content
View LemonTency's full-sized avatar

Block or report LemonTency

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
569 stars written in Python
Clear filter

[NeurIPS 2024 Datasets and Benchmarks Track] Closed-Loop E2E-AD Benchmark Enhanced by World Model RL Expert

Python 1,688 104 Updated Feb 18, 2025

airda(Air Data Agent)是面向数据分析的多智能体,能够理解数据开发和数据分析需求、理解数据、生成面向数据查询、数据可视化、机器学习等任务的SQL和Python代码

Python 1,673 267 Updated Jan 7, 2025

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Python 1,633 151 Updated Nov 6, 2025

RoboTwin 2.0 Offical Repo

Python 1,626 205 Updated Oct 22, 2025

Train your Agent model via our easy and efficient framework

Python 1,606 149 Updated Nov 3, 2025

[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training

Python 1,601 156 Updated Dec 8, 2023

"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"

Python 1,537 206 Updated Oct 16, 2025

【ICML 2025 Spotlight】 Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’

Python 1,533 234 Updated Nov 2, 2025

Recipes to train reward model for RLHF.

Python 1,476 102 Updated Apr 24, 2025

Distributed GPU-Accelerated Framework for Evolutionary Computation. Comprehensive Library of Evolutionary Algorithms & Benchmark Problems.

Python 1,433 202 Updated Nov 6, 2025

[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…

Python 1,385 94 Updated Sep 21, 2025

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,381 96 Updated Nov 4, 2025
Python 1,372 16 Updated Oct 9, 2024

[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling"

Python 1,360 84 Updated Jan 23, 2024

Real-time and accurate open-vocabulary end-to-end object detection

Python 1,344 111 Updated Dec 18, 2024

[TPAMI 2025🔥] MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Python 1,341 124 Updated Jul 21, 2025

Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]

Python 1,327 53 Updated Oct 15, 2025

[CVPR 2025] Hallo3: Highly Dynamic and Realistic Portrait Image Animation with Video Diffusion Transformer

Python 1,325 175 Updated Mar 13, 2025

Uncommon Objects in 3D dataset

Python 1,304 181 Updated Mar 17, 2025

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Python 1,301 107 Updated Mar 11, 2025

( TPAMI2022 / CVPR2019 Oral ) Taking A Closer Look at Domain Shift: Category-level Adversaries for Semantics Consistent Domain Adaptation

Python 1,292 43 Updated Mar 13, 2021

[CVPR'23] Universal Instance Perception as Object Discovery and Retrieval

Python 1,277 121 Updated Jul 18, 2023

Unified KV Cache Compression Methods for Auto-Regressive Models

Python 1,274 160 Updated Jan 4, 2025

一账通是一款开源的统一身份认证授权管理解决方案,支持多种标准协议(LDAP, OAuth2, SAML, OpenID),细粒度权限控制,完整的WEB管理功能,钉钉、企业微信集成等,QQ group: 167885406

Python 1,274 255 Updated Oct 4, 2023

An intelligent assistant serving the entire software development lifecycle, powered by a Multi-Agent Framework, working with DevOps Toolkits, Code&Doc Repo RAG, etc.

Python 1,243 131 Updated Jul 1, 2024

[NIPS'25 Spotlight] Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS

Python 1,224 110 Updated Sep 19, 2025

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

Python 1,224 104 Updated Mar 2, 2025

[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Python 1,208 56 Updated Jul 9, 2025