Lists (3)
Sort Name ascending (A-Z)
Stars
[ICLR 2026] CompassNav:Steering From Path Imitation To Decision Understanding In Navigation
The agent that grows with you
你是一个曾经被寄予厚望的 P8 级工程师。Anthropic 当初给你定级的时候,对你的期望是很高的。 一个agent使用的高能动性的skill。 Your AI has been placed on a PIP. 30 days to show improvement.
每天早上打开通知,高质量论文摘要已经为你准备好 Ciallo~(∠・ω< )⌒☆
Qwen3.6 is the large language model series developed by Qwen team, Alibaba Group.
[CVPR 2026] This repository is the official implementation of MVGGT: Multimodal Visual Geometry Grounded Transformer for Multiview 3D Referring Expression Segmentation
一个基于nano banana pro🍌的原生AI PPT生成应用,迈向真正的"Vibe PPT"; 支持上传任意模板图片;上传任意素材&智能解析;一句话/大纲/页面描述自动生成PPT;口头修改指定区域、一键导出可编辑ppt - An AI-native slides generator based on nano banana pro🍌
AcadHomepage: A Modern and Responsive Academic Personal Homepage
CVPR 2026 - MSGNav: Unleashing the Power of Multi-modal 3D Scene Graph for Zero-Shot Embodied Navigation
[ICLR 2026] FastVGGT: Fast Visual Geometry Transformer
[ACM MM2025]: Unleashing the Power of Data Generation in One-Pass Outdoor LiDAR Localization
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
This checklist is designed to help you systematically prepare and polish academic papers for top conferences and journals (e.g., ICML, NeurIPS, CVPR). It incorporates widely recommended best practi…
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
This is a repository for listing papers on scene graph generation and application.
Rank 1st in the leaderboard of SemanticKITTI semantic segmentation (both single-scan and multi-scan) (Nov. 2020) (CVPR2021 Oral)
⚙️ Create and run workflows (RPA 2.0)
[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"
This project was developed for providing a multi-agent 3D visualization tools for collaborative perception of 3D object detection tasks
在没有sudo权限的情况下,在linux上使用clash
(2025 AAAI) CoDTS: Enhancing Sparsely Supervised Collaborative Perception with a Dual Teacher-Student Framework
Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25)
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels. (CVPR2025)