-
北京交通大学
- BeiJing
- https://orcid.org/0000-0003-4635-7032
Lists (2)
Sort Name ascending (A-Z)
Stars
⛷ Lightweight Markdown app to help you write great sentences.
本项目致力于设计和实现一个创新的基于嵌入式AI的ROI区域视频传输系统,旨在通过智能识别和优先传输视频中的关键区域(ROI),大幅提高视频监控和远程通信的效率与质量。利用先进的嵌入式AI技术,本系统能够在不牺牲视频质量的前提下,显著降低数据传输的带宽需求,为安全监控、远程教学、医疗健康等应用领域带来革命性的改进。
An implementation of DashGaussian, a powerful 3DGS training acceleration method. Accepted by CVPR 2025 (highlight).
ARTDECO unifies 3D foundation priors with structured scene representations, enabling robust and generalizable 3D reconstruction of diverse real-world scenes using only monocular video.
Code for "FlashWorld: High-quality 3D Scene Generation within Seconds"
A 3DGS framework for omni urban scene reconstruction and simulation.
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
Tongyi Deep Research, the Leading Open-source Deep Research Agent
[CVPR 2025] FineVQ: Fine-Grained User Generated Content Video Quality Assessment
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
[SIGGRAPH Asia 2025] WorldExplorer: Towards Generating Fully Navigable 3D Scenes
AIPPT Online editor,Base On ChatPPT, supports document editing services throughout the entire process, including import, export, layout beautification, online editing, playback, and presentation an…
🦜🔗 The platform for reliable agents.
Integrate the DeepSeek API into popular softwares
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Long-form streaming TTS system for multi-speaker dialogue generation
[SIGGRAPH 2025] LAM: Large Avatar Model for One-shot Animatable Gaussian Head
[ACM CSUR 2025] Understanding World or Predicting Future? A Comprehensive Survey of World Models
An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.
[ACL 2025 Main] SceneGenAgent: Precise Industrial Scene Generation with Coding Agent