Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
A lightweight LMM-based Document Parsing Model
MiroThinker is an open source deep research agent optimized for research and prediction. It achieves a 80.8% Avg@8 score on the challenging GAIA benchmark.
Klavis AI (YC X25): MCP integration platforms that let AI agents use tools reliably at any scale
The open source platform for AI-native application development.
Align Anything: Training All-modality Model with Feedback
HunyuanVideo-1.5: A leading lightweight video generation model
Nexent is a zero-code platform for auto-generating agents — no orchestration, no complex drag-and-drop required. Nexent also offers powerful capabilities for agent running control, data processing …
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
The next generation deep reinforcement learning tookit
PromptEnhancer is a prompt-rewriting tool, refining prompts into clearer, structured versions for better image generation.
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence https://arxiv.org/abs/2509.03505
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
MiroFlow is an agent framework that enables tool-use agent tasks, featuring a reproducible GAIA score of 82.4%.
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台/MaaS/mlops/人工智能平台/训推平台,算法全链路流程,算力租赁平台,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务,VGPU虚拟化,云边端协同,边缘计算,自动化标注平台,deepseek等大模型sft微调/奖励模型/强化学习训练,vllm/ollama/mindie大模型多机推理,私有…
【ICML 2025 Spotlight】 Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximu…
HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
Synthetic Data Generation Platform By DataArcTech