Skip to content
View wmpscc's full-sized avatar
:octocat:
Focusing
:octocat:
Focusing

Organizations

@android-nuc @apachecn @CVI-SZU

Block or report wmpscc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Cook up amazing multimodal AI applications effortlessly with MiniCPM-o

Python 547 72 Updated May 20, 2026

Code for the Molmo2 Vision-Language Model

Python 540 36 Updated Mar 18, 2026

Unofficial PyTorch reproduction of DeepSeek's Thinking with Visual Primitives.

Python 57 4 Updated May 15, 2026

Clone of DeepSeek Thinking-with-Visual-Primitives

Makefile 103 99 Updated Apr 30, 2026

《御舆:解码 Agent Harness》42万字拆解 AI Agent 的Harness骨架与神经 —— Claude Code 架构深度剖析,15 章从对话循环到构建你自己的 Agent Harness。在线阅读网站:

3,340 723 Updated Apr 6, 2026

Beyond SFT-to-RL: Pre-alignment via Black-BoxOn-Policy Distillation for Multimodal RL

Python 78 2 Updated May 6, 2026

Universal memory layer for AI Agents

Python 56,279 6,409 Updated May 20, 2026

LightAgent: Lightweight AI agent framework with memory, mcp & skill. Supports multi-agent collaboration, self-learning, and major LLMs (OpenAI/DeepSeek/Qwen). Open-source with MCP/SSE protocol inte…

Python 977 126 Updated Apr 26, 2026

Multi-agent orchestration system with pipeline workflows, shared memory, and cooperative task execution

TypeScript 2 Updated Apr 26, 2026

[CVPR 2026 Oral] Pixel Diffusion Transformers for Image Generation

Python 643 49 Updated Apr 9, 2026

Enjoy the magic of Diffusion models!

Python 12,445 1,204 Updated May 19, 2026

The agent that grows with you

Python 159,334 25,835 Updated May 20, 2026

PDF Parser for AI-ready data. Automate PDF accessibility. Open-source.

Java 21,427 1,995 Updated May 20, 2026

AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods

TypeScript 52,067 6,222 Updated May 20, 2026
TypeScript 7,185 895 Updated May 17, 2026

A lightweight, open-source OpenClaw version built into your Claude Code.

TypeScript 1,120 201 Updated May 20, 2026

The repo is finally unlocked. enjoy the party! The fastest repo in history to surpass 100K stars ⭐. Join Discord: https://discord.gg/5TUQKqFWd Built in Rust using oh-my-codex.

Rust 192,092 109,949 Updated May 20, 2026

A unified evaluation suite for speech-to-text translation, covering SpeechLLMs, SFMs, and cascaded systems across diverse real-world speech phenomena.

Jupyter Notebook 32 4 Updated Apr 25, 2026

Toonflow 是开源一站式 AI 短剧创作工具,将小说、剧本快速转化为动画短剧。集成 AI 编剧、智能分镜、角色与视频生成,跨平台桌面端轻量部署,助力创作者低成本批量产出视觉内容。Toonflow is an open-source AI tool that turns stories and scripts into animated short dramas. Features AI…

HTML 8,181 1,417 Updated May 12, 2026

OpenClaw-RL: Train any agent simply by talking

Python 5,354 582 Updated May 12, 2026

🏛️ 三省六部制 · OpenClaw Multi-Agent Orchestration System — 9 specialized AI agents with real-time dashboard, model config, and full audit trails

Python 15,823 1,668 Updated May 6, 2026

🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org

Python 17,004 1,911 Updated May 19, 2026

Deep Researcher: Multi-agent, multi-model agenticAI workflow for autonomous literature review using CrewAI

Python 1 Updated Mar 6, 2026

A Large-scale Video Action Dataset

Python 466 13 Updated Jan 16, 2026
Python 1,822 81 Updated Dec 16, 2025

slime is an LLM post-training framework for RL Scaling.

Python 5,737 804 Updated May 20, 2026

Helios: Real Real-Time Long Video Generation Model

Python 1,831 143 Updated Apr 16, 2026
Python 25 1 Updated Jan 31, 2026

A simple and effective LLM pruning approach.

Python 864 130 Updated Aug 9, 2024
Next