View mxin262's full-sized avatar

363 results for source starred repositories

Public repository for Agent Skills

Python 64,172 6,334 Updated Feb 4, 2026

GLM-OCR: Accurate × Fast × Comprehensive

Python 605 38 Updated Feb 6, 2026

Kimi Code CLI is your next CLI agent.

Python 6,058 562 Updated Feb 5, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 168,525 26,920 Updated Feb 6, 2026

Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.

390 26 Updated Jan 21, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 3,574 241 Updated Jan 14, 2026

GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation.

Python 733 43 Updated Feb 2, 2026

MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model

1,042 42 Updated Jan 8, 2026

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 23,150 3,654 Updated Jan 20, 2026

STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

Python 1,977 172 Updated Jan 23, 2026

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Python 187 11 Updated Jan 26, 2026
Python 9,891 626 Updated Jan 30, 2026

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Python 141 7 Updated Dec 17, 2025

SAM 3D Objects

Python 5,893 645 Updated Feb 3, 2026

Official implementation of URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding (AAAI 2026 Oral).

Python 34 Updated Feb 4, 2026

Native Multimodal Models are World Learners

Python 1,449 55 Updated Dec 30, 2025

Contexts Optical Compression

Python 22,393 2,059 Updated Jan 27, 2026

QeRL enables RL for 32B LLMs on a single H100 GPU.

Python 481 48 Updated Nov 27, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,390 210 Updated Jan 8, 2026

Code for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)

Python 754 72 Updated Feb 4, 2026

MCP for xiaohongshu.com

Go 8,521 1,344 Updated Feb 4, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. It is also the official code for the paper "Group-in-Group Policy Optimization for LLM Agent Training".

Python 1,500 132 Updated Jan 30, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 18,176 1,403 Updated Jan 21, 2026

A lightweight Python library for simulating Chinese handwriting

Python 2,218 258 Updated Apr 6, 2024

The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.

Dockerfile 284 7 Updated Sep 26, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,736 2,030 Updated Jan 13, 2026

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 7,237 422 Updated Dec 31, 2025