288 stars written in Python

Public repository for Agent Skills

Python 104,840 11,560 Updated Mar 25, 2026

real time face swap and one-click video deepfake with only a single image

Python 83,725 12,248 Updated Mar 27, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,225 10,044 Updated Mar 26, 2026

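Provides a practical interaction interface for large language models such as GPT/GLM, specifically optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other projects; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; support for local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…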

Python 70,315 8,395 Updated Jan 25, 2026

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 59,571 9,419 Updated Mar 9, 2026

Inference code for Llama models

Python 59,274 9,827 Updated Jan 26, 2025

AI agents running research on single-GPU nanochat training automatically

Python 58,965 8,174 Updated Mar 26, 2026

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 57,432 4,754 Updated Mar 28, 2026

Ultralytics YOLO 🚀

Python 55,118 10,594 Updated Mar 28, 2026

Grok open release

Python 51,527 8,468 Updated Aug 30, 2024

Making large AI models cheaper, faster and more accessible

Python 41,377 4,523 Updated Mar 16, 2026

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 39,455 4,786 Updated Jun 2, 2025

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Python 36,893 5,233 Updated Mar 3, 2026

"🐈 nanobot: The Ultra-Lightweight OpenClaw"

Python 36,689 6,304 Updated Mar 27, 2026

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 34,825 4,292 Updated Aug 6, 2024

Let us control diffusion models!

Python 33,782 3,005 Updated Feb 25, 2024

OpenMMLab Detection Toolbox and Benchmark

Python 32,547 9,851 Updated Aug 21, 2024

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,262 4,003 Updated Jul 17, 2024

The official Meta Llama 3 GitHub site

Python 29,296 3,528 Updated Jan 26, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 28,357 2,636 Updated Mar 28, 2026

Fully open reproduction of DeepSeek-R1

Python 25,969 2,414 Updated Nov 24, 2025

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,751 2,913 Updated Sep 2, 2024

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Python 24,988 3,479 Updated Mar 27, 2026

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,620 2,748 Updated Aug 12, 2024

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 24,611 3,876 Updated Mar 6, 2026

A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone

Python 24,211 1,878 Updated Mar 7, 2026

Graph Neural Network Library for PyTorch

Python 23,614 3,970 Updated Mar 27, 2026

Contexts Optical Compression

Python 22,761 2,093 Updated Jan 27, 2026