On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 791 55 Updated Jul 5, 2025

The awesome collection of OpenClaw Skills. Formerly known as Moltbot, originally Clawdbot.

9,807 927 Updated Feb 5, 2026

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 3,914 316 Updated Jun 12, 2025

Mobile and Web client for Codex and Claude Code, with realtime voice, encryption and fully featured

TypeScript 10,436 785 Updated Jan 29, 2026

Transform your favorite cities into beautiful, minimalist designs. MapToPoster lets you create and export visually striking map posters with code.

Python 9,389 847 Updated Jan 30, 2026

Chrome DevTools for coding agents

TypeScript 23,335 1,382 Updated Feb 5, 2026

Open Source Visualized Route Tracing Tool for macOS, Windows, and Linux.

C# 3,595 150 Updated Jan 24, 2026

Gemini Nano Banana / Pro watermark maintenance tool

C++ 1,278 117 Updated Feb 2, 2026

Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"

Python 28 5 Updated Jun 30, 2025

A full-featured download manager.

JavaScript 50,701 4,816 Updated Jul 11, 2024

Axel tries to accelerate HTTP/FTP downloading process by using multiple connections for one file. It can use multiple mirrors for a download. Wilmer van der Gaast is the upstream author of Axel. Y …

Batchfile 19 8 Updated Sep 26, 2021

Toolkit of BDD100K Dataset for Heterogeneous Multitask Learning - CVPR 2020 Oral Paper

Python 526 74 Updated Mar 9, 2024

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 24,069 1,655 Updated Dec 29, 2025

🏗 Build container images for your Java applications.

Java 14,300 1,466 Updated Feb 5, 2026

An elegant and deeply customizable lyrics visualizer & versatile music player, built with WinUI3/Win2D

C# 1,512 44 Updated Feb 5, 2026

Personal CRM. Remember everything about your friends, family and business relationships.

PHP 24,209 2,412 Updated Nov 15, 2025

CV/resume generator for academics and engineers, YAML to PDF

Python 15,515 1,062 Updated Feb 5, 2026

A lightweight, highly customizable hardware performance monitor for the Windows desktop and taskbar. Supports monitoring of CPU, GPU, memory, disk, and network speed, plus FPS counting, plugin extensions, and memory cleanup.

C# 3,923 161 Updated Feb 5, 2026

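An AI-native PPT generator based on nano banana pro🍌, moving toward a true "Vibe PPT". Supports uploading any template image; uploading any material with intelligent parsing; automatic PPT generation from a single sentence, an outline, or page descriptions; spoken edits to a specified region; and one-click export to editable PPT.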

Python 11,627 1,345 Updated Feb 5, 2026

[2025] Efficient Vision Language Models: A Survey

47 3 Updated Jul 14, 2025

VS Code comment translation extension. Does not interfere with normal code and makes it easy to read source code quickly.

TypeScript 748 102 Updated Jan 6, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 32,351 3,070 Updated Feb 5, 2026
Python 58 7 Updated Dec 23, 2025

Run Stable Diffusion on Android Devices with Snapdragon NPU acceleration. Also supports CPU/GPU inference.

Kotlin 1,638 97 Updated Dec 17, 2025

Automate your mobile devices with natural language commands - an LLM agnostic mobile Agent 🤖

Python 7,601 771 Updated Feb 5, 2026

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 440 62 Updated Mar 22, 2024

Open-AutoGLM hybrid solution - run AI automation on your phone, no computer required

Kotlin 728 204 Updated Dec 10, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 23,138 3,654 Updated Jan 20, 2026

Context-aware AI assistant for your desktop. Ready to respond intelligently, seamlessly integrating multiple LLMs and MCP tools.

C# 5,423 315 Updated Feb 5, 2026

Open-Source Frontier Voice AI

Python 22,940 2,504 Updated Feb 3, 2026