Skip to content
View thisisiron's full-sized avatar
😵‍💫
😵‍💫

Organizations

@ai-rush-2019 @bcaitech1 @ml-zip

Block or report thisisiron

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A diffusion-based framework for document OCR that replaces autoregressive decoding with block-level parallel diffusion decoding.

Python 556 35 Updated Mar 31, 2026

🚀 Open source Claude Code CLI source code. Advanced AI Agent for developers. Includes TypeScript codebase for LLM tool-calling, agentic workflows, and terminal UI. Remember this is just the skeleto…

TypeScript 2,619 4,055 Updated Apr 4, 2026

A Scientific Multimodal Foundation Model

772 41 Updated Mar 27, 2026

Ghostty-based macOS terminal with vertical tabs and notifications for AI coding agents

Swift 14,477 1,059 Updated Apr 17, 2026

AI Agent Framework, the Pydantic way

Python 16,434 1,951 Updated Apr 17, 2026

A complete AI agency at your fingertips - From frontend wizards to Reddit community ninjas, from whimsy injectors to reality checkers. Each agent is a specialized expert with personality, processes…

Shell 81,521 13,050 Updated Apr 12, 2026

OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.

TypeScript 23,757 2,026 Updated Apr 17, 2026

From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models

Python 108 Updated Feb 27, 2026

A list of AI autonomous agents

27,321 2,740 Updated Feb 26, 2025

torchange - A Unified Change Representation Learning Benchmark Library

Python 235 20 Updated Feb 1, 2026

GeoVLM-R1: Reinforcement Fine-Tuning for Improved Remote Sensing Reasoning

Shell 28 Updated Mar 27, 2026

A comprehensive and up-to-date compilation of datasets, tools, methods, review papers, and competitions for remote sensing change detection.

2,184 390 Updated Apr 16, 2026

This repo contains a curative list of scene change detection(SCD), including papers, videos, codes, and related websites.

142 8 Updated Apr 14, 2026

An agentic skills framework & software development methodology that works.

Shell 157,263 13,663 Updated Apr 16, 2026

A collection of 100+ specialized Claude Code subagents covering a wide range of development use cases

Shell 17,563 2,006 Updated Apr 17, 2026

Intelligent automation and multi-agent orchestration for Claude Code

Python 33,769 3,666 Updated Apr 16, 2026

45 tips for getting the most out of Claude Code, from basics to advanced - includes a custom status line script, cutting the system prompt in half, using Gemini CLI as Claude Code's minion, and Cla…

JavaScript 7,752 571 Updated Apr 2, 2026

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 19,767 881 Updated Apr 11, 2026

🚀 Beautiful highly customizable statusline for Claude Code CLI with powerline support, themes, and more.

TypeScript 7,729 331 Updated Apr 17, 2026

A Claude Code plugin that automatically captures everything Claude does during your coding sessions, compresses it with AI (using Claude's agent-sdk), and injects relevant context back into future …

TypeScript 61,045 5,043 Updated Apr 17, 2026

Visual Causal Flow

Python 2,709 226 Updated Feb 3, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 159,375 24,783 Updated Apr 16, 2026

Curated Claude Code plugin marketplace

852 161 Updated Apr 14, 2026

🧮 Calculator for vision tokens in VLMs.

Python 1 Updated Jan 11, 2026

The absolute trainer to light up AI agents.

Python 16,919 1,475 Updated Apr 3, 2026

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 4,054 679 Updated Apr 10, 2026

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Python 3,341 270 Updated Jan 18, 2025

This is the official repository for our recent work: PIDNet

Python 784 132 Updated Dec 18, 2025

The official implementation of "Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes"

Python 479 55 Updated Jun 19, 2023

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,684 252 Updated Jan 8, 2026
Next