Skip to content
View Ironieser's full-sized avatar
  • Independent Researcher
  • US

Block or report Ironieser

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fully Open Framework for Democratized Multimodal Training

Python 1,093 75 Updated Jun 18, 2026

[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

Python 106 Updated Apr 30, 2026

A lightweight GPU node manager designed for agentic coding workflows on SLURM clusters.

Python 1 Updated Jun 9, 2026

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust).专业的 Antigravity 账号管理与切换工具。为 Antigravity 提供一键无缝账号切…

Rust 29,793 3,223 Updated Jun 17, 2026

A Super AI Lab with massive AI Doctors as Assistants. Best IDE for Research via AI Power.

JavaScript 1,008 106 Updated Jun 16, 2026
Python 271 16 Updated May 14, 2026

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,551 123 Updated Apr 15, 2026

[ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"

Python 41 4 Updated Mar 11, 2026

Small toolchain to turn spoken audio into ASS subtitles using Whisper, and optionally burn them into video with FFmpeg (hardsubs).

Python 2 Updated Feb 10, 2026

Lightweight Image Video Action Generation Inference Framework

Python 2,424 221 Updated Jun 18, 2026

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 7,479 1,206 Updated Jun 17, 2026

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,021 1,558 Updated Mar 17, 2026

[ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"

Python 50 5 Updated Oct 9, 2025

Reinforcement Learning via Self-Distillation (SDPO)

Python 957 107 Updated Feb 18, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,404 79,422 Updated Jun 18, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,210 905 Updated Jun 18, 2026

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 144 6 Updated Jun 4, 2025

A Massive Multi-Discipline Lecture Understanding Benchmark

Python 34 Updated Apr 20, 2026

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,556 2,139 Updated Jun 17, 2026

A plugin for Mac WeChat

Objective-C 22,622 3,562 Updated Feb 13, 2025

[CVPR'26 Highlight] Cupid: A 3D generator that links 2D image with camera

Jupyter Notebook 213 7 Updated Mar 3, 2026

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,582 65 Updated Jun 14, 2025

[CVPR 2026] Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens

Python 285 17 Updated Aug 2, 2025

Agentic AI research papers, benchmarks, frameworks, and tools curated across 24 domains.

153 4 Updated Jun 15, 2026

Virtual whiteboard for sketching hand-drawn like diagrams

TypeScript 125,597 14,071 Updated Jun 17, 2026

Short Text Topic Modeling, JAVA

Java 162 41 Updated May 24, 2020

Materials and demo code for CSE 572 tutorial sessions (environment setup, Git, Python projects).

Python 1 Updated Sep 27, 2025

📚 Collection of token-level model compression resources.

198 9 Updated Sep 3, 2025

FlashInfer: Kernel Library for LLM Serving

Python 5,820 1,062 Updated Jun 18, 2026
Next