Ironieser

Independent Researcher

34 followers · 32 following

US

Stars

EvolvingLMMs-Lab / LLaVA-OneVision-2

Fully Open Framework for Democratized Multimodal Training

Python 1,093 75 Updated Jun 18, 2026

Fanziyang-v / FlashVID

[ICLR 2026 Oral] FlashVID: Efficient Video Large Language Models via Training-free Tree-based Spatiotemporal Token Merging

Python 106 Updated Apr 30, 2026

Ironieser / hive-cli

A lightweight GPU node manager designed for agentic coding workflows on SLURM clusters.

Python 1 Updated Jun 9, 2026

lbjlaq / Antigravity-Manager

Professional Antigravity Account Manager & Switcher. One-click seamless account switching for Antigravity Tools. Built with Tauri v2 + React (Rust). A professional Antigravity account management and switching tool. Provides Antigravity with one-click seamless account switch…

Rust 29,793 3,223 Updated Jun 17, 2026

OpenLAIR / dr-claw

A Super AI Lab with massive AI Doctors as Assistants. Best IDE for Research via AI Power.

JavaScript 1,008 106 Updated Jun 16, 2026

Snitro / Pointer-CAD

Python 271 16 Updated May 14, 2026

apple / ml-mobileclip

This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025

Python 1,551 123 Updated Apr 15, 2026

Ironieser / MMTok

[ICLR 2026] The official repo of "MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs"

Python 41 4 Updated Mar 11, 2026

Ironieser / video_subtitles

Small toolchain to turn spoken audio into ASS subtitles using Whisper, and optionally burn them into video with FFmpeg (hardsubs).

Python 2 Updated Feb 10, 2026

ModelTC / LightX2V

Lightweight Image Video Action Generation Inference Framework

Python 2,424 221 Updated Jun 18, 2026

Lightricks / LTX-2

Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.

Python 7,479 1,206 Updated Jun 17, 2026

lpcvai / 26LPCVC_Track2_Sample_Solution

Python 11 6 Updated May 1, 2026

QwenLM / Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…

Python 12,021 1,558 Updated Mar 17, 2026

lcqysl / FrameThinker

[ICLR 2026] Official repo for "FrameThinker: Learning to Think with Long Videos via Multi-Turn Frame Spotlighting"

Python 50 5 Updated Oct 9, 2025

lasgroup / SDPO

Reinforcement Learning via Self-Distillation (SDPO)

Python 957 107 Updated Feb 18, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,404 79,422 Updated Jun 18, 2026

THUDM / slime

slime is an LLM post-training framework for RL Scaling.

Python 6,210 905 Updated Jun 18, 2026

wenhaochai / aurora

[ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Python 144 6 Updated Jun 4, 2025

Espere-1119-Song / Video-MMLU

A Massive Multi-Discipline Lecture Understanding Benchmark

Python 34 Updated Apr 20, 2026

SWE-agent / SWE-agent

SWE-agent takes a GitHub issue and tries to automatically fix it, using your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2024]

Python 19,556 2,139 Updated Jun 17, 2026

MustangYM / WeChatExtension-ForMac

A plugin for Mac WeChat

Objective-C 22,622 3,562 Updated Feb 13, 2025

cupid3d / Cupid

[CVPR'26 Highlight] Cupid: A 3D generator that links 2D image with camera

Jupyter Notebook 213 7 Updated Mar 3, 2026

ByteDance-Seed / Seed1.5-VL

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,582 65 Updated Jun 14, 2025