OmniVideo-100K: A Dataset for Audio-Visual Reasoning through Structured Scripts and Evidence Chains

Python 43 2 Updated Jun 16, 2026

A Skill for academic writing and scientific figure drawing in the style of Nature papers

Python 20,708 1,221 Updated Jun 17, 2026
Python 1 Updated May 4, 2026

Interactive deep-dives into ML papers

TypeScript 8 2 Updated Jun 14, 2026

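AI-driven tool for in-depth analysis of academic papers: MinerU parsing + Claude-generated illustrated technical articles + code-level pinpointing of innovations on GitHub, with results saved automatically to an Obsidian vault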

Python 15 1 Updated Mar 25, 2026

WeChat 4.0 database decryptor - extract keys from memory, decrypt SQLCipher 4 databases, real-time message monitor

Python 4,092 2,182 Updated Jun 10, 2026

Official implementation of "PyVision-RL: Forging Open Agentic Vision Models via RL."

Python 67 2 Updated Feb 25, 2026

✨✨ [ICLR 2026] Think Beyond Images

Python 581 37 Updated Sep 23, 2025

Transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper.

Python 163 10 Updated Jun 10, 2026

Code2World: A GUI World Model via Renderable Code Generation

Python 321 18 Updated Feb 12, 2026
Python 61 Updated Feb 9, 2026

[ICLR 2026] Harder Is Better: Boosting Mathematical Reasoning via Difficulty-Aware GRPO and Multi-Aspect Question Reformulation

Python 129 1 Updated May 17, 2026

A curated list of vibe coding references, collaborating with AI to write code.

4,763 560 Updated Apr 16, 2026

🤖 Awesome list of AI Agents

2,425 633 Updated Jun 14, 2026

[ICLR26] Official implementation of the paper "Urban Socio-Semantic Segmentation with Vision-Language Reasoning"

Python 175 4 Updated Mar 12, 2026

[ACL 2026 Findings] Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization

Python 176 6 Updated Mar 9, 2026

Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool

Python 715 75 Updated Jun 16, 2026

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,546 1,483 Updated Jun 17, 2026

[ICLR2026] There is No VAE: End-To-End Pixel-Space Generative Modeling Via Self-Supervised Pre-Training

Shell 149 4 Updated Mar 27, 2026

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

Python 827 36 Updated Apr 4, 2026
Python 27 Updated Oct 10, 2025

✨✨ [ICLR 2026] MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models

Python 42 4 Updated Apr 10, 2025

Aesthetically-Relevant-Image-Captioning

Python 35 1 Updated Apr 26, 2023

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 6,548 380 Updated Jun 17, 2026

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 1 Updated Mar 12, 2026

③[ICML2024] [IQA, IAA, VQA] All-in-one Foundation Model for visual scoring. Can efficiently fine-tune to downstream datasets.

Python 603 33 Updated Mar 12, 2025

A Survey of Reinforcement Learning for Large Reasoning Models

TeX 2,466 130 Updated Nov 9, 2025

🔎 🖼️ 🔥PyTorch Toolbox for Image Quality Assessment, including PSNR, SSIM, LPIPS, FID, NIQE, NRQM(Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...

Python 3,281 247 Updated May 31, 2026