Stars
Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"
[ICLR 2026] RIVER: A Real-Time Interaction Benchmark for Video LLMs
🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖
A framework for few-shot evaluation of language models.
[CVPR 2026] Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering
[ICLR 2026 Oral] Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs
Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"
[ICSE 2026] Official implementation for "ADARULE: LLM-Driven Natural Language to LTL Conversion via Pattern-Adaptive Rule Induction"
Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]
CC GUI client (a VibeCoding platform built for developers)
Academic Research Skills for Claude Code: research → write → review → revise → finalize
Repository for "Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories", ICML 2026
Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding
Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Discord, LINE, WeChat Work). Chat with your AI dev assistant f…
SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles
[ACL '26 Main] CharTide: Data-Centric Chart-to-Code Generation via Tri-Perspective Tuning and Inquiry-Driven Evolution
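Crawlers for Xiaohongshu notes and comments, Douyin videos and comments, Kuaishou videos and comments, Bilibili videos and comments, Weibo posts and comments, Baidu Tieba posts and comment replies, and Zhihu Q&A articles and comments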
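Xiaohongshu (XiaoHongShu, RedNote) link extraction and post collection tool: extract links to an account's published, favorited, liked, and album posts; extract post and user links from search results; collect Xiaohongshu post information; extract Xiaohongshu post download URLs; download Xiaohongshu post files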
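A Playwright-based MCP for automated Xiaohongshu search and commenting. It helps users log in to Xiaohongshu automatically, search for specific keywords, retrieve note content, and post smart comments.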
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers
Teams-first Multi-agent orchestration for Claude Code
Auto-register & manage accounts for ChatGPT, Cursor, Kiro, Grok, Windsurf, Trae & 13+ AI platforms · Protocol/browser dual-mode · Plugin-based · One-click Mac/Windows desktop app
A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress
Official code for "Read or Ignore? A Unified Benchmark for Typographic-Attack Robustness and Text Recognition in Vision-Language Models"