Skip to content
View fengxin-zhxx's full-sized avatar

Highlights

  • Pro

Block or report fengxin-zhxx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official Implementation of "Visual-ERM: Reward Modeling for Visual Equivalence"

Python 64 Updated Mar 23, 2026

[ICLR 2026] RIVER: A Real-Time Interaction Benchmark for Video LLMs

Python 10 Updated Apr 20, 2026

🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖

296 27 Updated Jun 2, 2026

A framework for few-shot evaluation of language models.

Python 13,003 3,353 Updated Jun 2, 2026

[CVPR 2026] Residual Decoder Adapter: ID-Preserving Tokenizer Adaption for Autoregressive Text Rendering

Python 6 Updated Jun 7, 2026

[ICLR 2026 Oral] Through the Lens of Contrast: Self-Improving Visual Reasoning in VLMs

19 Updated Apr 29, 2026

Official repo for "PAPO: Perception-Aware Policy Optimization for Multimodal Reasoning"

Python 146 11 Updated Feb 4, 2026
Python 20 1 Updated Jan 26, 2025

[ICSE 2026] Official implementation for "ADARULE: LLM-Driven Natural Language to LTL Conversion via Pattern-Adaptive Rule Induction"

Python 2 Updated Jan 3, 2026
Python 144 3 Updated Apr 27, 2026

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 877 46 Updated Dec 14, 2025

CC GUI 客户端(专为开发者打造的VibeCoding平台)

TypeScript 3,201 268 Updated Jun 17, 2026

Academic Research Skills for Claude Code: research → write → review → revise → finalize

Python 32,752 2,685 Updated Jun 18, 2026

Repository for "Data Selection for Fine-tuning Vision Language Models via Cross Modal Alignment Trajectories", ICML 2026

Python 2 1 Updated May 2, 2026

Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding

Python 72 3 Updated Apr 29, 2026

每日Arxiv VLA论文爬取

HTML 9 1 Updated Jun 18, 2026

Bridge local AI coding agents (Claude Code, Cursor, Gemini CLI, Codex) to messaging platforms (Feishu/Lark, DingTalk, Slack, Telegram, Discord, LINE, WeChat Work). Chat with your AI dev assistant f…

Go 12,678 1,194 Updated Jun 17, 2026

SenseNova-U series: Native Unified Paradigm with NEO-unify from the First Principles

Python 3,260 285 Updated Jun 15, 2026

[ACL '26 Main] CharTide: Data-Centric Chart-to-Code Generation via Tri-Perspective Tuning and Inquiry-Driven Evolution

7 Updated Apr 29, 2026

小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频 | 评论爬虫、微博帖子 | 评论爬虫、百度贴吧帖子 | 百度贴吧评论回复爬虫 | 知乎问答文章|评论爬虫

Python 51,411 10,759 Updated Jun 18, 2026

小红书(XiaoHongShu、RedNote)链接提取/作品采集工具:提取账号发布、收藏、点赞、专辑作品链接;提取搜索结果作品、用户链接;采集小红书作品信息;提取小红书作品下载地址;下载小红书作品文件

Python 11,606 1,739 Updated Jun 17, 2026

这是一个基于Playwright的小红书自动搜索和评论MCP,可以帮助用户自动登录小红书、搜索特定关键词、获取笔记内容以及发布智能评论。

Python 116 25 Updated Sep 20, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,910 2,480 Updated Jun 19, 2026

[NeurIPS 2025] Open-source Multi-agent Poster Generation from Papers

Python 3,799 278 Updated Jun 8, 2026

企业级 LLM 中转服务可用性监控系统,实时追踪服务状态并提供可视化仪表板。

Go 1,023 85 Updated Jun 15, 2026

Teams-first Multi-agent orchestration for Claude Code

TypeScript 36,620 3,323 Updated Jun 18, 2026

Auto-register & manage accounts for ChatGPT, Cursor, Kiro, Grok, Windsurf, Trae & 13+ AI platforms · Protocol/browser dual-mode · Plugin-based · One-click Mac/Windows desktop app

Python 2,767 945 Updated Jun 14, 2026

npx ccusage

Rust 16,342 669 Updated Jun 19, 2026

A Claude Code plugin that shows what's happening - context usage, active tools, running agents, and todo progress

JavaScript 25,425 1,157 Updated Jun 18, 2026

Official codes for "Read or Ignore? A Unified Benchmark for Typographic-Attack Robustness and Text Recognition in Vision-Language Models"

Python 4 Updated Jan 24, 2026
Next