
🦞 Just talk to your agent — it learns and EVOLVES 🧬.

Python 3,437 442 Updated Jun 7, 2026

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Python 840 65 Updated May 17, 2026

[ICML'26] Agent0 Series: Self-Evolving Agents from Zero Data

Python 1,220 144 Updated Feb 17, 2026

Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression"

Python 592 50 Updated Nov 4, 2025

Salesforce AI Research's open diffusion language model

Python 64 7 Updated Jun 2, 2026

Tongyi Deep Research, the Leading Open-source Deep Research Agent

Python 19,418 1,489 Updated Feb 27, 2026

Codebase for paper ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools

Python 30 Updated Nov 3, 2025

[TMLR 2026] Survey: https://arxiv.org/pdf/2507.20198

363 23 Updated May 29, 2026

[NeurIPS 2025] HoliTom: Holistic Token Merging for Fast Video Large Language Models

Python 81 Updated Oct 10, 2025

Official implementation of BLIP3o-Series

Python 1,658 79 Updated Nov 29, 2025

Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding

Jupyter Notebook 10 Updated Jun 2, 2026

VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models

Python 26 1 Updated Mar 26, 2025

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

Python 664 49 Updated Jan 11, 2026

Pretraining and inference code for a large-scale depth-recurrent language model

Python 894 79 Updated Dec 29, 2025

Triple Point Masking

Python 6 1 Updated Oct 15, 2024

This repository contains the code and released models for the paper Segmenting Text and Learning Their Rewards for Improved RLHF in Language Model, accepted at TMLR.

Python 19 Updated Jan 8, 2025

[CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models

Python 111 4 Updated Nov 22, 2025

A collection of resources on applications of multi-modal learning in medical imaging.

966 81 Updated Jun 4, 2026

Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines

Python 128 10 Updated Nov 6, 2024

A suite of image and video neural tokenizers

Jupyter Notebook 1,724 89 Updated Feb 11, 2025

Self-training LLaVA for medical

Python 16 1 Updated Nov 3, 2024

Efficient DiT architecture for text2any tasks, ICLR2025

446 22 Updated May 10, 2025

🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)

Python 123 9 Updated Mar 26, 2025

PyTorch extensions for high performance and large scale training.

Python 3,409 298 Updated Apr 26, 2025

(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis

Python 1,374 71 Updated Mar 5, 2025
Python 11 1 Updated Mar 25, 2024

A simple bash script for switching between installed versions of CUDA.

Shell 663 142 Updated Dec 19, 2018

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

Python 470 45 Updated Jan 18, 2023

LLM inference in C/C++

C++ 116,858 19,641 Updated Jun 16, 2026