-
Shenzhen University
- Shenzhen | Hongkong
-
06:53
(UTC +08:00) - https://panlinchao.github.io/
Highlights
- Pro
Starred repositories
Our survey's paper list on Agentic AI, continuously updated with the latest research.
DART-GUI: Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation
ScaleCUA is the open-sourced computer use agents that can operate on corss-platform environments (Windows, macOS, Ubuntu, Android).
Building a comprehensive and handy list of papers for GUI agents
[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Official PyTorch Implementation of MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection. Accepted by IJCAI 2025.
iCAN-SZU / MC3D-AD
Forked from jiayi-art/MC3D-ADOfficial PyTorch Implementation of MC3D-AD: A Unified Geometry-aware Reconstruction Model for Multi-category 3D Anomaly Detection. Accepted by IJCAI 2025.
The simplest, fastest repository for training/finetuning small-sized VLMs.
Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.
The development and future prospects of large multimodal reasoning models.
Reading list for research topics in multimodal machine learning
Machine Learning Engineering Open Book
Famous Vision Language Models and Their Architectures
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Latest Advances on System-2 Reasoning
Paper List of Inference/Test Time Scaling/Computing
The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors
This is a code repository for paper "Does Confusion Really Hurt Novel Class Discovery?".
Official code for paper "OpenCIL: Benchmarking Out-of-Distribution Detection in Class-Incremental Learning"
An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning
Awesome Incremental Learning