Nankai University & The Hong Kong Polytechnic University
- Hong Kong
- https://xiaohainku.github.io/
Stars
Pixio: a capable vision encoder dedicated to dense prediction, simply by pixel reconstruction
[CVPR 2025] Shift the Lens: Environment-Aware Unsupervised Camouflaged Object Detection
[ICCV 2025] Beyond Single Images: Retrieval Self-Augmented Unsupervised Camouflaged Object Detection
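Resumes generated using GitHub information
A repository curated by the MLNLP community to help everyone avoid small mistakes in paper submissions. Paper Writing Tips
Self-contained, minimalistic implementation of diffusion models with PyTorch.
"A Guide to Using Open-Source Large Models": a tutorial tailored for Chinese beginners on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLMs) and multimodal large language models (MLLMs) in a Linux environment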
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Awesome curated collection of images and prompts generated by gemini-2.5-flash-image (aka Nano Banana) state-of-the-art image generation and editing model. Explore AI generated visuals created with…
🚀🚀 "Large Models": Train a 26M-parameter GPT from scratch in just 2h!
An open-source implementation for fine-tuning the Qwen-VL series by Alibaba Cloud.
Paper list and datasets for industrial image anomaly/defect detection (continuously updated).
The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trained model checkpoints, and example notebooks that show how t…
Repository of the ICLR paper "AnyUp: Universal Feature Upsampling".
A list of awesome works for camouflage/concealed object detection (COD).
Official implementation of "Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers" (NeurIPS 2025)
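"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Available in Simplified Chinese, Traditional Chinese, English, and Japanese, with code implementations in Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart, and more
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)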
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)
[ICCV'25 oral] Official Code for "LoftUp: Learning a Coordinate-Based Feature Upsampler for Vision Foundation Models"
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance.
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024
Code for Scaling Language-Free Visual Representation Learning (WebSSL)
hehuapei / visitor-badge
Forked from jwenjian/visitor-badge
A badge generator service to count visitors of your markdown file. Fork from: jwenjian/visitor-badge
AcadHomepage: A Modern and Responsive Academic Personal Homepage