-
Beijing Jiaotong University
- Beijing
- https://scholar.google.com/citations?user=po1KXtwAAAAJ&hl=en
- https://orcid.org/0000-0002-3743-7738
Highlights
- Pro
Lists (24)
Sort Name ascending (A-Z)
Change Detection
ChatGPT
CLIP
Cloud Detection
Cloud Removal
Computer Vision
Consistency Models
Deepfake Detection
Diffusion
Face Generation
Face Recognition
Image Restoration
Linux
Machine Learning
Music Source Separation
Obejct Detection
Pansharpening
Point Cloud Completion
Reinforcement Learning
RLHF
Semantic Segmentation
Talking Face Generation
Toolbox
Video Prediction
Stars
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
✨ [ICLR'26] WithAnyone is capable of generating high-quality, controllable, and ID consistent images
[NeurIPS 2023] FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models
⛹️ Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
[SIGGRAPH Asia 2024 & IJCV 2025] Follow-Your-Emoji & Follow-Your-Emoji-Faster: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait An…
An open-source AI agent that brings the power of Gemini directly into your terminal.
JoyAI-Image is the unified multimodal foundation model for image understanding, text-to-image generation, and instruction-guided image editing.
Linux.do 论坛信任等级监控 Chrome 扩展,支持 Credit 积分查看与 5 种游戏主题风格
Zed 编辑器汉化版 / 中文版 / 多语言版 — AI 驱动全自动翻译构建,支持简繁中文/日语/韩语 | Zed Editor Localized with AI-powered translation pipeline
🚀 Transparent proxy injector for Antigravity. Force SOCKS5/HTTP proxy without TUN mode on Windows. | 专为 Antigravity 打造的免 TUN 强制代理工具,支持 DLL 注入与进程流量劫持。
[CVPR 2025] StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
MotionStream: Real-Time Video Generation with Interactive Motion Controls
基于 https://www.wenshushu.cn (文叔叔) 上传与下载文件的 Python 脚本
One sentence creates an AI-driven world — generate maps, characters, and watch stories emerge on their own. 一句话生成一个AI自主驱动的世界.
Official Implementation of Paper Transfer between Modalities with MetaQueries
A unified and fully open-source framework for instruction-guided and reference-guided video editing using natural language.
FastSESR: Fast Scene-level Explicit Surface Reconstruction
Self-evolving agent: grows skill tree from 3.3K-line seed, achieving full system control with 6x less token consumption
将博导十年科研经验炼化为可直接调用的 AI 技能。从 Idea 构思到论文投稿,你的 AI 科研副导师。
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Celeb-DF++: A Large-scale Challenging Video DeepFake Benchmark for Generalizable Forensics
Edit Banana: A framework for converting statistical formats into editable.
[TIP2023] Lightweight Spatial Boosting Network for Detecting Salient Objects in RGB-Thermal Images
[MICCAI 2024 Oral] The official code of "TinyU-Net: Lighter Yet Better U-Net with Cascaded Multi-receptive Fields"
This repository contains the official implementation of the research papers, "MobileCLIP" CVPR 2024 and "MobileCLIP2" TMLR August 2025
Code for "SAMNet: Stereoscopically Attentive Multi-scale Network for Lightweight Salient Object Detection" and "Lightweight Salient Object Detection via Hierarchical Visual Perception Learning"
Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"