YuxuanSnow

Highlights

  • Pro

Organizations

@tum-phoenix

253 stars written in Python

Python tool for converting files and office documents to Markdown.

Python 82,630 4,666 Updated Oct 20, 2025

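Provides a practical interaction interface for large language models such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons & function plugins, project analysis & self-explanation for Python, C++, and other projects, PDF/LaTeX paper translation & summarization, parallel queries to multiple LLM models, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…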

Python 69,572 8,390 Updated Sep 20, 2025

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…

Python 64,153 6,506 Updated Sep 19, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,969 7,494 Updated Nov 6, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 48,211 3,988 Updated Nov 6, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 47,985 3,927 Updated Nov 6, 2025

Generate high-definition short videos with one click using large AI models.

Python 47,564 6,643 Updated Jun 11, 2025

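Xiaohongshu note and comment crawler, Douyin video and comment crawler, Kuaishou video and comment crawler, Bilibili video and comment crawler, Weibo post and comment crawler, Baidu Tieba post and comment-reply crawler, Zhihu Q&A article and comment crawler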

Python 38,695 8,783 Updated Nov 6, 2025

Let us control diffusion models!

Python 33,260 2,978 Updated Feb 25, 2024

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,745 2,932 Updated Sep 2, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 23,908 2,659 Updated Aug 12, 2024

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,190 1,665 Updated Sep 24, 2025

Rembg is a tool to remove image backgrounds

Python 20,954 2,160 Updated Oct 25, 2025

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 19,325 2,724 Updated Oct 17, 2025

Let's make video diffusion practical!

Python 16,114 1,547 Updated Oct 16, 2025

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,332 2,547 Updated Jun 26, 2024

Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

Python 12,716 1,127 Updated Sep 3, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,363 1,523 Updated Apr 24, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 12,286 1,203 Updated Oct 28, 2025

Official implementation of AnimateDiff.

Python 11,845 1,021 Updated Jul 31, 2024

Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).

Python 10,917 993 Updated Nov 5, 2025

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,208 946 Updated Aug 12, 2024

Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX, CoreML!

Python 9,137 1,187 Updated Apr 2, 2024

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,328 808 Updated Oct 31, 2025

More relighting!

Python 8,277 520 Updated Feb 20, 2025

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 7,824 595 Updated Jul 17, 2024

Your image is almost there!

Python 7,650 442 Updated Jul 26, 2024

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,871 504 Updated May 31, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 6,713 710 Updated Sep 24, 2025

Infinite Photorealistic Worlds using Procedural Generation

Python 6,693 541 Updated Oct 18, 2025