-
Nankai University
- Tianjin
-
02:32
(UTC +08:00) - https://montaellis.github.io
Starred repositories
Pythonic AI generation of images and videos
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
tensorboard for pytorch (and chainer, mxnet, numpy, ...)
Minimal PyTorch implementation of YOLOv3
🔥 2D and 3D Face alignment library build using pytorch
Python for《Deep Learning》,该书为《深度学习》(花书) 数学推导、原理剖析与源码级别代码实现
Large World Model -- Modeling Text and Video with Millions Context
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
[CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
Official PyTorch implementation of StyleGAN3
Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
TripoSR: Fast 3D Object Reconstruction from a Single Image