-
Nankai University
- Tianjin
-
06:34
(UTC +08:00) - https://montaellis.github.io
Starred repositories
😘 让你“爱”上 GitHub,解决访问时图裂、加载慢的问题。(无需安装)
State-of-the-art 2D and 3D Face Analysis Project
Generative Models by Stability AI
Industry leading face manipulation platform
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Graph Neural Network Library for PyTorch
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
《Designing Data-Intensive Application》DDIA 第一版 / 第二版 中文翻译
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
WebUI extension for ControlNet
Janus-Series: Unified Multimodal Understanding and Generation Models
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Convert Machine Learning Code Between Frameworks
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
An open source implementation of CLIP.
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.