Lists (3)
Sort Name ascending (A-Z)
Stars
FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Image-to-Image Translation in PyTorch
Official inference repo for FLUX.1 models
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Fast and memory-efficient exact attention
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
WebUI extension for ControlNet
State-of-the-Art Text Embeddings
PyTorch implementations of Generative Adversarial Networks.
Train transformer language models with reinforcement learning.
Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).
verl: Volcano Engine Reinforcement Learning for LLMs
End-to-End Object Detection with Transformers
Wan: Open and Advanced Large-Scale Video Generative Models
Ongoing research training transformer models at scale
Question and Answer based on Anything.
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
✨ 一站式 LLM 聊天机器人平台及开发框架 ✨ 支持 QQ、QQ频道、Telegram、企微、飞书、钉钉 | 知识库、MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
An open source implementation of CLIP.
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.