Starred repositories
A latent text-to-image diffusion model
Examples and guides for using the OpenAI API
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
Google Research
A guidance language for controlling large language models.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
FinRL®: Financial Reinforcement Learning. 🔥
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 Originator of the Structured Prompt 📌 Initiator of the Meta-Prompt 📌 The most popular paradigm for putting prompts into practice | Language of GPT The pioneering framework for structured & meta-prompt…
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Inpaint anything using Segment Anything and inpainting models.
Using Low-rank adaptation to quickly fine-tune diffusion models.
A real-time approach for mapping all human pixels of 2D RGB images to a 3D surface-based model of the body
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
COCO API - Dataset @ http://cocodataset.org/
Reference models and tools for Cloud TPUs.
Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)
Segment Anything in High Quality [NeurIPS 2023]
Open-source and strong foundation image recognition models.