- HangZhou
Lists (8)
Sort Name ascending (A-Z)
Stars
Examples and guides for using the OpenAI API
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🔊 Text-Prompted Generative Audio Model
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
Anthropic's Interactive Prompt Engineering Tutorial
A simple screen parsing tool towards pure vision based GUI agent
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
A hands-on introduction to video technology: image, video, codec (av1, vp9, h265) and more (ffmpeg encoding). Translations: 🇺🇸 🇨🇳 🇯🇵 🇮🇹 🇰🇷 🇷🇺 🇧🇷 🇪🇸
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
stable diffusion webui colab
StableLM: Stability AI Language Models
LangGPT: Empowering everyone to become a prompt expert! 🚀 📌 结构化提示词(Structured Prompt)提出者 📌 元提示词(Meta-Prompt)发起者 📌 最流行的提示词落地范式 | Language of GPT The pioneering framework for structured & meta-prompt…
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
A small package to create visualizations of PyTorch execution graphs
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Kandinsky 2 — multilingual text2image latent diffusion model
智能视频多语言AI配音/翻译工具 - Linly-Dubbing — “AI赋能,语言无界”
A colab gradio web UI for running Large Language Models
The /llms.txt file, helping language models use your website
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Concept Sliders for Precise Control of Diffusion Models
A toolkit for speaker diarization.
Examples and guides for using the LLMs