Qwen-Image is a powerful image generation foundation model
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Strong, Economical, and Efficient Mixture-of-Experts Language Model
High-Resolution Image Synthesis with Latent Diffusion Models
Access to Anthropic's safety-first language model APIs
State-of-the-art TTS model under 25MB
Renderer for the harmony response format to be used with gpt-oss
Tool for exploring and debugging transformer model behaviors
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Provides convenient access to the Anthropic REST API from any Python 3
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Revolutionizing Database Interactions with Private LLM Technology
Pushing the Limits of Mathematical Reasoning in Open Language Models
Unified Multimodal Understanding and Generation Models
Repo for the Qwen2-Audio chat and pretrained large audio language models
High-Resolution Image Synthesis with Latent Diffusion Models
Extension index for stable-diffusion-webui
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Sharp Monocular Metric Depth in Less Than a Second
Example Discord bot written in Python that uses the completions API
Global weather forecasting model using graph neural networks and JAX
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
ChatGPT integration with Unity Editor