Official DeiT repository
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Official PyTorch Implementation of "Scalable Diffusion Models"
Dia-1.6B generates lifelike English dialogue and vocal expressions
Official implementation of DreamCraft3D
Text-to-image model optimized for artistic quality and safe generation
This repository contains the official implementation of FastVLM
This repository contains the official implementation of research
Reproduces results of "Fixing the train-test resolution discrepancy"
FlashMLA: Efficient Multi-head Latent Attention Kernels
A PyTorch library for implementing flow matching algorithms
GLIDE: a diffusion-based text-conditional image synthesis model
GLM-4 series: Open Multilingual Multimodal Chat LMs
Open Multilingual Multimodal Chat LMs
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Compact hybrid reasoning language model for intelligent responses
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Implementation of model parallel autoregressive transformers on GPUs
The official PyTorch implementation of Google's Gemma models
New set of lightweight state-of-the-art, open foundation models
A Family of Open Foundation Models for Code Intelligence
Large-scale xAI model for local inference with SGLang, Grok-2.5
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Efficient 13B MoE language model with long context and reasoning modes
Tencent’s 36-language state-of-the-art translation model