Implementation of model parallel autoregressive transformers on GPUs
Global weather forecasting model using graph neural networks and JAX
A Unified Framework for Text-to-3D and Image-to-3D Generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Ling is a MoE LLM provided and open-sourced by InclusionAI
One-click local MCP server installation in desktop apps
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Implementation of "MobileCLIP" CVPR 2024
A series of math-specific large language models of our Qwen2 series
High-resolution models for human tasks
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Implementation of the Surya Foundation Model for Heliophysics
DeepMind model for tracking arbitrary points across videos & robotics
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
VMZ: Model Zoo for Video Modeling
FAIR Sequence Modeling Toolkit 2
Open-source, high-performance Mixture-of-Experts large language model
Powerful open source image generation model
Open-Source Financial Large Language Models!
Blazeface is a lightweight model that detects faces in images
A Conversational Speech Generation Model
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Detect faces in an image
A CNN model that predicts human joints from RGB images of a person