An AI-powered security review GitHub Action using Claude
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Let us control diffusion models
PyTorch code and models for the DINOv2 self-supervised learning
Learning Continuous Signed Distance Functions for Shape Representation
DeepSeek Coder: Let the Code Write Itself
DeepSeek LLM: Let there be answers
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
Dataset of GPT-2 outputs for research in detection, biases, and more
A Unified Framework for Text-to-3D and Image-to-3D Generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Ling is a MoE LLM provided and open-sourced by InclusionAI
One-click local MCP server installation in desktop apps
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Implementation of "MobileCLIP" CVPR 2024
Chat & pretrained large audio language model proposed by Alibaba Cloud
A series of math-specific large language models of our Qwen2 series
Qwen3-omni is a natively end-to-end, omni-modal LLM
DeepMind model for tracking arbitrary points across videos & robotics
VMZ: Model Zoo for Video Modeling
FAIR Sequence Modeling Toolkit 2
Open-source, high-performance Mixture-of-Experts large language model
Powerful open source image generation model
Open-Source Financial Large Language Models!
Blazeface is a lightweight model that detects faces in images