Reference implementation of the Transformer architecture optimized
Custom BLEURT model for evaluating text similarity using PyTorch
Inference code for scalable emulation of protein equilibrium ensembles
ClinicalBERT model trained on MIMIC notes for clinical NLP tasks
CLIP, Predict the most relevant text snippet given an image
CLIP ViT-bigG/14: Zero-shot image-text model trained on LAION-2B
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
The ChatGPT Retrieval Plugin lets you easily find personal documents
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
The Clay Foundation Model - An open source AI model and interface
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CodeGeeX2: A More Powerful Multilingual Code Generation Model
GPT4V-level open-source multi-modal model based on Llama3-8B
Official repo for consistency models
Code release for ConvNeXt V2 model
Clean and efficient FP8 GEMM kernels with fine-grained scaling
DeepSeek LLM: Let there be answers
Towards Ultimate Expert Specialization in Mixture-of-Experts Language
Advancing Formal Mathematical Reasoning via Reinforcement Learning
Towards Real-World Vision-Language Understanding
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
685B model with improved agents and consistency
Official DeiT repository
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Sharp Monocular Metric Depth in Less Than a Second