Tencent’s 36-language state-of-the-art translation model
Multimodal-Driven Architecture for Customized Video Generation
Diffusion Transformer with Fine-Grained Chinese Understanding
Multimodal Diffusion with Representation Alignment
A Customizable Image-to-Video Model based on HunyuanVideo
Release for Improved Denoising Diffusion Probabilistic Models
Code for reproducing key results in the paper
Personalize Any Characters with a Scalable Diffusion Transformer
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Kimi K2: 1T-param MoE model for advanced coding and agentic reasoning
llama.go is like llama.cpp in pure Golang
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
Open-source pre-training implementation of Google's LaMDA in PyTorch
Language modeling in a sentence representation space
Llama 3.2–1B: Multilingual, instruction-tuned model for mobile AI
Instruction-tuned 1.2B LLM for multilingual text generation by Meta
PyTorch implementation of MAE
A library for Multilingual Unsupervised or Supervised word Embeddings
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Code release for "Masked-attention Mask Transformer
Per-Pixel Classification is Not All You Need for Semantic Segmentation
JetBrains’ 4B parameter code model for completions
code for Mesh R-CNN, ICCV 2019
ICLR2024 Spotlight: curation/training code, metadata, distribution
Repo for external large-scale work