-
DeepSeek AI
- Shanghai, China
- https://scholar.google.com/citations?user=A6K6bkoAAAAJ&hl=en
- @DoubilitySteven
- in/wen-liu-89644a1b7
Stars
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training
Janus-Series: Unified Multimodal Understanding and Generation Models
Implementation of Recurrent Interface Network (RIN), for highly efficient generation of images and video without cascading networks, in Pytorch
Code for "Don’t drop your samples! Coherence-aware training benefits Conditional diffusion" CVPR 2024 Highlight
Code for PointInfinity: Resolution-Invariant Point Diffusion Models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Minimal implementation of scalable rectified flow transformers, based on SD3's approach
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
[SIGGRAPH Asia'24 & TOG] Gaussian Opacity Fields: Efficient Adaptive Surface Reconstruction in Unbounded Scenes
DeepSeek-VL: Towards Real-World Vision-Language Understanding
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
A Pytorch Implementation of Finite Scalar Quantization
AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
This is the implementation of our paper in ECCV 2020.
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
A framework to enable multimodal models to operate a computer.