- Peking University
- Shenzhen
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
A latent text-to-image diffusion model
Neural Networks: Zero to Hero
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Dive into LLMs: a series of hands-on programming tutorials
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
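AISystem covers AI systems, including full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks.
[🔥updating ...] AI automated quantitative trading bot (fully local deployment). AI-powered Quantitative Investment Research Platform. 📃 online docs: https://ufund-me.github.io/Qbot ✨ :news: qbot-mini: https://github.com/Charmve/iQuant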
llama3 implementation one matrix multiplication at a time
PyTorch code and models for the DINOv2 self-supervised learning method.
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving a 3x+ generation speedup on reasoning tasks
A series of large language models trained from scratch by developers @01-ai
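AIInfra (AI infrastructure) refers to the AI system stack, from underlying hardware such as chips up to the software stack that supports training and inference of large AI models.
🎓 Easily create a beautiful academic résumé or educational website using Hugo and GitHub. No code.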
Acceptance rates for the major AI conferences
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
A simplified implementation of Faster R-CNN that replicates the performance of the original paper
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
OneDiff: An out-of-the-box acceleration library for diffusion models.
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
NVIDIA curated collection of educational resources related to general purpose GPU programming.
This repo contains the code for a 1D tokenizer and generator
Frontier Multimodal Foundation Models for Image and Video Understanding
Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for LangChain integration to load a local knowledge base for retrieval-augmented generation (RAG).