Stars
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Mosaico - The data platform for Physical AI
from vibe coding to agentic engineering - practice makes claude perfect
Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
Python library for building and running distributed data pipelines using Ray
1000UserGuide:对独立开发者和创业者来说,找到前1000个早期用户太关键了。这里精心整理了300多个国内外渠道,适合独立开发者和创业者推广产品的渠道。
The open-source CapCut alternative
入门资料整理:1.多因子股票量化框架开源教程 2.学界和业界的经典资料收录 3.AI + 金融的相关工作,包括LLM, Agent, benchmark(evaluation), etc.
FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.
[SIGMOD 2026] F3: The Open-Source Data File Format for the Future
An Open-Source Asynchronous Coding Agent
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
Lightweight coding agent that runs in your terminal
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
Robust Speech Recognition via Large-Scale Weak Supervision
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
A PyTorch native platform for training generative AI models
Coding a Multimodal (Vision) Language Model from scratch in PyTorch with full explanation: https://www.youtube.com/watch?v=vAmKB7iPkWw
Scalable data pre processing and curation toolkit for LLMs
This repository is responsible for the LLVM-related parts of Jeandle.
Jeandle is a Just-in-Time compiler for Java. It is built on OpenJDK and leverages the LLVM compiler infrastructure to generate machine code, aiming to provide powerful compilation optimizations and…
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step