Highlights
- Pro
Stars
- All languages
- APL
- Assembly
- Batchfile
- Boogie
- C
- C#
- C++
- CMake
- CSS
- Clojure
- Coq
- Cuda
- Dockerfile
- Emacs Lisp
- F#
- Fortran
- Go
- HTML
- Haskell
- Java
- JavaScript
- JetBrains MPS
- Julia
- Jupyter Notebook
- Kotlin
- Lean
- Lua
- MLIR
- Markdown
- Nim
- OCaml
- PHP
- Python
- Racket
- Rocq Prover
- Ruby
- Rust
- SMT
- Scala
- Shell
- Slash
- Solidity
- Swift
- SystemVerilog
- TeX
- TypeScript
- Verilog
- Vim Script
- Vue
- WebAssembly
- Zig
Machine Learning Engineering Open Book
The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
Formal verification tool for Rust: check 100% of execution cases of your programs to make safer applications.
Our library for RL environments + evals
Build your own visual reasoning model
A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.
My learning notes for ML SYS.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
FlashMLA: Efficient Multi-head Latent Attention Kernels
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…
Tutorial on large language models for genomics
🚀 Efficient implementations for emerging model architectures
A 7B parameter model for mathematical reasoning
Learning Universal Predictors
A series of technical report on Slow Thinking with LLM
Visualize the intermediate output of Mistral 7B
Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.
Python program to generate anki cards from obsidian markdown notes
Fight the forgetting curve by reviewing flashcards & entire notes on Obsidian
Script to add flashcards from text/markdown files to Anki