xu3kev

Organizations

@cs4789-s21

Python 4,524 491 Updated Apr 22, 2026

Record for work at Google DeepMind

HTML 482 33 Updated Dec 29, 2025

Machine Learning Engineering Open Book

Python 18,129 1,151 Updated May 18, 2026

The official implementation of "Self-play LLM Theorem Provers with Iterative Conjecturing and Proving"

Python 121 10 Updated Mar 28, 2025

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Python 242 22 Updated Oct 14, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 623 56 Updated Oct 7, 2025

Formal verification tool for Rust: check 100% of execution cases of your programs to make safer applications.

Rocq Prover 1,129 45 Updated Jun 16, 2026

Our library for RL environments + evals

Python 4,199 561 Updated Jun 16, 2026

Build your own visual reasoning model

Jupyter Notebook 423 28 Updated Jan 13, 2026

A FlashAttention implementation for JAX with support for efficient document mask computation and context parallelism.

Python 167 9 Updated Nov 11, 2025

My learning notes for ML SYS.

Python 6,532 443 Updated Jun 8, 2026

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

Python 132,820 21,488 Updated Jun 16, 2026

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 12,705 1,062 Updated Apr 30, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,623 577 Updated Jun 16, 2026

Implement llama3 inference step by step: grasp the core concepts, master the derivation, and write the code.

Jupyter Notebook 629 52 Updated Feb 24, 2025

[ICLR2025 Oral] ChartMoE: Mixture of Diversely Aligned Expert Connector for Chart Understanding

Jupyter Notebook 100 9 Updated Apr 1, 2025

Personal Project: MPP-Qwen14B & MPP-Qwen-Next (Multimodal Pipeline Parallel based on Qwen-LM). Supports [video/image/multi-image] {sft/conversations}. Don't let poverty limit your imagination! Tr…

Jupyter Notebook 681 34 Updated Mar 10, 2025

Tutorial on large language models for genomics

280 55 Updated Jun 5, 2025

Python 10 5 Updated Nov 20, 2024

🚀 Efficient implementations for emerging model architectures

Python 5,224 556 Updated Jun 11, 2026

A 7B parameter model for mathematical reasoning

Python 44 5 Updated Jun 16, 2026

Learning Universal Predictors

Python 84 12 Updated Aug 1, 2024

A series of technical reports on slow thinking with LLMs

Python 765 41 Updated Aug 13, 2025

Visualize the intermediate output of Mistral 7B

Python 395 17 Updated Jan 22, 2025

Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.

Python 203 25 Updated Mar 7, 2025

Python program to generate anki cards from obsidian markdown notes

Python 36 5 Updated Oct 30, 2025

Fight the forgetting curve by reviewing flashcards & entire notes on Obsidian

TypeScript 2,423 281 Updated Jun 15, 2026

Script to add flashcards from text/markdown files to Anki

Python 2,002 195 Updated Jun 16, 2024