Skip to content
View penglin03's full-sized avatar

Highlights

  • Pro

Organizations

@wsu-db

Block or report penglin03

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Summaries and resources for Designing Machine Learning Systems book (Chip Huyen, O'Reilly 2022)

4,902 972 Updated Jun 9, 2026

Fit interpretable models. Explain blackbox machine learning.

C++ 6,878 784 Updated Jun 8, 2026

Distributed Compiler based on Triton for Parallel Systems

Python 1,457 150 Updated Apr 22, 2026

A lightweight sandboxing tool for enforcing filesystem and network restrictions on arbitrary processes at the OS level, without requiring a container.

TypeScript 4,388 320 Updated Jun 11, 2026

Framework for creating high fidelity and complex RL environments and evaluation tasks

Python 254 32 Updated Jun 11, 2026

A protocol for connecting any editor to any agent

Rust 3,372 271 Updated Jun 11, 2026

An annotated implementation of the Transformer paper.

Jupyter Notebook 7,308 1,546 Updated Apr 7, 2024

Solve puzzles. Improve your pytorch.

Jupyter Notebook 4,109 374 Updated Jul 15, 2024

The open source coding agent.

TypeScript 173,257 20,841 Updated Jun 12, 2026

NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRaven-13B and baselines.

Python 323 24 Updated Sep 29, 2023

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,483 431 Updated Sep 13, 2024

Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…

TypeScript 2,875 3,454 Updated Apr 22, 2026

AlloyDB is a distributed SQL database.

Go 75 14 Updated Dec 23, 2022

Lakehouse native graph engine with git-style workflows

Rust 289 24 Updated Jun 11, 2026

C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.

C++ 1,461 142 Updated May 26, 2026

SWE-bench: Can Language Models Resolve Real-world Github Issues?

Python 5,144 892 Updated Apr 1, 2026

Collection of the system designs driven by LLMs

TypeScript 40 11 Updated Mar 18, 2026

A collection of 500+ real-world ML & LLM system design case studies from 100+ companies. Learn how top tech firms implement GenAI in production.

1,596 236 Updated Jun 21, 2025

AI system design guide for engineers building production AI systems and evals.

1,736 351 Updated Jun 10, 2026

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 20,635 2,005 Updated Jun 11, 2026

Scalable toolkit for efficient model reinforcement

Python 1,722 422 Updated Jun 12, 2026

Run Graph Queries with Lance

Rust 154 28 Updated Jun 6, 2026

Lance Namespace is an open specification for describing access and operations against a collection of tables in a multimodal lakehouse

Java 55 41 Updated Jun 11, 2026

Agent framework for the JVM. Pronounced Em-BAY-bel /ɛmˈbeɪbəl/

Kotlin 3,658 359 Updated Jun 9, 2026

A library to convert a pydantic model to a pyarrow schema

Python 53 8 Updated May 10, 2025

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,538 251 Updated Jun 12, 2026

A fast, feature-rich static code analyzer & language server for Python

Rust 2,865 43 Updated May 10, 2025

Python Type Checker / Language Server

Rust 1,082 35 Updated Jun 11, 2026

Apache Paimon Python The Python implementation of Apache Paimon.

Python 19 11 Updated May 15, 2026

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,624 706 Updated Jun 11, 2026
Next