Philip p208p2002

NLP Engineer and Full Stack Developer

72 followers · 49 following

Delta Research Center
Taiwan, Taipei
blog.philip-huang.tech

Starred repositories

xai-org / x-algorithm

Algorithm powering the For You feed on X

Rust 26,203 4,505 Updated May 15, 2026

vllm-project / guidellm

Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs

Python 1,284 171 Updated Jun 18, 2026

callowayproject / bump-my-version

A small command line tool to simplify releasing software by updating all version strings in your source code by the correct increment and optionally commit and tag the changes.

Python 615 39 Updated Jun 18, 2026

ROCm / rocm-examples

A collection of examples for the ROCm software stack

C++ 296 95 Updated Jun 18, 2026

fzyzcjy / torch_memory_saver

Allow torch tensor memory to be released and resumed later

Python 251 60 Updated May 16, 2026

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes for ML SYS.

Python 6,541 445 Updated Jun 18, 2026

Nardien / agent-distillation

Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"

Python 245 32 Updated Oct 22, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Python 5,820 1,062 Updated Jun 18, 2026

mbzuai-oryx / Awesome-LLM-Post-training

Awesome Reasoning LLM Tutorial/Survey/Guide

Python 2,450 164 Updated Apr 6, 2026

huggingface / Math-Verify

Python 1,156 55 Updated Jan 10, 2026

xlite-dev / LeetCUDA

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 11,283 1,160 Updated Jun 18, 2026

hongshanli23 / DSUC

Minimal example for DeepSpeed Universal Checkpoint

Python 1 Updated Sep 20, 2024

QwenLM / Self-Lengthen

Python 99 12 Updated Nov 6, 2024

huggingface / lighteval

Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends

Python 2,453 492 Updated Jun 9, 2026

namtuanly / WikiTableSet

WikiTableSet: the largest publicly available image-based table recognition dataset in three languages, built from Wikipedia

Python 32 2 Updated Jun 12, 2025

stas00 / ipyexperiments

Automatic GPU+CPU memory profiling, re-use and memory leaks detection using jupyter/ipython experiment containers

Jupyter Notebook 234 14 Updated Dec 15, 2023

ogx-ai / ogx

Open GenAI Stack

Python 8,412 1,315 Updated Jun 18, 2026

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,624 246 Updated Sep 10, 2025

SpursGoZmy / Awesome-Tabular-LLMs

We collect papers about "large language models (LLM) for table-related tasks", e.g., using an LLM for the Table QA task. A curated collection of "table + LLM" papers.

632 45 Updated Apr 9, 2026

princeton-nlp / ALCE

[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627

Python 518 51 Updated Oct 9, 2024

Seeed-Projects / RAG_based_on_Jetson

This project has implemented the RAG function on Jetson and supports TXT and PDF document formats. It uses MLC for 4-bit quantization of the Llama2-7b model, utilizes ChromaDB as the vector databas…

Python 12 1 Updated May 16, 2024

mosaicml / llm-foundry

LLM training code for Databricks foundation models

Python 4,410 589 Updated Mar 25, 2026

emilk / egui

egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native

Rust 29,433 2,051 Updated Jun 18, 2026

klausmeyer / docker-registry-browser

🐳 Web Interface for the Docker Registry HTTP API V2 written in Ruby on Rails.

Ruby 693 62 Updated Jun 17, 2026

Joxit / docker-registry-ui

The simplest and most complete UI for your private docker registry v2 and v3

Riot 3,469 361 Updated Jan 19, 2026

XiongjieDai / GPU-Benchmarks-on-LLM-Inference

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 1,922 75 Updated May 13, 2024

wwxu21 / CUT

Source code of "Reasons to Reject? Aligning Language Models with Judgments"

Python 58 5 Updated Feb 29, 2024

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 3,283 245 Updated Jun 15, 2026