-
Georgia Institute of Technology
- Atlanta, GA
- stefanheng.github.io
- in/stefan-heng-41690716b
- @yuzhao_heng
Highlights
- Pro
Stars
An extremely fast Python package and project manager, written in Rust.
Rich is a Python library for rich text and beautiful formatting in the terminal.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
A playbook for systematically maximizing the performance of deep learning models.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Python composable command line interface toolkit
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
Minimal reproduction of DeepSeek R1-Zero
A collection of GPT system prompts and various prompt injection/leaking knowledge.
OpenChat: Advancing Open-source Language Models with Imperfect Data
A Ruby gem that beautifies the terminal's ls command, with color and font-awesome icons. 🎉
Aligning pretrained language models with instruction data generated by themselves.
Machine Learning and Computer Vision Engineer - Technical Interview Questions
A Python implementation of John Gruber’s Markdown with Extension support.
A Python module for creating Excel XLSX files.
Simple cross-platform colored terminal text in Python
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
800,000 step-level correctness labels on LLM solutions to MATH problems
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Yet another alternative curriculum vitae/résumé class with LaTeX
PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437