Organizations

@apache @JetBrains @opensourcedesign @SeoulTech @JetBrains-Research @mloncode @go-enry

Starred repositories

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 3,805 273 Updated Feb 17, 2026

AI-powered terminal assistant for LaTeX academic papers — verifies, fixes, and polishes your paper for conference submission with reviewable diff patches and checkpoint safety.

Python 16 1 Updated Feb 14, 2026

Tool for generating high-quality synthetic datasets

Python 1,505 212 Updated Oct 28, 2025

A command-line tool that draws plots on the terminal.

Ruby 4,685 64 Updated Jan 18, 2026

Data generation and training repository for SERA: Soft-Verified Efficient Repository Agents.

Python 117 15 Updated Feb 2, 2026

Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.

Python 839 67 Updated Dec 26, 2025

The absolute trainer to light up AI agents.

Python 14,792 1,259 Updated Feb 11, 2026

[COLM 2025] "D3: A Dataset for Training Code LMs to Act Diff-by-Diff"

Python 3 1 Updated Oct 8, 2025

A Claude Code plugin to recover conversation context when Claude loses track

Python 16 Updated Jan 30, 2026

Memorization empirical study validating LoRA-without-regret - Thinking Machines Featured Project

Python 1 Updated Dec 31, 2025

Replication of Open Character Training: fine-tuning LLMs to embody consistent character personas using constitutional methods

Python 1 Updated Jan 1, 2026

Constitutional AI from Base Models - Tinker Featured Project

Python 1 Updated Dec 31, 2025

On-Policy Context Distillation: Comparing distillation methods for knowledge transfer. Teacher-seeded GKD achieves 58-71% GSM8K accuracy while hybrid approaches collapse.

Python 1 Updated Jan 1, 2026

Ideas for projects related to Tinker

168 9 Updated Nov 6, 2025

An interface library for RL post training with environments.

Python 1,153 177 Updated Feb 16, 2026

Google's URL parsing library

C++ 32 17 Updated Feb 2, 2015

WHATWG-compliant and fast URL parser written in modern C++, part of Internet Archive, Node.js, Clickhouse, Redpanda, Kong, Telegram, Adguard, Datadog and Cloudflare Workers.

C++ 1,692 121 Updated Feb 6, 2026

Simple high-throughput inference library

Python 155 10 Updated May 14, 2025

🤗 Benchmark Large Language Models Reliably On Your Data

HTML 431 39 Updated Dec 30, 2025

This is the official code for the paper: Robust Utility-Preserving Text Anonymization Based on Large Language Models

Python 10 2 Updated Jul 7, 2025

CursorCore: Assist Programming through Aligning Anything

Python 133 13 Updated Feb 14, 2025

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,197 452 Updated Feb 16, 2026
TypeScript 358 26 Updated Dec 1, 2025

A repository to unravel the language of GPUs, making their kernel conversations easy to understand

Python 202 8 Updated Jun 1, 2025

[ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)

Python 19 1 Updated Feb 11, 2025

[ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.

Python 29 3 Updated Apr 21, 2025

Diffusion on syntax trees for program synthesis

Python 482 30 Updated Jun 27, 2024

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,965 288 Updated May 15, 2025