codekk
  • embryonic
  • Earth

Starred repositories

A reinforcement learning agent built from scratch in C++, trained on the cart-pole environment.

C++ 4 1 Updated Aug 25, 2025

Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…

Python 679 49 Updated Mar 28, 2024

End-to-End Local-First Text-to-SQL Pipelines

Python 426 40 Updated Feb 14, 2025

Synthetic Dialog Generation and Analysis with LLMs

Python 118 23 Updated Dec 15, 2025

Official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)

Python 115 8 Updated Jul 31, 2024

A framework for optimizing DSPy programs with RL

Python 302 26 Updated Nov 18, 2025

Share evaluation outcomes and implementation tips for using open safety models in Trust & Safety workflows

Python 77 11 Updated Dec 16, 2025

Scalable toolkit for efficient model reinforcement

Python 1,185 203 Updated Dec 28, 2025

OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.

Python 333 47 Updated Nov 14, 2025

An interface library for RL post training with environments.

Python 870 138 Updated Dec 27, 2025

Microsoft Agent Framework

Python 11 2 Updated Dec 21, 2025

Open-source repository for Semantic Modeling Language (SML)

122 14 Updated Dec 3, 2025

Simplifying reinforcement learning for complex game environments

C 4,663 350 Updated Dec 27, 2025

An RL env with procedurally generated symbolic reasoning data

Python 29 2 Updated Oct 23, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,350 12,245 Updated Dec 28, 2025

Contexts Optical Compression

Python 21,647 1,940 Updated Oct 25, 2025

A framework for few-shot evaluation of language models.

Python 11,048 2,927 Updated Dec 23, 2025

Official code release for "SuperBPE: Space Travel for Language Models"

Jupyter Notebook 77 9 Updated Nov 18, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 340 34 Updated Dec 23, 2025

The best ChatGPT that $100 can buy.

Python 39,381 5,007 Updated Dec 28, 2025

An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questions in plain English, QueryWeaver handles the weaving.

TypeScript 284 27 Updated Dec 24, 2025

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,745 463 Updated Oct 14, 2025

Multi-Agent System: Data Analysis → Optimization → Business Insights

Python 3 1 Updated Sep 27, 2025
Python 1,486 158 Updated Nov 15, 2025

Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 3,728 313 Updated Nov 13, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,788 100 Updated Mar 18, 2025
Python 2,235 195 Updated Nov 29, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 81,861 12,260 Updated Dec 27, 2025