codekk

Krishna M codekk

Infinity Parser AI Engineer. Product Builder

48 followers · 544 following

embryonic
Earth

Achievements

Starred repositories

RobinLmn / cart-pole-rl

A reinforcement learning agent built from scratch in C++, trained on the cart-pole environment.

C++ 4 1 Updated Aug 25, 2025

google-deepmind / uncertain_ground_truth

Dermatology ddx dataset, Jax implementations of Monte Carlo conformal prediction, plausibility regions and statistical annotation aggregation from our recent work on uncertain ground truth (TMLR'23…

Python 679 49 Updated Mar 28, 2024

ulab-uiuc / Multi-agent-evolve

Python 128 4 Updated Dec 2, 2025

premAI-io / premsql

End-to-End Local-First Text-to-SQL Pipelines

Python 426 40 Updated Feb 14, 2025

idiap / sdialog

Synthetic Dialog Generation and Analysis with LLMs

Python 118 23 Updated Dec 15, 2025

nakamotoo / Cal-QL

official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)

Python 115 8 Updated Jul 31, 2024

Ziems / arbor

A framework for optimizing DSPy programs with RL

Python 302 26 Updated Nov 18, 2025

roostorg / model-community

Share evaluation outcomes and implementation tips for using open safety models in Trust & Safety workflows

Python 77 11 Updated Dec 16, 2025

NVIDIA-NeMo / RL

Scalable toolkit for efficient model reinforcement

Python 1,185 203 Updated Dec 28, 2025

sci-m-wang / OpenCE

OpenCE (Open Context Engineering): A community toolkit to implement, evaluate, and combine LLM context strategies (RAG, ACE, Compression). Evolved from the `ACE-open` reproduction.

Python 333 47 Updated Nov 14, 2025

meta-pytorch / OpenEnv

An interface library for RL post training with environments.

Python 870 138 Updated Dec 27, 2025

steveh250 / MAF-RFP-Factory

Microsoft Agent Framework

Python 11 2 Updated Dec 21, 2025

semanticdatalayer / SML

Open-source repository for Semantic Modeling Language (SML)

122 14 Updated Dec 3, 2025

PufferAI / PufferLib

Simplifying reinforcement learning for complex game environments

C 4,663 350 Updated Dec 27, 2025

sileod / reasoning_core

A RL env with procedurally generated symbolic reasoning data

Python 29 2 Updated Oct 23, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,350 12,245 Updated Dec 28, 2025

aakaran / reasoning-with-sampling

Python 362 47 Updated Nov 7, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,647 1,940 Updated Oct 25, 2025

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 11,048 2,927 Updated Dec 23, 2025

PythonNut / superbpe

Official code release for "SuperBPE: Space Travel for Language Models"

Jupyter Notebook 77 9 Updated Nov 18, 2025

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 340 34 Updated Dec 23, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,381 5,007 Updated Dec 28, 2025

FalkorDB / QueryWeaver

An open-source Text2SQL tool that transforms natural language into SQL using graph-powered schema understanding. Ask your database questions in plain English, QueryWeaver handles the weaving.

TypeScript 284 27 Updated Dec 24, 2025