kmj1122

Follow

MJ Kwon kmj1122

Follow

PhD student in Computer Science at the University of Virginia.

1 follower · 5 following

Highlights

Pro

Stars

yardenas / actsafe

Scaling safe exploration to vision control

Python 15 3 Updated Feb 19, 2025

ApocalypseX / COSTA

Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"

Python 9 Updated Apr 19, 2024

gaoyinfeng / PIWM

(T-IV) Dream to Drive with Predictive Individual World Model

Python 39 2 Updated Aug 8, 2025

visioncortex / vtracer

Raster to Vector Graphics Converter

Rust 5,156 351 Updated Oct 17, 2025

UT-Austin-RobIn / lang4sim2real

Code for Data Collection & Training in Sim+Real Envs: [RSS 2024] Natural Language Can Help Bridge the Sim2Real Gap

Python 11 Updated Oct 25, 2025

google / adk-python

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 16,547 2,643 Updated Dec 19, 2025

superagent-ai / grok-cli

An open-source AI agent that brings the power of Grok directly into your terminal.

TypeScript 2,161 286 Updated Nov 27, 2025

MizuhoAOKI / jax_generative_models

Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.

Python 61 3 Updated Dec 19, 2025

nightly / quantitative-reward-monitoring

Code for our AAAI 26 paper: "Expressive Temporal Specifications for Reward Monitoring"

Jupyter Notebook 2 Updated Nov 28, 2025

whitemech / logaut

LOGics formalisms to AUTomata

Python 11 2 Updated Sep 8, 2023

PKU-Alignment / SafeDreamer

ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models

Python 86 4 Updated Apr 8, 2024

FengMingquan-sjtu / Logic_Point_Processes_ICLR

Python 10 4 Updated Mar 8, 2022

Chengzhi-Cao / LGR

Python 1 Updated Jun 27, 2024

chankyo123 / reductive-lie-neuron

Official PyTorch implementation of 'Reductive Lie Neurons'

Python 7 Updated Nov 12, 2025

karpathy / llm-council

LLM Council works together to answer your hardest questions

Python 11,632 2,145 Updated Nov 22, 2025

rbalestr-lab / lejepa

Python 749 64 Updated Dec 9, 2025

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 1,445 145 Updated Dec 19, 2025

lasgroup / SafetyPolytope

Learning Safety Constraints for Large Language Models (ICML2025)

Python 25 4 Updated Aug 4, 2025

IDSIA / hhmarl_2D

Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat

Python 151 19 Updated Apr 3, 2025

ShangtongZhang / rl-theory-in-lean

Towards Formalizing RL Theory

Lean 39 Updated Nov 6, 2025

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,568 128 Updated Nov 24, 2025

AndrejOrsula / space_robotics_bench

Robot Learning Beyond Earth

Python 98 12 Updated Dec 1, 2025

Pushp-Kharat1 / PkBoost

PKBoost: Adaptive GBDT for Concept Drift, Built from scratch in Rust, PKBoost manages changing data distributions in fraud detection with a fraud rate of 0.2%. It shows less than 2% degradation und…

Rust 58 2 Updated Dec 16, 2025

tkkim-robot / online_adaptive_cbf

Implementation of the Online Adaptive CBF for safety-critical navigation for input constrained systems.

Python 51 5 Updated Dec 12, 2025

harvard-edge / cs249r_book

Introduction to Machine Learning Systems

JavaScript 11,012 1,234 Updated Dec 18, 2025

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,217 985 Updated Jul 1, 2024

nico-bohlinger / one_policy_to_run_them_all

Python 206 21 Updated May 15, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 38,886 4,909 Updated Dec 9, 2025

aCodeDog / SafeHumanoidsPolicy

End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy

Python 12 2 Updated Oct 23, 2025

Zhefan-Xu / NavRL

[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)

C++ 1,206 127 Updated Jul 3, 2025