Skip to content
View kmj1122's full-sized avatar

Highlights

  • Pro

Block or report kmj1122

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Scaling safe exploration to vision control

Python 15 3 Updated Feb 19, 2025

Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"

Python 9 Updated Apr 19, 2024

(T-IV) Dream to Drive with Predictive Individual World Model

Python 39 2 Updated Aug 8, 2025

Raster to Vector Graphics Converter

Rust 5,156 351 Updated Oct 17, 2025

Code for Data Collection & Training in Sim+Real Envs: [RSS 2024] Natural Language Can Help Bridge the Sim2Real Gap

Python 11 Updated Oct 25, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 16,547 2,643 Updated Dec 19, 2025

An open-source AI agent that brings the power of Grok directly into your terminal.

TypeScript 2,161 286 Updated Nov 27, 2025

Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.

Python 61 3 Updated Dec 19, 2025

Code for our AAAI 26 paper: "Expressive Temporal Specifications for Reward Monitoring"

Jupyter Notebook 2 Updated Nov 28, 2025

LOGics formalisms to AUTomata

Python 11 2 Updated Sep 8, 2023

ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models

Python 86 4 Updated Apr 8, 2024
Python 1 Updated Jun 27, 2024

Official PyTorch implementation of 'Reductive Lie Neurons'

Python 7 Updated Nov 12, 2025

LLM Council works together to answer your hardest questions

Python 11,632 2,145 Updated Nov 22, 2025
Python 749 64 Updated Dec 9, 2025

dLLM: Simple Diffusion Language Modeling

Python 1,445 145 Updated Dec 19, 2025

Learning Safety Constraints for Large Language Models (ICML2025)

Python 25 4 Updated Aug 4, 2025

Heterogeneous Hierarchical Multi Agent Reinforcement Learning for Air Combat

Python 151 19 Updated Apr 3, 2025

Towards Formalizing RL Theory

Lean 39 Updated Nov 6, 2025

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,568 128 Updated Nov 24, 2025

Robot Learning Beyond Earth

Python 98 12 Updated Dec 1, 2025

PKBoost: Adaptive GBDT for Concept Drift, Built from scratch in Rust, PKBoost manages changing data distributions in fraud detection with a fraud rate of 0.2%. It shows less than 2% degradation und…

Rust 58 2 Updated Dec 16, 2025

Implementation of the Online Adaptive CBF for safety-critical navigation for input constrained systems.

Python 51 5 Updated Dec 12, 2025

Introduction to Machine Learning Systems

JavaScript 11,012 1,234 Updated Dec 18, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,217 985 Updated Jul 1, 2024

The best ChatGPT that $100 can buy.

Python 38,886 4,909 Updated Dec 9, 2025

End-to-End Humanoid Robot Safe and Comfortable Locomotion Policy

Python 12 2 Updated Oct 23, 2025

[IEEE RA-L'25] NavRL: Learning Safe Flight in Dynamic Environments (NVIDIA Isaac/Python/ROS1/ROS2)

C++ 1,206 127 Updated Jul 3, 2025
Next