Skip to content
View kmj1122's full-sized avatar

Highlights

  • Pro

Block or report kmj1122

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PaperBanana: Automating Academic Illustration For AI Scientists

JavaScript 999 44 Updated Feb 2, 2026

AirLLM 70B inference with single 4GB GPU

Jupyter Notebook 10,689 972 Updated Sep 3, 2025

[ICLR 2025] Robust Gymnasium: A Unified Modular Benchmark for Robust Reinforcement Learning.

Python 83 6 Updated Aug 11, 2025

JMP is a Mixed Precision library for JAX.

Python 211 16 Updated Jan 30, 2025

Develop your agent for generals.io!

Python 73 7 Updated Feb 2, 2026

The interface between probabilistic model checking and data-driven policy learning.

Python 13 2 Updated Feb 3, 2026

A Python implementation of ProVe. ProVe is a formal verifier for safety property for Artificial Neural Networks, presented at the 37th conference on Uncertainty in Artificial Intelligence.

Python 7 2 Updated Jun 4, 2021

RL environments and tools for spacecraft autonomy research, built on Basilisk. Developed by the AVS Lab.

Python 98 11 Updated Feb 3, 2026

Github repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"

Python 25 2 Updated Dec 21, 2025

Differentiable convex optimization layers

Python 2,047 185 Updated Feb 4, 2026

LLM inference in C/C++

C++ 94,365 14,758 Updated Feb 4, 2026

Real-time webcam demo with SmolVLM and llama.cpp server

HTML 5,510 893 Updated May 12, 2025

Scaling safe exploration to vision control

Python 14 3 Updated Feb 19, 2025

Code for AAMAS 2024 "Cost-aware Offline Safe Meta Reinforcement Learning with Robust In-Distribution Online Task Adaptation"

Python 9 Updated Apr 19, 2024

(T-IV) Dream to Drive with Predictive Individual World Model

Python 42 3 Updated Aug 8, 2025

Raster to Vector Graphics Converter

Rust 5,369 365 Updated Feb 4, 2026

Code for Data Collection & Training in Sim+Real Envs: [RSS 2024] Natural Language Can Help Bridge the Sim2Real Gap

Python 12 Updated Oct 25, 2025

An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.

Python 17,471 2,839 Updated Feb 4, 2026

An open-source AI agent that brings the power of Grok directly into your terminal.

TypeScript 2,311 293 Updated Jan 10, 2026

Minimal JAX implementation unifying Diffusion and Flow Matching algorithms as alternative strategies for transporting data distributions.

Python 63 3 Updated Dec 19, 2025

Code for our AAAI 26 paper: "Expressive Temporal Specifications for Reward Monitoring"

Jupyter Notebook 2 Updated Nov 28, 2025

LOGics formalisms to AUTomata

Python 11 2 Updated Sep 8, 2023

ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models

Python 91 5 Updated Apr 8, 2024
Python 1 Updated Jun 27, 2024

Official PyTorch implementation of 'Reductive Lie Neurons'

Python 7 Updated Nov 12, 2025

LLM Council works together to answer your hardest questions

Python 14,156 2,855 Updated Nov 22, 2025
Python 861 78 Updated Jan 25, 2026

dLLM: Simple Diffusion Language Modeling

Python 1,706 169 Updated Jan 6, 2026

Learning Safety Constraints for Large Language Models (ICML2025)

Python 28 5 Updated Aug 4, 2025
Next