View Kiv's full-sized avatar


An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,652 970 Updated Jun 17, 2026

Aidan Bench attempts to measure <big_model_smell> in LLMs.

Python 318 15 Updated Jun 26, 2025

Data related to mind uploading project via prompt

HTML 29 5 Updated May 21, 2026

A TypeScript-based MCP server that implements a simple loom and makes it available for Claude to use.

JavaScript 23 3 Updated Feb 17, 2026

Machine Learning Engineering Open Book

Python 18,130 1,152 Updated May 18, 2026

Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.

Python 47,325 5,977 Updated Jun 2, 2026

David Attenborough narrates your life

Python 4,423 540 Updated Feb 23, 2026

Multiple NVIDIA GPUs or Apple Silicon for Large Language Model Inference?

Jupyter Notebook 1,923 75 Updated May 13, 2024

Code for the paper "Fishing for Magikarp"

Python 191 16 Updated Jun 17, 2026

AICI: Prompts as (Wasm) Programs

Rust 2,075 84 Updated Jan 22, 2025

PyTorch native post-training library

Python 5,775 730 Updated Jun 16, 2026

Blazingly 🔥 fast 🚀 memory vulnerabilities, written in 100% safe Rust. 🦀

Rust 5,396 114 Updated Sep 26, 2025

NVIDIA Linux open GPU with P2P support

C 1,378 142 Updated Jun 6, 2025

Lightweight, standalone C++ inference engine for Google's Gemma models.

C++ 6,960 645 Updated Jun 17, 2026

Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.

Python 66,696 5,984 Updated Jun 17, 2026

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 18,515 1,508 Updated May 24, 2026

Extremely simple implementation of path patching (aka causal scrubbing) in PyTorch.

Jupyter Notebook 5 Updated Oct 5, 2023

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 886 108 Updated Mar 6, 2026

If tinygrad wasn't small enough for you...

Python 811 100 Updated Mar 9, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,402 89 Updated Dec 3, 2024

Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.

HTML 244 90 Updated Aug 11, 2025
Python 8 3 Updated Nov 15, 2022

Named tensors with first-class dimensions for PyTorch

Jupyter Notebook 332 11 Updated Jun 14, 2023

Must-read Papers on Textual Adversarial Attack and Defense

Python 1,574 194 Updated Jun 4, 2025

The agent engineering platform.

Python 139,557 23,126 Updated Jun 17, 2026

A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick

Python 295 26 Updated Nov 25, 2023

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,497 776 Updated Jun 15, 2026