khazic

Organizations

@OpenRLHF

Starred repositories

Python 255 10 Updated Apr 17, 2026

🚀 PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support

Python 616 187 Updated Jun 23, 2026
Python 1,030 98 Updated May 13, 2026

A construction kit for reinforcement learning environment management.

Python 458 67 Updated Jun 23, 2026

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,401 754 Updated Jun 23, 2026

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 380,123 79,591 Updated Jun 23, 2026

OpenClaw China plugin: supports Feishu, DingTalk, QQ, WeCom, and WeChat

TypeScript 3,951 345 Updated Jun 12, 2026

Provides multiple Shadowrocket rule sets with strong ad filtering. Rules are rebuilt daily at 8:00.

27,820 1,861 Updated Jun 22, 2026

Collection of evals for Inspect AI

Python 552 358 Updated Jun 23, 2026

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 127,403 9,261 Updated Jun 23, 2026

A set of examples based on verl for end-to-end RL training recipes.

Python 299 139 Updated Jun 23, 2026

A visualization tool for deeper understanding and easier debugging of RLHF training.

Python 295 9 Updated Feb 20, 2025

Nano vLLM

Python 14,156 2,247 Updated Apr 26, 2026

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 6,534 611 Updated Jun 23, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 83,652 18,366 Updated Jun 23, 2026

📊 A minimalist, self-hosted WakaTime-compatible backend for coding statistics

Go 4,352 293 Updated Jun 22, 2026

⏰ Agentically track worldwide conference deadlines (website, Python CLI, WeChat applet)

Rust 9,107 608 Updated Jun 23, 2026

Template and style files for ICLR

TeX 280 80 Updated Aug 21, 2025

[EMNLP2025] Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling

Python 17 1 Updated Nov 20, 2025

Google Research

Jupyter Notebook 38,220 8,442 Updated Jun 23, 2026

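Chat Overleaf Google Chrome extension: supports custom LLM providers, offers Q&A on highlighted text inside the Overleaf editor, and can pull in file contents as the current context.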

TypeScript 29 4 Updated Feb 9, 2026

Scalable toolkit for efficient model reinforcement

Python 1,753 433 Updated Jun 23, 2026

AI-native HTAP database with Git-for-Data and built-in vector search, serving as the data and memory backbone for intelligent agents and applications.

Go 1,850 300 Updated Jun 23, 2026

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Jupyter Notebook 4,742 836 Updated Jun 22, 2026

ASR/NLP/TTS deep learning inference library for NVIDIA Jetson using PyTorch and TensorRT

Python 229 50 Updated Feb 9, 2024

NVIDIA Riva runnable tutorials

Jupyter Notebook 170 55 Updated Jun 15, 2026
Python 1,293 133 Updated May 20, 2026

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 855 58 Updated Jun 23, 2026

slime is an LLM post-training framework for RL Scaling.

Python 6,716 972 Updated Jun 23, 2026

GPU-optimized version of the MuJoCo physics simulator, designed for NVIDIA hardware.

Python 1,314 171 Updated Jun 23, 2026