Skip to content
View imoneoi's full-sized avatar
🎯
Tuning PPO
🎯
Tuning PPO

Organizations

@OpenOrca @FastEval

Block or report imoneoi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 6,243 507 Updated Mar 28, 2026

Hierarchical Reasoning Model Official Release

Python 12,378 1,809 Updated Sep 9, 2025

Monte Carlo tree search in JAX

Python 2,603 207 Updated Sep 2, 2025

Fused Adam-atan2 implementation

Python 5 7 Updated Apr 2, 2025

unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"

Python 88 16 Updated Jul 4, 2022

Github action to maximize the available disk space on Github runners

507 110 Updated Mar 28, 2025

BFloat16 Fused Adam Operator for PyTorch

Python 17 1 Updated Nov 16, 2024

[AAAI'25 Oral] Are Expressive Models Truly Necessary for Offline RL?

Python 14 4 Updated Dec 10, 2024
Python 769 54 Updated Jun 13, 2024

Grok open release

Python 51,513 8,468 Updated Aug 30, 2024

Typed command line interfaces with argparse and pydantic

Python 51 4 Updated Jan 10, 2025

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,927 299 Updated Nov 25, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)

C++ 851 63 Updated Mar 29, 2026

Typed Argument Parsing with Pydantic

Python 135 23 Updated Mar 23, 2026

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 7 1 Updated Dec 27, 2023

NVIDIA Linux open GPU kernel module source

C 16,846 1,634 Updated Mar 24, 2026

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 299 39 Updated Feb 6, 2023

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 66 3 Updated Apr 18, 2023
Python 1 Updated Nov 22, 2023

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,879 167 Updated May 9, 2023

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 41,868 7,395 Updated Mar 30, 2026

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,479 435 Updated Sep 13, 2024
JavaScript 20 2 Updated Apr 1, 2024

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 22 4 Updated Oct 23, 2023

A multi-purpose LLM framework for RAG and data creation.

Python 629 48 Updated Jan 13, 2024

WireGuard Configuration Portal with LDAP connection

Go 1,657 177 Updated Mar 25, 2026

Curate better data for LLMs

Python 1,068 105 Updated Mar 19, 2024
Next