View ZacLing's full-sized avatar


The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 135,684 19,947 Updated Apr 3, 2026
Python 3 Updated Oct 12, 2025

A resource repository for machine unlearning in large language models

561 31 Updated Mar 22, 2026

The official implementation of “ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought”

Python 49 4 Updated Feb 2, 2026

Dataset for the temporal memory tests

13 1 Updated Jun 4, 2024

LLM Unlearning

Python 183 20 Updated Oct 20, 2023

Alpha Screening with LLM Reasoning via Reinforcement Learning

62 8 Updated Dec 30, 2025

bt - flexible backtesting for Python

Python 2,840 470 Updated Mar 31, 2026

Holmes is an interactive, text-based crime investigation game powered by a large language model (LLM). With each replay, the game offers a fresh narrative, ensuring a unique experience for players …

Python 20 6 Updated Oct 2, 2023

A Chinese-language financial trading framework based on multi-agent LLMs - an enhanced Chinese edition of TradingAgents

Python 23,249 4,848 Updated Feb 14, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,386 5,162 Updated Apr 3, 2026

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 326 39 Updated Jan 26, 2026

Nano vLLM

Python 12,670 1,862 Updated Nov 3, 2025

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,975 446 Updated Apr 3, 2026

Custom reward development on top of verl

Python 148 7 Updated May 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 75,155 15,128 Updated Apr 3, 2026

llm & rl

Jupyter Notebook 281 28 Updated Oct 24, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 13,015 1,585 Updated Feb 27, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,424 3,571 Updated Apr 3, 2026

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,576 214 Updated Mar 28, 2026

Simple RL training for reasoning

Python 3,846 289 Updated Dec 23, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 845 53 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python 9,297 911 Updated Apr 3, 2026

[ICCV 2019] Monocular depth estimation from a single image

Jupyter Notebook 4,470 986 Updated Aug 10, 2024

The official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Python 7 3 Updated Apr 18, 2023

Self-supervised monocular depth estimation with a vision transformer

Python 183 21 Updated Apr 3, 2023

Reproduce R1 Zero on Logic Puzzle

Python 2,444 164 Updated Mar 20, 2025