DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 680 73 Updated Apr 13, 2026

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

JavaScript 156,729 24,328 Updated Apr 15, 2026
Python 3 Updated Oct 12, 2025

A resource repository for machine unlearning in large language models

567 31 Updated Apr 14, 2026

The official implementation of “ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought”

Python 49 4 Updated Feb 2, 2026

Dataset for the temporal memory tests

13 1 Updated Jun 4, 2024

LLM Unlearning

Python 184 20 Updated Oct 20, 2023

Alpha Screening with LLM Reasoning via Reinforcement Learning

64 8 Updated Dec 30, 2025

bt - flexible backtesting for Python

Python 2,849 471 Updated Mar 31, 2026

Holmes is an interactive, text-based crime investigation game powered by a large language model (LLM). With each replay, the game offers a fresh narrative, ensuring a unique experience for players …

Python 21 6 Updated Oct 2, 2023

A multi-agent LLM-based financial trading framework for Chinese users - TradingAgents Chinese-enhanced edition

Python 24,077 5,059 Updated Apr 15, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,840 5,365 Updated Apr 15, 2026

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 331 40 Updated Jan 26, 2026

Nano vLLM

Python 12,903 1,930 Updated Apr 13, 2026

The RL Bridge for LLM-based Agent Applications. Made Simple & Flexible.

Python 5,043 464 Updated Apr 15, 2026

Custom reward development on top of verl

Python 148 7 Updated May 22, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 76,674 15,611 Updated Apr 15, 2026

llm & rl

Jupyter Notebook 283 29 Updated Oct 24, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 13,053 1,582 Updated Feb 27, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,695 3,659 Updated Apr 15, 2026

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,614 217 Updated Apr 14, 2026

Simple RL training for reasoning

Python 3,845 289 Updated Dec 23, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 843 53 Updated May 14, 2025

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Async RL)

Python 9,350 917 Updated Apr 15, 2026

[ICCV 2019] Monocular depth estimation from a single image

Jupyter Notebook 4,474 985 Updated Aug 10, 2024

The official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Python 7 3 Updated Apr 18, 2023

Self-supervised monocular depth estimation with a vision transformer

Python 184 21 Updated Apr 3, 2023