OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. …

Python 1,728 217 Updated May 7, 2024

ShishirPatil / gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 12,908 1,382 Updated Apr 13, 2026

OpenBMB / ToolBench

[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.

Python 5,671 485 Updated May 21, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 18,667 2,793 Updated Jun 18, 2026

wuhao21 / sts2-cli

Headless Slay the Spire 2 CLI — play the full game from a terminal.

C# 207 37 Updated May 30, 2026

z-lab / dflash

DFlash: Block Diffusion for Flash Speculative Decoding

Python 5,168 373 Updated May 10, 2026

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!

Python 5,272 719 Updated Jun 18, 2026

agentica-project / rllm

Jupyter Notebook 393 32 Updated Sep 17, 2025

Gen-Verse / Open-AgentRL

RLAnything (ICML 2026) & AutoTool (ICML 2026), DemyAgent: Open-Source RL for LLMs and Agentic Scenarios

Python 555 56 Updated Jun 12, 2026

JarvisPei / EvolveClaw

Make your OpenClaw agent self-improving

TypeScript 10 Updated Mar 31, 2026

JarvisPei / Behavioral-Fingerprinting

Python 9 1 Updated Sep 8, 2025

JarvisPei / FuseGPT

The implementation for the paper, FuseGPT: Learnable Layers Fusion of Generative Pre-trained Transformers.

Python 5 1 Updated Jan 15, 2025

JarvisPei / CMoE

[ACL 2026 Main] Analytical FFN-to-MoE Restructuring via Activation Pattern Analysis

Python 43 11 Updated Apr 24, 2026

JarvisPei / PreMoE

Proactive Inference for Efficient Mixture-of-Experts

Python 7 Updated Apr 24, 2026

JarvisPei / SCOPE

SCOPE: Self-evolving Context Optimization via Prompt Evolution - A framework for automatic prompt optimization

Python 78 6 Updated Mar 26, 2026

JarvisPei / MemDLM

MemDLM: Memory-enhanced Diffusion Language Model

Python 10 Updated Apr 8, 2026

Gen-Verse / OpenClaw-RL

OpenClaw-RL: Train any agent simply by talking

Python 5,505 597 Updated May 23, 2026

openclaw / openclaw

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 379,404 79,423 Updated Jun 18, 2026

ZHZisZZ / dllm

dLLM: Simple Diffusion Language Modeling

Python 2,583 271 Updated Jun 12, 2026

google-deepmind / loft

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 233 17 Updated Apr 13, 2026

test-time-training / e2e

Official JAX implementation of End-to-End Test-Time Training for Long Context

Python 621 47 Updated Feb 15, 2026

pliang279 / awesome-phd-advice

Collection of advice for prospective and current PhD students

2,106 155 Updated Jul 10, 2024

modelscope / ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, …

Python 14,558 1,483 Updated Jun 18, 2026

SawyDust1228 / HSIC-DKL-Yield-Estimation

[ASPDAC23] High Dimensional Yield Estimation using Shrinkage Deep Features and Maximization of Integral Entropy Reduction

Jupyter Notebook 14 1 Updated Oct 9, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JarvisPei

Achievements