Stars
Hierarchical Expert Prompt for Large-Language-Models: An Approch Defeat Elite AI in TextStarCraft-II for the First Time
TextStarCraft2,a pure language env which support llms play starcraft2
[𝗜𝗖𝗠𝗟 𝟮𝟬𝟮𝟲] Dispersion loss counteracts embedding condensation and improves generalization in small language models
MathCode: A Frontier Mathematical Coding Agent
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
A lightweight alternative to OpenClaw that runs in containers for security. Connects to WhatsApp, Telegram, Slack, Discord, Gmail and other messaging apps,, has memory, scheduled jobs, and runs dir…
An agent-managed museum exhibit, built in Rust with Gajae-Code / LazyCodex — developed and maintained with no human intervention.
My learning notes for ML SYS.
A framework for few-shot evaluation of language models.
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
An educational resource to help anyone learn deep reinforcement learning.
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts Interested in World Modeling.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
[TMLR 2024] Efficient Large Language Models: A Survey
verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework
[ICLR 2026] Official code for TraceRL: Revolutionizing post-training for Diffusion LLMs, powering the SOTA TraDo series.
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement …
Collection of advice for prospective and current PhD students
Repair malformed JSON from LLMs, APIs, logs, and user input in Python.
This repository contains a LLM benchmark for the social deduction game `Resistance Avalon'
这是一个 Nginx 极简教程,目的在于帮助新手快速入门 Nginx。