-
University of science and technology of China
- ANHUI HEFEI
-
01:10
(UTC -12:00)
Highlights
- Pro
Stars
Standardized environment infrastructure for Agentic AI development.
VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
Qwen3Guard is a multilingual guardrail model series developed by the Qwen team at Alibaba Cloud.
An educational resource to help anyone learn deep reinforcement learning.
Train your Agent model via our easy and efficient framework
Desktop to-do and other plugins based on Vue and Electron
AGENTS.md — a simple, open format for guiding coding agents
A Comprehensive Survey on Long Context Language Modeling
A curated list of awesome commands, files, and workflows for Claude Code
Get started using GitHub in less than an hour.
Open-source search and retrieval database for AI applications.
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
A framework for few-shot evaluation of language models.
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP
An easy-to-use Python framework to generate adversarial jailbreak prompts.
verl: Volcano Engine Reinforcement Learning for LLMs
🎓Automatically Update CV Papers Daily using Github Actions
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
My learning notes for ML SYS.
Official Repo for Open-Reasoner-Zero