-
Zhejiang University
- Hangzhou, China
-
16:17
(UTC +08:00) - https://tricktreat.github.io/
- @itricktreat
Highlights
- Pro
Lists (8)
Sort Name ascending (A-Z)
Stars
A curated collection of papers, technical reports, frameworks, and tools for on-policy distillation of large language models
Official code for "Self-Distilled Agentic Reinforcement Learning"
Official repository of "CollabVR: Collaborative Video Reasoning with Vision-Language and Video Generation Models"
Coding agent for DeepSeek models that runs in your terminal
Memory for 24/7 proactive agents like OpenClaw.
[ICML 2026] Milestone-Guided Policy Learning for Long-Horizon Language Agents
Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework
CL-bench: A Benchmark for Context Learning
🎨 Local-first, open-source alternative to Anthropic's Claude Design. ⚡ 19 Skills · ✨ 71 brand-grade Design Systems 🖼 Generate web · desktop · mobile prototypes · slides · images · videos · HyperFra…
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
AI agent toolkit: coding agent CLI, unified LLM API, TUI & web UI libraries, Slack bot, vLLM pods
Easy Data Preparation with latest LLMs-based Operators and Pipelines.
The headless browser for AI agents and web scraping
AI agents running research on single-GPU nanochat training automatically
EvoSkill — An open-source framework that automatically discovers and synthesizes reusable agent skills from failed trajectories to improve coding agent performance.
🤗 ml-intern: an open-source ML engineer that reads papers, trains models, and ships ML models
Awesome list for AI agent harness engineering: tools, patterns, evals, memory, MCP, permissions, observability, and orchestration.
[ACL 2026 findings] Pause or Fabricate? Training Language Models for Grounded Reasoning
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification
A curated list of AI Agent Swarm frameworks, multi-agent orchestration, swarm intelligence, and collaborative agent systems.
A curated list of AI Agent evolution, memory systems, multi-agent architectures, and self-improvement projects. | evomap.ai
A Systematic Analysis and Discussion of Claude Code for Designing Today's and Future AI Agent Systems
"DeepTutor: Agent-Native Personalized Learning Assistant"
A collection of DESIGN.md files inspired by popular brand design systems. Drop one into your project and let coding agents generate a matching UI.
[ACL 2026 main] Official code for "UI-Copilot: Advancing Long-Horizon GUI Automation via Tool-Integrated Policy Optimization"
SpatialEvo: Self-Evolving Spatial Intelligence via Deterministic Geometric Environments
Generate beautiful dark-themed system architecture diagrams as standalone HTML/SVG files. Works as a Claude AI skill.