- Toronto
Lists (15)
Sort Name ascending (A-Z)
Starred repositories
"🐈 nanobot: The Ultra-Lightweight Personal AI Agent"
A curated list of awesome Skills, resources, and tools for customizing coding agent workflows.
Tutorial on how to build a minimal software engineering agent that still scores high on SWE-bench verified
A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.
Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.
OpenClaw-RL: Train any agent simply by talking
RLAnything & DemyAgent: General and scalable agentic RL algorithms across terminal, GUI, SWE, and tool-call settings
τ-Bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
This is a repo for a number of examples using the smolagents framework from Hugging Face.
The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents"
This training offers an intensive exploration into the frontier of reinforcement learning techniques with large language models (LLMs). We will explore advanced topics such as Reinforcement Learnin…
Metrics, Benchmarks, and Practical Tools for Assessing Large Language Models
An introduction to the world of AI Agents
Deep Reinforcement Learning Hands-On, 3E_Published by Packt
Grokking Deep Reinforcement Learning
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
qwen3-base family of models RL on gsm8k using verl, is there an RL power law on downstream tasks?
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
An open-source AI agent that brings the power of Gemini directly into your terminal.
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Data Infrastructure providing a declarative, incremental approach for multimodal AI workloads.
A list of engineering manager resource links.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.