-
Google
- Kirkland, WA
- https://www.linkedin.com/in/haiyuancao
Stars
An open-source Python SDK for analyzing, evaluating, and curating agent traces stored in BigQuery. Built on top of the BigQuery Agent Analytics, it provides a consumption-layer toolkit for agent ob…
The original nirholas/claude-code before DMCA and take down. Once everything is cleared, it will return. Working with Anthropic and Github to get everything back.
🦞 Just talk to your agent — it learns and EVOLVES 🧬.
An open-source, code-first toolkit for analysis and evaluating agent traces via BigQuery Agent Analytics Plugin
repo to collect the agent analytics notebook
Kode CLI — Design for post-human workflows. One unit agent for every human & computer task.
A simple yet powerful agent framework that delivers with open-source models
The absolute trainer to light up AI agents.
🔥 Comprehensive survey on Context Engineering: from prompt engineering to production-grade AI systems. hundreds of papers, frameworks, and implementation guides for LLMs and AI agents.
The power of Claude Code / GeminiCLI / CodexCLI + [Gemini / OpenAI / OpenRouter / Azure / Grok / Ollama / Custom Model / All Of The Above] working as one.
Copilot Chat extension for VS Code
A modular, documentation-driven framework using Cursor custom modes (VAN, PLAN, CREATIVE, IMPLEMENT) to provide persistent memory and guide AI through a structured development workflow with visual …
FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…
The #1 open-source SWE-bench Verified implementation
An example starter repo for Python projects
⏩ Source-controlled AI checks, enforceable in CI. Powered by the open-source Continue CLI
Prompt, run, edit, and deploy full-stack web applications. -- bolt.new -- Help Center: https://support.bolt.new/ -- Community Support: https://discord.com/invite/stackblitz
📦 Repomix is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) or other AI tools lik…
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. Each technique has a detailed notebook tutorial.
Source code for Twitter's Recommendation Algorithm
Examples and guides for using the OpenAI API
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Clubhouse API written in Python. Standalone client included. For reference and education purposes only.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Model interpretability and understanding for PyTorch
SSD: Single Shot MultiBox Detector | a PyTorch Tutorial to Object Detection
Image augmentation for machine learning experiments.