-
Northeastern University
- Boston, United States
-
23:17
(UTC -04:00) - https://khaiwang.github.io
Lists (1)
Sort Name ascending (A-Z)
Stars
A Datacenter Scale Distributed Inference Serving Framework
Deep-reading pipeline for research papers — LLM-powered reports, vector KB, and knowledge graph.
Automated bibliography verification and LaTeX quality auditing for papers.
It's a plugin extension in Zotero. Zotero MCP Plugin enables integration between AI assistants and Zotero through MCP. Zotero MCP Plugin 是一个 Zotero 插件,通过 MCP协议实现 AI 助手与 Zotero深度集成。插件支持文献检索、元 数据管理、全…
A terminal workspace with batteries included
Extract residual-stream activations and apply steering vectors (including activation oracles) to any vLLM model during inference.
Render markdown on the CLI, with pizzazz! 💅🏻
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Bub it. Build it. A hook-first runtime for agents that live alongside people.
SGLang is a high-performance serving framework for large language models and multimodal models.
The nnsight package enables interpreting and manipulating the internals of deep learned models.
The NDIF server, which performs deep inference and serves nnsight requests remotely
khaiwang / ndif
Forked from ndif-team/ndifThe NDIF server, which performs deep inference and serves nnsight requests remotely
Bash is all you need - A nano claude code–like 「agent harness」, built from 0 to 1
Comprehensive open-source library of AI research and engineering skills for any AI model. Package the skills and your claude code/codex/gemini agent will be an AI research agent with full horsepowe…
LaTeX Template for Mike Morrison's #betterposter
A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.
Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo)
💫 Toolkit to help you get started with Spec-Driven Development
🚀 EvoAgentX: Building a Self-Evolving Ecosystem of AI Agents
GPT-Prompt-Hub is an open-source community-driven repository dedicated to the collection, sharing, and refinement of custom GPT prompts
Recommend new arxiv papers of your interest daily according to your Zotero libarary.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
A High-Efficiency System of Large Language Model Based Search Agents
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
🎓Automatically Update CV Papers Daily using Github Actions