-
Greenhouse Software
- Alameda, CA
- @mcclurmc
Stars
Native BM25 Ranking Index in PostgreSQL
Train transformer language models with reinforcement learning.
DSPy: The framework for programming—not prompting—language models
Open Source Semantic Search for your AI Agent
Claude Code superpowers: core skills library
A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows
Open Source DeepWiki: AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories. Join the discord: https://discord.gg/gMwThUMeme
Supercharge Your LLM with the Fastest KV Cache Layer
💫 Toolkit to help you get started with Spec-Driven Development
Optimize prompts, code, and more with AI-powered Reflective Text Evolution
BLEURT is a metric for Natural Language Generation based on transfer learning.
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.
An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.
Typed interactions with the GitHub API v3
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-of-use, backed by research.
A list of Claude Code Sub-Agents submitted by the community.
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Simplifying reinforcement learning for complex game environments
Official repository for the paper "LiveCodeBench: Holistic and Contamination Free Evaluation of Large Language Models for Code"
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LiveBench: A Challenging, Contamination-Free LLM Benchmark
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
A topic-centric list of HQ open datasets.
A benchmark for emotional intelligence in large language models