Highlights
- Pro
Stars
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
AGENTS.md — a simple, open format for guiding coding agents
💫 Industrial-strength Natural Language Processing (NLP) in Python
Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
LLM prompts for structured software development because quality takes more than just "good vibes".
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
💫 Toolkit to help you get started with Spec-Driven Development
Runnable algo template for trading the Options Wheel strategy
Lightweight coding agent that runs in your terminal
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Supercharge Your LLM Application Evaluations 🚀
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
FoldFlow: SE(3)-Stochastic Flow Matching for Protein Backbone Generation
DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.
A versatile pairwise aligner for genomic and spliced nucleotide sequences
Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)
Metaprogramming, verified meta-theory and implementation of Rocq in Rocq
Specifications of SAM/BAM and related high-throughput sequencing file formats
C library for high-throughput sequencing data formats
This is the development home of the workflow management system Snakemake. For general information, see
A DSL for data-driven computational pipelines
Homomer symmetry prediction from protein sequence