Skip to content
View stephenroller's full-sized avatar
🥫
🥫

Block or report stephenroller

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,622 2,087 Updated Nov 3, 2023

Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.

Python 1,128 95 Updated Oct 23, 2025

Slurm: A Highly Scalable Workload Manager

C 3,575 774 Updated Dec 19, 2025

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,447 2,853 Updated Nov 3, 2025

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 588 Updated Oct 28, 2024

My dotfiles

Vim Script 8 1 Updated Jun 21, 2025

λprompt - A functional programming interface for building AI systems

Python 380 22 Updated Jan 18, 2024

Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.

Python 216 15 Updated Jun 3, 2025

Development repository for the Triton language and compiler

MLIR 17,887 2,461 Updated Dec 20, 2025

Fast and memory-efficient exact attention

Python 21,196 2,232 Updated Dec 18, 2025

Repo for external large-scale work

Python 6,547 723 Updated Apr 27, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,323 1,008 Updated Dec 16, 2025

Ongoing research training transformer models at scale

Python 14,652 3,398 Updated Dec 20, 2025

Multi-GPU CUDA stress test

C++ 2,042 379 Updated Nov 4, 2025

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 957 164 Updated Apr 26, 2024

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,007 103 Updated Jul 29, 2024

A search engine for ParlAI's BlenderBot project (and probably other ones as well)

Python 130 47 Updated Dec 20, 2021

Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk"

Python 79 11 Updated Jul 19, 2021

📘 Automatic documentation from sources, for MkDocs.

Python 2,026 121 Updated Nov 30, 2025

A sharp cut(1) clone.

Rust 726 18 Updated Dec 1, 2025

OpenChat: Easy to use opensource chatting framework via neural networks

Python 432 56 Updated Jul 15, 2023

MADGRAD Optimization Method

Python 804 58 Updated Jan 27, 2025

With the aim of building next generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), th…

Python 133 37 Updated Oct 21, 2023

PyTorch extensions for high performance and large scale training.

Python 3,391 294 Updated Apr 26, 2025

A specification that python filesystems should adhere to.

Python 1,263 425 Updated Dec 17, 2025

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Python 464 138 Updated Jun 17, 2024

An extended commonmark compliant parser, with bridges to docutils/sphinx

Python 855 218 Updated Dec 16, 2025

A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.

Python 312 76 Updated Dec 13, 2024

A comprehensive reference for all topics related to Natural Language Processing

Python 2,032 281 Updated Oct 12, 2025

Longformer: The Long-Document Transformer

Python 2,177 288 Updated Feb 8, 2023
Next