stephenroller

Follow

🥫

Stephen Roller stephenroller

🥫

Follow

NLP researcher working on large language models.

616 followers · 47 following

Achievements

Achievements

Stars

facebookresearch / ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Python 10,622 2,087 Updated Nov 3, 2023

character-ai / prompt-poet

Streamlines and simplifies prompt design for both developers and non-technical users with a low code approach.

Python 1,128 95 Updated Oct 23, 2025

SchedMD / slurm

Slurm: A Highly Scalable Workload Manager

C 3,575 774 Updated Dec 19, 2025

openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Python 17,447 2,853 Updated Nov 3, 2025

FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.

Python 9,382 588 Updated Oct 28, 2024

stephenroller / dotfiles

My dotfiles

Vim Script 8 1 Updated Jun 21, 2025

approximatelabs / lambdaprompt

λprompt - A functional programming interface for building AI systems

Python 380 22 Updated Jan 18, 2024

google-research / cascades

Python library which enables complex compositions of language models such as scratchpads, chain of thought, tool use, selection-inference, and more.

Python 216 15 Updated Jun 3, 2025

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 17,887 2,461 Updated Dec 20, 2025

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 21,196 2,232 Updated Dec 18, 2025

facebookresearch / metaseq

Repo for external large-scale work

Python 6,547 723 Updated Apr 27, 2024

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,323 1,008 Updated Dec 16, 2025

NVIDIA / Megatron-LM

Ongoing research training transformer models at scale

Python 14,652 3,398 Updated Dec 20, 2025

wilicc / gpu-burn

Multi-GPU CUDA stress test

C++ 2,042 379 Updated Nov 4, 2025

XiangLi1999 / PrefixTuning

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Python 957 164 Updated Apr 26, 2024

bigscience-workshop / bigscience

Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

Shell 1,007 103 Updated Jul 29, 2024

JulesGM / ParlAI_SearchEngine

A search engine for ParlAI's BlenderBot project (and probably other ones as well)

Python 130 47 Updated Dec 20, 2021

vered1986 / self_talk

Code and data for the paper: "Unsupervised Common Sense Question Answering with Self-Talk"

Python 79 11 Updated Jul 19, 2021

mkdocstrings / mkdocstrings

📘 Automatic documentation from sources, for MkDocs.

Python 2,026 121 Updated Nov 30, 2025

sstadick / hck

A sharp cut(1) clone.

Rust 726 18 Updated Dec 1, 2025

hyunwoongko / openchat

OpenChat: Easy to use opensource chatting framework via neural networks

Python 432 56 Updated Jul 15, 2023

facebookresearch / madgrad

MADGRAD Optimization Method

Python 804 58 Updated Jan 27, 2025

facebookresearch / simmc

With the aim of building next generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), th…

Python 133 37 Updated Oct 21, 2023

facebookresearch / fairscale

PyTorch extensions for high performance and large scale training.

Python 3,391 294 Updated Apr 26, 2025

fsspec / filesystem_spec

A specification that python filesystems should adhere to.

Python 1,263 425 Updated Dec 17, 2025

thu-coai / ConvLab-2

ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems

Python 464 138 Updated Jun 17, 2024

executablebooks / MyST-Parser

An extended commonmark compliant parser, with bridges to docutils/sphinx

Python 855 218 Updated Dec 16, 2025

facebookresearch / Mephisto

A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.

Python 312 76 Updated Dec 13, 2024

ivan-bilan / The-NLP-Pandect

A comprehensive reference for all topics related to Natural Language Processing

Python 2,032 281 Updated Oct 12, 2025

allenai / longformer

Longformer: The Long-Document Transformer

Python 2,177 288 Updated Feb 8, 2023