-
Barcelona Supercomputing Center
- Barcelona
- ggcr.github.io
Starred repositories
MoE training for Me and You and maybe other people
Testsuite data for html5lib, including the de-facto standard HTML parsing tests.
Simple high-throughput inference library
Research code artifacts for Code World Model (CWM) including inference tools, reproducibility, and documentation.
Mirage Persistent Kernel: Compiling LLMs into a MegaKernel
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
Access large language models from the command-line
Hurl, run and test HTTP requests with plain text.
Parrot is a C++ library for fused array operations using CUDA/Thrust. It provides efficient GPU-accelerated operations with lazy evaluation semantics, allowing for chaining of operations without un…
An open-source, code-first Go toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
Fast, Flexible and Portable Structured Generation
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >74% on SWE-bench verified!
[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.
FlashInfer: Kernel Library for LLM Serving
Evolution of RTLs (REvolution: An Evolutionary Framework for RTL Generation driven by Large Language Models)