Kyriection

MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts

Python 110 3 Updated Oct 16, 2024

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 3,968 193 Updated Oct 29, 2024

NanoGPT (124M) quality in 2.67B tokens

Python 849 55 Updated Oct 30, 2024

Code for the paper: Why Transformers Need Adam: A Hessian Perspective

Jupyter Notebook 38 1 Updated Apr 26, 2024

A bibliography and survey of the papers surrounding o1

TeX 496 18 Updated Oct 30, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 641 46 Updated Sep 27, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 15,097 1,408 Updated Oct 15, 2024

Retrieval-Augmented Theorem Provers for Lean

Python 223 50 Updated Aug 29, 2024

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 20,046 2,547 Updated Oct 30, 2024

A new markup-based typesetting system that is powerful and easy to learn.

Rust 34,611 921 Updated Oct 29, 2024

Easiest way to build custom agents in a no-code, Notion-style editor, using simple macros.

TypeScript 17 Updated Oct 27, 2024

Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"

Python 105 1 Updated Oct 22, 2024

Materials for learning SGLang

71 3 Updated Oct 21, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,374 378 Updated Jul 16, 2023

The official repo for "LLoCo: Learning Long Contexts Offline"

Python 109 9 Updated Jun 15, 2024

Improving Alignment and Robustness with Circuit Breakers

Jupyter Notebook 149 16 Updated Sep 24, 2024

Fira: Can We Achieve Full-rank Training of LLMs Under Low-rank Constraint?

Python 74 2 Updated Oct 21, 2024

Long Context Extension and Generalization in LLMs

Python 38 1 Updated Sep 21, 2024

Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)

166 11 Updated Sep 22, 2024

Material for gpu-mode lectures

Jupyter Notebook 2,893 286 Updated Oct 21, 2024

LongRoPE is a novel method that can extend the context window of pre-trained LLMs to an impressive 2048k tokens.

Python 98 10 Updated Aug 23, 2024

High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild

Zig 1,610 57 Updated Oct 29, 2024

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

4,914 273 Updated Oct 23, 2024

METIS - Serial Graph Partitioning and Fill-reducing Matrix Ordering

C 702 138 Updated Oct 27, 2023

Main development repository for GAP - Groups, Algorithms, Programming, a System for Computational Discrete Algebra

GAP 810 160 Updated Oct 29, 2024
Python 91 2 Updated Sep 24, 2024

This repo is based on https://github.com/jiaweizzhao/GaLore

Python 18 Updated Sep 18, 2024

Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

JavaScript 66 4 Updated Oct 2, 2024