Skip to content
View Kyriection's full-sized avatar
🎨
Focusing
🎨
Focusing

Block or report Kyriection

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Post-training with Tinker

Python 900 58 Updated Oct 9, 2025

A collection of LogitsProcessors to customize and enhance LLM behavior for specific tasks.

Python 359 24 Updated Jul 8, 2025

Kronos: A Foundation Model for the Language of Financial Markets

Python 7,072 1,473 Updated Sep 16, 2025

This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.

Python 170 5 Updated Jul 7, 2025

The official implementation for "Mitigating Overthinking in Large Reasoning Models via Manifold Steering"

6 Updated May 29, 2025
Python 106 1 Updated Jun 15, 2025

GPQA: A Graduate-Level Google-Proof Q&A Benchmark

Jupyter Notebook 412 42 Updated Sep 30, 2024

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,030 51 Updated Jul 30, 2025

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 18,750 1,834 Updated Oct 6, 2025
Python 47 2 Updated Sep 16, 2024

Official code for SEAL: Steerable Reasoning Calibration of Large Language Models for Free

Python 42 4 Updated Apr 6, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 2,920 291 Updated Sep 30, 2025

Code for the paper: "Learning to Reason without External Rewards"

Python 360 41 Updated Jul 10, 2025
Python 19 2 Updated Oct 3, 2025

Inverse Scaling in Test-Time Compute

Python 22 2 Updated Jul 22, 2025

Stanford NLP Python library for benchmarking the utility of LLM interpretability methods

Python 136 18 Updated Jun 25, 2025

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 893 112 Updated Aug 14, 2024

Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP

Python 84 7 Updated Aug 20, 2025

The implementation of paper "AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint"

Python 23 2 Updated Jul 13, 2025

Kinetics: Rethinking Test-Time Scaling Laws

Python 80 2 Updated Jul 11, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 994 129 Updated Oct 9, 2025

Resa: Transparent Reasoning Models via SAEs

Python 43 3 Updated Sep 23, 2025

Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.

Python 343 53 Updated Jun 23, 2025

Self-Adapting Language Models

Python 805 143 Updated Aug 1, 2025

[ICLR 2025 Workshop] "Landscape of Thoughts: Visualizing the Reasoning Process of Large Language Models"

Jupyter Notebook 36 4 Updated Aug 16, 2025
Python 120 5 Updated Jun 11, 2025
Next