Skip to content
View IlyaGusev's full-sized avatar

Block or report IlyaGusev

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Compile programs directly into transformer weights. Includes a 2D convex-hull KV cache with O(log n) inference.

Python 206 40 Updated Jun 1, 2026

A non-saturating, open-ended environment for evaluating LLMs in Factorio

Python 1,001 83 Updated Jun 11, 2026

Генеративные хахашки

Python 11 Updated Feb 25, 2025

Code repository for the paper "Mission: Impossible Language Models."

Jupyter Notebook 56 9 Updated Sep 25, 2025
Python 14 2 Updated Jan 17, 2024

The repository for the code of the UltraFastBERT paper

Python 517 29 Updated Mar 24, 2024

Merge Transformers language models by use of gradient parameters.

Python 214 24 Updated Aug 8, 2024

SONAR, a new multilingual and multimodal fixed-size sentence embedding space, with a full suite of speech and text encoders and decoders.

Python 896 100 Updated Oct 10, 2025

Interpretability for sequence generation models 🐛 🔍

Python 467 41 Updated Apr 25, 2026

Using transformers to generate Russian poetry

Python 36 5 Updated Aug 21, 2023

Scripts that were used to scrape and process data from Yandex.Q

Python 2 Updated Dec 4, 2022

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

559 33 Updated Jun 25, 2024

A prize for finding tasks that cause large language models to show inverse scaling

621 27 Updated Oct 11, 2023

Russian coreference resolution made as simple and accessible as could be

JavaScript 11 Updated Sep 3, 2022

Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Reid and Victor Zhong

Python 31 5 Updated Oct 24, 2022

Library for Russian rap generation.

Python 23 3 Updated Apr 19, 2025

Structured state space sequence models

Jupyter Notebook 2,906 361 Updated Jul 17, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 161,583 33,497 Updated Jun 14, 2026
Python 1 Updated Jun 24, 2021

1st place solution for RuSimpleSentEval

Python 9 1 Updated Apr 12, 2021

Probing suite for evaluation of Russian embedding and language models

Python 32 3 Updated Oct 1, 2024

C++ Chess Engine

C++ 74 16 Updated Oct 17, 2024

Russian GPT3 models.

Python 2,088 434 Updated Dec 12, 2022
3 Updated Sep 30, 2020

Курс Nand2Tetris в школе "Интеллектуал"

Python 19 2 Updated May 11, 2017

The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.

TypeScript 3,654 370 Updated Jun 9, 2026

Telegram Data Clustering Contest (Bossy Gnu's submission )

C++ 6 2 Updated Feb 8, 2021

Winning entry for Telegram Data Clustering competition

Jupyter Notebook 6 1 Updated Aug 5, 2020
Next