Skip to content
View pyshka501's full-sized avatar
🤔
Gaining new knowledge
🤔
Gaining new knowledge

Block or report pyshka501

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Implementation of tensor-based dynamic mode decomposition algorithm using JAX

Python 3 Updated Apr 16, 2026

Reinforcement Learning: From Bandits to LLM Alignment — Open textbook with 17 chapters, Colab notebooks, and exercises

TeX 65 7 Updated May 11, 2026
Python 2 Updated Mar 25, 2026

Interactive platform for exploring and visualizing reinforcement learning algorithms — from tabular methods to deep RL and RLHF. Compare methods, tune hyperparameters, and analyze training dynamics…

Python 6 Updated May 11, 2026

This repository contains lecture notes, practical materials, and implementations for the course: "Reinforcement Learning: from Bandits to RLHF" The course is designed to provide a deep and systemat…

Jupyter Notebook 36 Updated Mar 21, 2026

Unofficial PyTorch Implementation of "Your LLM Knows the Future: Uncovering Its Multi-Token Prediction Potential".

Python 10 Updated Oct 8, 2025

This repository contains the Hugging Face Agents Course.

MDX 28,687 2,061 Updated Apr 27, 2026
Python 2 Updated Dec 2, 2024

Config files for my GitHub profile.

1 Updated Mar 23, 2026
Python 2 Updated Apr 6, 2025

Репозиторий с материалами курса, читаемого Пчелиным Константином весной 2025 года в МГУ

Jupyter Notebook 9 1 Updated Apr 29, 2025