pyshka501

Follow

🤔

Gaining new knowledge

Konstantin pyshka501

🤔

Gaining new knowledge

Follow

8 followers · 0 following

Achievements

Achievements

pyshka501/README.md

Konstantin Pchelin (pyshka501) 🚀

Machine Learning Researcher | Reinforcement Learning | LLM Systems

🧠 About Me

ML researcher focused on reinforcement learning and LLM-based systems.

Research: RL, RLHF, agent systems, vulnerability discovery
Building: production-grade ML systems & high-performance inference
Teaching: ML & RL courses with 500+ students

I work at the intersection of:

theory (RL, optimization, convergence)
systems (LLM infra, high-load serving)
applied research (agents, code intelligence, security)

🌐 Personal website: https://mountainai.tech/

🔬 Research Interests

Reinforcement Learning (from bandits → PPO / GRPO / RLHF)
LLM agents & tool-use systems
Offline / Online RL for language models
Structural reasoning & context reconstruction in code

⚙️ Selected Work

📊 RL Atlas — interactive visualization of RL algorithms
→ https://github.com/pyshka501/mountainai-rl-atlas
🤗 HuggingFace Spaces (interactive demos & experiments)
→ https://huggingface.co/pyshka501/spaces
🧪 Research on blind vulnerability discovery with LLMs
(context reconstruction as core bottleneck)
⚡ High-performance LLM inference systems
(latency optimization, multi-token prediction, scaling)

🛠 Tech Stack

ML / Research

PyTorch, RL (TD, MC, PG, PPO, RLHF)
Transformers, LLM fine-tuning, evaluation
Statistical modeling & optimization

Systems / Infra

vLLM, Triton, high-load inference
Docker, FastAPI, distributed systems
Performance optimization & scaling

Data

pandas, NumPy, scikit-learn
Large-scale dataset processing

📚 Teaching

Author and instructor of multiple courses:

Reinforcement Learning: from Bandits to PPO/GRPO & RLHF
Machine Learning (fundamentals → production)
NLP & Semantic Search

📈 Courses launched and scaled to 500+ students

Mathematical rigor + intuition
Practical assignments & real implementations
Focus on modern ML systems and real-world use cases

🤝 Open to Collaboration

RL / LLM research
Agent systems & reasoning
ML infra & optimization
Early-stage AI products

📬 Contact

Pinned Loading

mountainai-rl-atlas mountainai-rl-atlas Public

Interactive platform for exploring and visualizing reinforcement learning algorithms — from tabular methods to deep RL and RLHF. Compare methods, tune hyperparameters, and analyze training dynamics…

Python 6
rl-textbook rl-textbook Public

Reinforcement Learning: From Bandits to LLM Alignment — Open textbook with 17 chapters, Colab notebooks, and exercises

TeX 65 7