Skip to content
View AlexSabaka's full-sized avatar

Block or report AlexSabaka

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Generating Pedagogically Meaningful Visuals for Math Word Problems: A New Benchmark and Analysis of Text-to-Image Models, ACL 2025 Findings

Python 14 5 Updated Oct 28, 2025

You should definitely walk! 🤖🚙🧽

Python 4 Updated Feb 17, 2026

GSM-Symbolic templates and generated data

88 12 Updated Dec 8, 2024

QuickLook source code preview and icon thumbnailing app extensions for macOS Catalina and beyond

Swift 73 7 Updated Apr 23, 2026

Official Codebase for "Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights"

Python 583 58 Updated Apr 2, 2026

Repository for Fraud Dataset Benchmark

Jupyter Notebook 260 41 Updated Sep 26, 2023

A tiny model that teaches itself to code better. On your laptop. No cloud. No teacher model. No human feedback.

Python 62 9 Updated Mar 10, 2026

Smart home assistant powered by an SLM

Dart 15 1 Updated Apr 7, 2026

Reinforcement Learning via Self-Distillation (SDPO)

Python 825 87 Updated Feb 18, 2026

⚡⚡ Lightning Fast (~300TPS) Reinforcement Learning environment on latest Minecraft 🏝️

Kotlin 62 3 Updated Mar 20, 2026

Dataset and codes for ACL 2019 DocRED: A Large-Scale Document-Level Relation Extraction Dataset.

Python 651 111 Updated Dec 1, 2020

REBEL is a seq2seq model that simplifies Relation Extraction (EMNLP 2021).

Python 564 74 Updated Nov 9, 2023

Compile and run LLVM IR in the browser

HTML 132 13 Updated Jun 11, 2023

clasp Common Lisp environment

Common Lisp 2,750 155 Updated Apr 28, 2026

The LLM Evaluation Framework

Python 15,084 1,396 Updated Apr 29, 2026

The best ChatGPT that $100 can buy.

Python 52,731 7,056 Updated Apr 14, 2026

MiDaS: Large-scale Minecraft Dataset

Python 7 1 Updated Dec 9, 2021

Weekly free datasets from global news sites

45 7 Updated Apr 26, 2026

FNSPID: A Comprehensive Financial News Dataset in Time Series

Python 405 83 Updated Jul 24, 2025

Curated list of Ukrainian natural language processing (NLP) resources (corpora, pretrained models, libriaries, etc.)

233 24 Updated Feb 19, 2026

Benchmark to estimate model sycophancy

TeX 26 5 Updated Nov 30, 2025

AI agents running research on single-GPU nanochat training automatically

Python 78,091 11,387 Updated Mar 26, 2026

An MCP server plus a CLI tool that indexes local code into a graph database to provide context to AI assistants.

Python 3,096 555 Updated Apr 13, 2026

Ghidra MCP Server — 200+ MCP tools for AI-powered reverse engineering. GUI plugin + headless server, lazy tool loading, convention enforcement, batch operations, Ghidra Server integration, and Dock…

Java 1,766 126 Updated Apr 27, 2026

A collection of benchmarks and datasets for evaluating LLM.

567 35 Updated Jul 13, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 32,648 3,448 Updated Apr 26, 2026

Awesome RWKV Projects

8 Updated Sep 5, 2025

iNatSounds Datasets

43 2 Updated Dec 13, 2024

The NES Music Database: use machine learning to compose music for the Nintendo Entertainment System!

Python 498 45 Updated Dec 22, 2024

Generate 8-bit chiptunes with deep learning

Python 355 44 Updated Nov 7, 2021
Next