Skip to content
View akontra's full-sized avatar

Block or report akontra

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 7,988 695 Updated Feb 10, 2025

Searches for lost YouTube videos in archives

Python 338 29 Updated Oct 28, 2025

📚 Process PDFs, Word documents and more with spaCy

Python 794 58 Updated Mar 8, 2025

Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"

Python 248 25 Updated Jan 31, 2025

Agentless🐱: an agentless approach to automatically solve software development problems

Python 1,948 212 Updated Dec 22, 2024

Build GUI for your Python program with JavaScript, HTML, and CSS

Python 5,529 611 Updated Nov 3, 2025

On the Theoretical Limitations of Embedding-Based Retrieval

Jupyter Notebook 589 44 Updated Sep 15, 2025

Monte Carlo tree search in JAX

Python 2,552 209 Updated Sep 2, 2025

A library of reinforcement learning components and agents

Python 3,832 501 Updated Sep 26, 2025

This dataset code generates mathematical question and answer pairs, from a range of question types at roughly school-level difficulty.

Python 1,913 262 Updated Dec 23, 2024

Code to our ICML 2025 Paper "Calibrated Language Models and How to Find Them with Label Smoothing"

Python 3 Updated Oct 21, 2025
Python 17 Updated May 31, 2023

Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)

Python 42 5 Updated May 12, 2025

Deep Reinforcement Learning for Continuous Control in PyTorch

Python 105 15 Updated Dec 31, 2021

A general model-free off-policy actor-critic implementation. Continuous and Discrete Soft Actor-Critic with multimodal observations, data augmentation, offline learning and behavioral cloning.

Python 39 6 Updated Mar 15, 2024

Advantage-Filtered Behavioral Cloning for Offline Continuous Control

Python 4 1 Updated Dec 5, 2021

Pocket Flow: 100-line LLM framework. Let Agents build Agents!

Python 8,777 987 Updated Aug 13, 2025

Enhancing LLMs with LoRA

Jupyter Notebook 174 13 Updated Oct 20, 2025

The Hunspell binding for NodeJS that exposes as much of Hunspell as possible and also adds new features. Hunspell is a first class spellcheck library used by Google, Apple, and Mozilla.

C++ 291 42 Updated Apr 15, 2023

Naive linter for English prose

JavaScript 5,037 190 Updated Mar 10, 2025

Simple text proofreader based on 'write-good' (hemingway-app-like suggestions) and 'nodehun' (spelling).

JavaScript 341 20 Updated Apr 6, 2018

Write Like Hemingway

Python 12 Updated Nov 28, 2014

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,669 440 Updated Nov 4, 2025

A series of technical report on Slow Thinking with LLM

Python 744 41 Updated Aug 13, 2025
Python 31 3 Updated Feb 7, 2025

Better Markdown Parser in PHP

PHP 14,978 1,144 Updated Aug 31, 2025

Simpler Database Intaractions in PHP

PHP 266 16 Updated Jan 5, 2024

Write programs like message passing graphs and get parallelism for free. Statically typed and compiled to machine code!

Go 1,042 40 Updated Nov 2, 2025

JavaScript framework for visual programming

TypeScript 10,916 681 Updated Sep 1, 2025

[ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.

Python 56 Updated Jul 21, 2025
Next