Skip to content
View matejgj's full-sized avatar

Block or report matejgj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Sharp Monocular View Synthesis in Less Than a Second

Python 3,119 188 Updated Dec 19, 2025

This repository allows reproduction of Poetiq's record-breaking submission to the ARC-AGI-1 and ARC-AGI-2 benchmarks.

Python 850 164 Updated Dec 16, 2025

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,034 672 Updated Dec 17, 2025

An intelligent load balancer for LM Studio that distributes requests across multiple loaded language models, optimizing resource utilization and response times.

JavaScript 18 2 Updated Oct 13, 2025

SemEval2026 Task 3 DimABSA

Python 19 7 Updated Dec 15, 2025

Agentic Design Patterns: A Hands-On Guide to Building Intelligent Systems by Antonio Gulli

Jupyter Notebook 5,471 1,075 Updated Sep 7, 2025
Svelte 1,805 218 Updated Dec 16, 2025

🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.

Python 468 35 Updated Dec 19, 2025

A lightweight LMM-based Document Parsing Model

Python 6,374 441 Updated Dec 8, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 16,220 1,252 Updated Dec 12, 2025

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,367 162 Updated Nov 5, 2025

Multilingual Document Layout Parsing in a Single Vision-Language Model

Python 5,888 578 Updated Oct 31, 2025

OCR model that handles complex tables, forms, handwriting with full layout.

Python 3,624 399 Updated Dec 19, 2025

A Python Notebook working with Mistral's API to process a PDF document into an accessible HTML document

Jupyter Notebook 1 Updated Mar 10, 2025

A Dockerized python Script to fetch Garmin health data and populate that in a InfluxDB Database, for visualization long term health trends with Grafana

Python 2,476 147 Updated Dec 10, 2025

RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.

Python 5,019 501 Updated Dec 17, 2025

Evolution Pretraining Fully in Int Formats

Python 131 11 Updated Dec 12, 2025

The Semantic Infrastructure for AI Apps

Python 818 68 Updated Dec 18, 2025

📑 PageIndex: Document Index for Reasoning-based RAG

Jupyter Notebook 4,321 338 Updated Dec 19, 2025

Tensorlake is a Document Ingestion API and a serverless platform for building data processing and orchestration APIs

Python 860 126 Updated Dec 18, 2025

Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages

Python 2,482 213 Updated Dec 16, 2025

PipesHub is a fully extensible and explainable workplace AI platform for enterprise search and workflow automation

Python 2,326 351 Updated Dec 18, 2025
Jupyter Notebook 14 2 Updated Aug 30, 2025

Video-based AI memory library. Store millions of text chunks in MP4 files with lightning-fast semantic search. No database needed.

Python 10,497 897 Updated Oct 12, 2025

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,600 4,089 Updated Dec 18, 2025

Contexts Optical Compression

Python 21,483 1,920 Updated Oct 25, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,762 1,889 Updated Dec 11, 2025

Analytics, Versioning and ETL for multimodal data: video, audio, PDFs, images

Python 2,716 132 Updated Dec 19, 2025
Next