Skip to content
View oplatek's full-sized avatar

Highlights

  • Pro

Organizations

@UFAL-DSG @ufal

Block or report oplatek

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,050 674 Updated Dec 17, 2025
Python 1,498 108 Updated Dec 18, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 17,640 1,249 Updated Nov 27, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,029 812 Updated Dec 15, 2025

A VIM-inspired filemanager for the console

Python 16,748 919 Updated Nov 14, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 41,774 3,724 Updated Dec 19, 2025

Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis

Python 47 7 Updated Sep 20, 2025

Open Test for BottleCapAI

Python 128 31 Updated Nov 14, 2025
Python 87 11 Updated Dec 7, 2025

Codebase for FinePDFs

Python 156 26 Updated Nov 5, 2025
Python 155 13 Updated Oct 31, 2025

Fully automatic censorship removal for language models

Python 3,917 371 Updated Dec 16, 2025

Unified Schema-Based Information Extraction

Python 388 40 Updated Dec 19, 2025

A cosy home for your LLMs.

Swift 640 23 Updated Dec 19, 2025

Dialog2Flow: convert your dialogs to flows. This repository accompanies the paper "Dialog2Flow: Pre-training Soft-Contrastive Sentence Embeddings for Automatic Dialog Flow Extraction", accepted to …

Python 17 1 Updated Jul 1, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,013 583 Updated Dec 20, 2025

GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.

Python 303 25 Updated Nov 11, 2025

The official github repo for "Diffusion Language Models are Super Data Learners".

Python 212 8 Updated Nov 6, 2025

An interface library for RL post training with environments.

Python 851 135 Updated Dec 19, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,590 2,223 Updated Sep 5, 2025

My own repository containing the codes I wrote to practice CUDA programming.

Jupyter Notebook 64 9 Updated Jul 17, 2023

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 605 51 Updated Oct 29, 2025

The AT Protocol (🦋 Bluesky) SDK for Python 🐍

Python 629 77 Updated Dec 8, 2025

Speed up model training by fixing data loading.

Python 566 81 Updated Dec 15, 2025

pytorch notebook for implemention for cut-cross-entropy LLM training.

Jupyter Notebook 8 Updated Dec 23, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,644 83 Updated Mar 8, 2024

LLM training in simple, raw C/CUDA

Cuda 28,430 3,333 Updated Jun 26, 2025

Our library for RL environments + evals

Python 3,649 454 Updated Dec 20, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,431 325 Updated Nov 13, 2024

Hydra is a framework for elegantly configuring complex applications

Python 10,042 759 Updated Dec 11, 2025
Next