Skip to content
View oplatek's full-sized avatar

Highlights

  • Pro

Organizations

@UFAL-DSG @ufal

Block or report oplatek

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
1087 results for source starred repositories
Clear filter

The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.

Python 8,073 677 Updated Dec 17, 2025

A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.

Python 2,323 203 Updated Dec 23, 2025

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python 19,173 1,341 Updated Nov 27, 2025

Retrieval and Retrieval-augmented LLMs

Python 11,045 817 Updated Dec 15, 2025

A VIM-inspired filemanager for the console

Python 16,754 919 Updated Nov 14, 2025

Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search

Go 41,907 3,737 Updated Dec 25, 2025

Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis

Python 47 7 Updated Sep 20, 2025
Python 88 11 Updated Dec 7, 2025

Codebase for FinePDFs

Python 156 26 Updated Nov 5, 2025
Python 156 13 Updated Oct 31, 2025

Fully automatic censorship removal for language models

Python 3,971 379 Updated Dec 22, 2025

Unified Schema-Based Information Extraction

Python 398 41 Updated Dec 19, 2025

A cosy home for your LLMs.

Swift 689 26 Updated Dec 22, 2025

Dialog2Flow: convert your dialogs to flows. This repository accompanies the paper "Dialog2Flow: Pre-training Soft-Contrastive Sentence Embeddings for Automatic Dialog Flow Extraction", accepted to …

Python 17 1 Updated Jul 1, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on Hopper, Ada and Blackwell GPUs, to provide better performance…

Python 3,031 588 Updated Dec 22, 2025

GPU-optimized framework for training diffusion language models at any scale. The backend of Quokka, Super Data Learners, and OpenMoE 2 training.

Python 304 25 Updated Nov 11, 2025

The official github repo for "Diffusion Language Models are Super Data Learners".

Python 215 8 Updated Nov 6, 2025

An interface library for RL post training with environments.

Python 859 138 Updated Dec 24, 2025

Samples for CUDA Developers which demonstrates features in CUDA Toolkit

C 8,605 2,227 Updated Sep 5, 2025

My own repository containing the codes I wrote to practice CUDA programming.

Jupyter Notebook 64 9 Updated Jul 17, 2023

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 607 52 Updated Oct 29, 2025

The AT Protocol (🦋 Bluesky) SDK for Python 🐍

Python 630 77 Updated Dec 8, 2025

Speed up model training by fixing data loading.

Python 566 81 Updated Dec 15, 2025

pytorch notebook for implemention for cut-cross-entropy LLM training.

Jupyter Notebook 8 Updated Dec 23, 2024

A family of open-sourced Mixture-of-Experts (MoE) Large Language Models

Python 1,645 83 Updated Mar 8, 2024

LLM training in simple, raw C/CUDA

Cuda 28,460 3,338 Updated Jun 26, 2025

Our library for RL environments + evals

Python 3,661 456 Updated Dec 24, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,432 325 Updated Nov 13, 2024

Hydra is a framework for elegantly configuring complex applications

Python 10,061 765 Updated Dec 11, 2025

RLP: Reinforcement as a Pretraining Objective

218 13 Updated Oct 5, 2025
Next