- Warsaw / San Francisco
- mirkowski.dev
- in/franek-mirkowski-a0abb3330
Highlights
- Pro
Stars
PyTorch implementations of Generative Adversarial Networks.
Hierarchical Reasoning Model Official Release
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
The main repo for NLWeb, implemented in Python.
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
On the Variance of the Adaptive Learning Rate and Beyond
HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation
Official Repository of Absolute Zero Reasoner
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Code for the paper "Exploration by Random Network Distillation"
Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Telemetry & Runtime • Run real-world MCPs powering AI Agents
source code for the ECCV18 paper A Style-Aware Content Loss for Real-time HD Style Transfer
Large-scale text-video dataset. 10 million captioned short videos.
A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.
[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation
🗜️ Codebase-digest is your AI-friendly codebase packer and analyzer. Features 60+ coding prompts and generates structured overviews with metrics. Ideal for feeding projects to LLMs like GPT-4, Clau…
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
Scikit-learn compatible library for molecular fingerprints and chemoinformatics
[NeurIPS 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
[NeurIPS 2025] Hybrid Latent Reasoning via Reinforcement Learning
[NeurIPS 2024] VFIMamba: Video Frame Interpolation with State Space Models