kdexd

Organizations

@batra-mlp-lab @mdgspace @Cloud-CV @redcaps-dataset

An image-to-world skillset for Claude.

TypeScript 4,574 461 Updated May 15, 2026

Minimal Claude Code alternative. Single Python file, zero dependencies, ~250 lines.

Python 2,424 235 Updated Jan 14, 2026

✨ An advanced 3D Gaussian Splatting renderer for THREE.js

TypeScript 3,189 342 Updated Jun 10, 2026

Improved sampling via learned diffusions (ICLR2024) and an optimal control perspective on diffusion-based generative modeling (TMLR2024)

Python 75 10 Updated Mar 14, 2025

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 5,559 420 Updated Apr 21, 2025

Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…

Rust 6,648 713 Updated Jun 14, 2026

Official inference repo for FLUX.1 models

Python 25,620 1,892 Updated Jul 31, 2025

Utilities intended for use with Llama models.

Python 7,629 1,388 Updated Feb 11, 2026

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 2,004 138 Updated Nov 7, 2025

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 4,424 375 Updated Oct 19, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,231 1,284 Updated May 23, 2024

The official Meta Llama 3 GitHub site

Python 29,280 3,525 Updated Jan 26, 2025

Lightplane implements a highly memory-efficient differentiable radiance field renderer, and a module for unprojecting features from images to 3D grids.

Python 285 9 Updated Aug 6, 2024

A PyTorch native platform for training generative AI models

Python 5,436 860 Updated Jun 14, 2026

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Python 321 26 Updated Dec 9, 2023

Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.

Python 104 2 Updated Mar 23, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,840 77 Updated Nov 27, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,570 1,063 Updated Jul 1, 2024

Fast bare-bones BPE for modern tokenizer training

Python 179 6 Updated Jun 23, 2025

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 18,481 1,506 Updated May 24, 2026

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 3,085 274 Updated May 26, 2026

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

1,108 47 Updated Sep 27, 2024

This repository provides the code and model checkpoints for AIMv1 and AIMv2 research projects.

Python 1,420 73 Updated Aug 4, 2025

Building blocks for foundation models.

627 27 Updated Jan 3, 2024

MLX: An array framework for Apple silicon

C++ 26,975 1,903 Updated Jun 13, 2026

A batched offline inference oriented version of segment-anything

Python 1,320 80 Updated Aug 22, 2025

Code for the paper "Visual Anagrams: Generating Multi-View Optical Illusions with Diffusion Models"

Jupyter Notebook 966 100 Updated May 3, 2026

The Modular Platform (includes MAX & Mojo)

Mojo 26,338 2,839 Updated Jun 14, 2026

Fast Implementation of Generalised Geodesic Distance Transform for CPU (OpenMP) and GPU (CUDA)

C++ 107 17 Updated Mar 12, 2026

Code for the paper "Hyperbolic Image-Text Representations", Desai et al, ICML 2023

Python 204 25 Updated Aug 23, 2023