264 results for source starred repositories

NanoGPT (124M) in 3 minutes

Python 3,969 520 Updated Dec 17, 2025

Probabilistic programming with large language models

Python 154 24 Updated Nov 18, 2025

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,414 428 Updated Oct 27, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,507 196 Updated Jul 15, 2025

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 834 96 Updated Jan 28, 2025

Recipes for shrinking, optimizing, and customizing cutting-edge vision models. 💜

Jupyter Notebook 1,810 138 Updated Oct 27, 2025

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Python 85 13 Updated Oct 18, 2023

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,647 220 Updated Dec 15, 2025

Fully open reproduction of DeepSeek-R1

Python 25,737 2,405 Updated Nov 24, 2025

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,210 75 Updated Dec 5, 2025

Recipes to scale inference-time compute of open models

Python 1,120 131 Updated May 22, 2025

Python toolbox for sampling Determinantal Point Processes

Python 236 57 Updated Aug 14, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,741 462 Updated Oct 14, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 751 52 Updated Sep 27, 2024

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,551 591 Updated Dec 17, 2025

Python 13 Updated Sep 27, 2022

Everything about the SmolLM and SmolVLM family of models

Python 3,454 243 Updated Nov 20, 2025

A family of lightweight multimodal models.

Python 1,049 75 Updated Nov 18, 2024

Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.

Go 6,241 822 Updated Dec 19, 2025

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 359 41 Updated Dec 11, 2025

A Framework of Small-scale Large Multimodal Models

Python 939 95 Updated Apr 26, 2025

An open-source implementation for training LLaVA-NeXT.

Python 427 22 Updated Oct 23, 2024

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 148 16 Updated Oct 22, 2022

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 22 1 Updated Jun 5, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,026 580 Updated Apr 24, 2024

2 Updated Dec 2, 2025

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,830 68 Updated Jun 22, 2025

Optimized Deep Learning training course for Jean Zay

HTML 17 4 Updated Oct 20, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,431 325 Updated Nov 13, 2024

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,392 224 Updated Dec 18, 2025