giganttheo



Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 13,894 1,007 Updated Apr 15, 2026

The agent that grows with you

Python 89,328 12,201 Updated Apr 15, 2026

DeMo: Decoupled Momentum Optimization

Python 198 10 Updated Dec 2, 2024

NanoGPT (124M) in 2 minutes

Python 5,096 709 Updated Apr 13, 2026

Probabilistic programming with large language models

Python 167 28 Updated Apr 9, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,809 482 Updated Oct 27, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,589 207 Updated Mar 17, 2026

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 846 93 Updated Jan 28, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,909 146 Updated Jan 9, 2026

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Python 86 13 Updated Oct 18, 2023

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,668 221 Updated Apr 13, 2026

Fully open reproduction of DeepSeek-R1

Python 25,988 2,415 Updated Apr 2, 2026

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,271 82 Updated Apr 11, 2026

Recipes to scale inference-time compute of open models

Python 1,130 130 Updated Apr 2, 2026

Python toolbox for sampling Determinantal Point Processes

Python 238 57 Updated Aug 14, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,839 469 Updated Oct 14, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 755 51 Updated Sep 27, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 4,042 679 Updated Apr 10, 2026

Python 13 Updated Sep 27, 2022

Everything about the SmolLM and SmolVLM family of models

Python 3,710 287 Updated Apr 2, 2026

A family of lightweight multimodal models.

Python 1,053 76 Updated Nov 18, 2024

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Kotlin 7,945 986 Updated Apr 14, 2026

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 372 42 Updated Feb 28, 2026

A Framework of Small-scale Large Multimodal Models

Python 976 99 Updated Apr 11, 2026

An open-source implementation for training LLaVA-NeXT.

Python 437 23 Updated Oct 23, 2024

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 151 16 Updated Oct 22, 2022

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 22 1 Updated Jun 5, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,087 585 Updated Apr 24, 2024

2 Updated Dec 2, 2025

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,879 70 Updated Jun 22, 2025