View giganttheo's full-sized avatar


Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 13,847 1,002 Updated Apr 9, 2026

The agent that grows with you

Python 43,181 5,515 Updated Apr 9, 2026

DeMo: Decoupled Momentum Optimization

Python 198 10 Updated Dec 2, 2024

NanoGPT (124M) in 2 minutes

Python 5,071 702 Updated Mar 29, 2026

Probabilistic programming with large language models

Python 165 27 Updated Apr 9, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,790 480 Updated Oct 27, 2025

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,588 207 Updated Mar 17, 2026

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 845 93 Updated Jan 28, 2025

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,907 146 Updated Jan 9, 2026

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Python 85 13 Updated Oct 18, 2023

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,667 221 Updated Apr 6, 2026

Fully open reproduction of DeepSeek-R1

Python 25,971 2,411 Updated Apr 2, 2026

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,270 82 Updated Apr 7, 2026

Recipes to scale inference-time compute of open models

Python 1,131 130 Updated Apr 2, 2026

Python toolbox for sampling Determinantal Point Processes

Python 238 57 Updated Aug 14, 2024

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,826 469 Updated Oct 14, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 755 51 Updated Sep 27, 2024

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4,016 674 Updated Apr 9, 2026
Python 13 Updated Sep 27, 2022

Everything about the SmolLM and SmolVLM family of models

Python 3,702 285 Updated Apr 2, 2026

A family of lightweight multimodal models.

Python 1,053 76 Updated Nov 18, 2024

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Kotlin 7,922 978 Updated Feb 26, 2026

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 371 42 Updated Feb 28, 2026

A Framework of Small-scale Large Multimodal Models

Python 975 99 Updated Mar 29, 2026

An open-source implementation for training LLaVA-NeXT.

Python 436 23 Updated Oct 23, 2024

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 151 16 Updated Oct 22, 2022

Contextual Position Encoding, but with some custom CUDA kernels: https://arxiv.org/abs/2405.18719

Python 22 1 Updated Jun 5, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,086 586 Updated Apr 24, 2024
2 Updated Dec 2, 2025

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,877 69 Updated Jun 22, 2025