théo gigant giganttheo

research scientist @ nous research

11 followers · 7 following

Highlights

Pro

Lists (1)

🔮 Future ideas

Stars

264 results for source starred repositories

KellerJordan / modded-nanogpt

NanoGPT (124M) in 3 minutes

Python 3,969 520 Updated Dec 17, 2025

genlm / llamppl

Probabilistic programming with large language models

Python 154 24 Updated Nov 18, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,414 428 Updated Oct 27, 2025

aigc3d / LHM

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,507 196 Updated Jul 15, 2025

AnswerDotAI / byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 834 96 Updated Jan 28, 2025

merveenoyan / smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,810 138 Updated Oct 27, 2025

timinar / BabyLlama

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Python 85 13 Updated Oct 18, 2023

roboflow / maestro

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,647 220 Updated Dec 15, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,737 2,405 Updated Nov 24, 2025

segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,210 75 Updated Dec 5, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,120 131 Updated May 22, 2025

guilgautier / DPPy

Python toolbox for sampling Determinantal Point Processes

Python 236 57 Updated Aug 14, 2024

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,741 462 Updated Oct 14, 2025

jzhang38 / EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 751 52 Updated Sep 27, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 3,551 591 Updated Dec 17, 2025

yumoxu / oreo

Python 13 Updated Sep 27, 2022

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,454 243 Updated Nov 20, 2025

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,049 75 Updated Nov 18, 2024

NexaAI / nexa-sdk

Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.

Go 6,241 822 Updated Dec 19, 2025

zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 359 41 Updated Dec 11, 2025

TinyLLaVA / TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python 939 95 Updated Apr 26, 2025

xiaoachen98 / Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

Python 427 22 Updated Oct 23, 2024

danieldeutsch / sacrerouge

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 148 16 Updated Oct 22, 2022

juvi21 / CoPE-cuda

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 22 1 Updated Jun 5, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,026 580 Updated Apr 24, 2024

hjzhuang / MSMO-Eval

2 Updated Dec 2, 2025

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,830 68 Updated Jun 22, 2025

IDRIS-CNRS / DLO-JZ

Training course: Optimized Deep Learning for Jean Zay

HTML 17 4 Updated Oct 20, 2025

xjdr-alt / entropix

Entropy Based Sampling and Parallel CoT Decoding

Python 3,431 325 Updated Nov 13, 2024

illuin-tech / colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 2,392 224 Updated Dec 18, 2025