giganttheo

théo gigant giganttheo

research scientist @ nous research

12 followers · 8 following

Achievements

Highlights

Pro

Lists (1)

🔮 Future ideas

Stars

zotero / zotero

Zotero is a free, easy-to-use tool to help you collect, organize, annotate, cite, and share your research sources.

JavaScript 13,900 1,008 Updated Apr 16, 2026

NousResearch / hermes-agent

The agent that grows with you

Python 93,542 13,028 Updated Apr 16, 2026

bloc97 / DeMo

DeMo: Decoupled Momentum Optimization

Python 199 10 Updated Dec 2, 2024

KellerJordan / modded-nanogpt

NanoGPT (124M) in 2 minutes

Python 5,100 709 Updated Apr 13, 2026

genlm / llamppl

Probabilistic programming with large language models

Python 167 28 Updated Apr 9, 2026

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,814 482 Updated Oct 27, 2025

aigc3d / LHM

[ICCV2025] LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds

Python 2,589 207 Updated Mar 17, 2026

AnswerDotAI / byaldi

Use late-interaction multi-modal models such as ColPali in just a few lines of code.

Python 846 93 Updated Jan 28, 2025

merveenoyan / smol-vision

Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜

Jupyter Notebook 1,909 146 Updated Jan 9, 2026

timinar / BabyLlama

Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.

Python 86 13 Updated Oct 18, 2023

roboflow / maestro

Streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL

Python 2,668 221 Updated Apr 13, 2026

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,991 2,414 Updated Apr 2, 2026

segment-any-text / wtpsplit

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,271 82 Updated Apr 11, 2026

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,130 130 Updated Apr 2, 2026

guilgautier / DPPy

Python toolbox for sampling Determinantal Point Processes

Python 238 57 Updated Aug 14, 2024

stanford-futuredata / ColBERT

ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)

Python 3,841 470 Updated Oct 14, 2025

jzhang38 / EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 755 51 Updated Sep 27, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks

Python 4,049 678 Updated Apr 10, 2026

yumoxu / oreo

Python 13 Updated Sep 27, 2022

huggingface / smollm

Everything about the SmolLM and SmolVLM family of models

Python 3,712 287 Updated Apr 2, 2026

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 1,053 76 Updated Nov 18, 2024

NexaAI / nexa-sdk

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Kotlin 7,950 987 Updated Apr 14, 2026

zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 372 42 Updated Feb 28, 2026

TinyLLaVA / TinyLLaVA_Factory

A Framework of Small-scale Large Multimodal Models

Python 976 99 Updated Apr 16, 2026

xiaoachen98 / Open-LLaVA-NeXT

An open-source implementation for training LLaVA-NeXT.

Python 437 23 Updated Oct 23, 2024

danieldeutsch / sacrerouge

SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.

Python 151 16 Updated Oct 22, 2022

juvi21 / CoPE-cuda

Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719

Python 22 1 Updated Jun 5, 2024

deepseek-ai / DeepSeek-VL

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 4,088 585 Updated Apr 24, 2024

hjzhuang / MSMO-Eval

2 Updated Dec 2, 2025

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,879 70 Updated Jun 22, 2025