Stars
Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Tools for merging pretrained large language models.
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
The simplest, fastest repository for training/finetuning small-sized VLMs.
A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 37.3% tasks (pass@1) in SWE-bench lite and 46.2% tasks (pass@1) in SWE-bench verified with…
Image to prompt with BLIP and CLIP
[CVPR 2025 Highlight] Real-time dense scene reconstruction with SLAM3R
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/l…
An easy-to-use Python framework to generate adversarial jailbreak prompts.
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
StableDelight: Revealing Hidden Textures by Removing Specular Reflections
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025))
实现暗通道去雾算法 Realizing 'Single Image Haze Removal Using Dark Channel Prior'
This repository contains the architectures, Models, logs, etc pertaining to the SimpleNet Paper (Lets keep it simple: Using simple architectures to outperform deeper architectures )
This repo is for Amazon ML Challenge 2024. The challenge was to develop a Machine Learning model to extract product details directly from the product images.
Fine tune Gemma 3 on an object detection task