Humane · Atlanta, Georgia · @ericlewisplease
Stars
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
You like pytorch? You like micrograd? You love tinygrad! ❤️
A framework for unified personalized model, achieving mutual enhancement between personalized understanding and generation. Demonstrating the potential of cross-task information transfer in persona…
Research implementation to investigate methods of integrating the speech modality into pre-trained language models
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities.
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
A paper list of recent works on token compression for ViTs and VLMs
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Reading list for research topics in multimodal machine learning
PyTorch-native distributed training library for LLMs/VLMs with out-of-the-box Hugging Face support
BLIP-2 implementation for training vision-language models. Q-Former + frozen encoders + any LLM. Colab-ready notebooks with MoE variant.
An API-compatible, drop-in replacement for Apple's Foundation Models framework with support for custom language model providers.
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
A Framework of Small-scale Large Multimodal Models
Studying the effect of different connectors (linear, MLP, and cross-attention) to analyze which paradigms LLMs use, or make a best guess; see the connector sketch after this list.
A curated list of vision-and-language pre-training (VLP). :-)
Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence
MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.
Turn Apple's CVPR-25 FastVLM encoder into a reproducible baseline for mobile apps. First complete implementation achieving <250ms multimodal inference on iPhone.
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025
Fully Open Framework for Democratized Multimodal Reinforcement Learning.
Famous Vision Language Models and Their Architectures
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…
The Paper List of Large Multi-Modality Model (Perception, Generation, Unification), Parameter-Efficient Finetuning, Vision-Language Pretraining, Conventional Image-Text Matching for Preliminary Ins…
A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
Open CoreUI - Open WebUI rewritten in Rust, significantly reducing memory and resource usage, requiring no dependency services or Docker, with both a server version and a Tauri-based desktop cli…
Unified LLM orchestration and gateway service for DGX Spark — dynamically manages vLLM, SGLang, and TensorRT-LLM backends under a single OpenAI-compatible API.
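As a companion to the connector-study entry above, here is a minimal, hypothetical PyTorch sketch of the three projector paradigms it names (linear, MLP, and cross-attention / Q-Former-style resampler). It is not taken from any of the listed repositories; all dimensions, class names, and hyperparameters are illustrative assumptions.

# Minimal sketch of three vision-to-LLM connector paradigms.
# All sizes and names are assumptions, not from any listed repo.
import torch
import torch.nn as nn

VIS_DIM, LLM_DIM, NUM_QUERIES = 1024, 4096, 64  # assumed dimensions

class LinearConnector(nn.Module):
    """Single linear projection from vision features to LLM embedding space."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Linear(VIS_DIM, LLM_DIM)

    def forward(self, vis_tokens):              # (B, N, VIS_DIM)
        return self.proj(vis_tokens)             # (B, N, LLM_DIM)

class MLPConnector(nn.Module):
    """Two-layer MLP projector (LLaVA-1.5-style)."""
    def __init__(self):
        super().__init__()
        self.proj = nn.Sequential(
            nn.Linear(VIS_DIM, LLM_DIM), nn.GELU(), nn.Linear(LLM_DIM, LLM_DIM)
        )

    def forward(self, vis_tokens):
        return self.proj(vis_tokens)

class CrossAttentionConnector(nn.Module):
    """Learnable queries attend to vision tokens (Q-Former / resampler style)."""
    def __init__(self):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(NUM_QUERIES, LLM_DIM) * 0.02)
        self.kv_proj = nn.Linear(VIS_DIM, LLM_DIM)
        self.attn = nn.MultiheadAttention(LLM_DIM, num_heads=8, batch_first=True)

    def forward(self, vis_tokens):               # (B, N, VIS_DIM)
        kv = self.kv_proj(vis_tokens)             # (B, N, LLM_DIM)
        q = self.queries.unsqueeze(0).expand(vis_tokens.size(0), -1, -1)
        out, _ = self.attn(q, kv, kv)             # (B, NUM_QUERIES, LLM_DIM)
        return out

if __name__ == "__main__":
    feats = torch.randn(2, 256, VIS_DIM)          # fake ViT patch features
    for conn in (LinearConnector(), MLPConnector(), CrossAttentionConnector()):
        print(type(conn).__name__, conn(feats).shape)

The practical trade-off the study explores: linear and MLP connectors keep every visual token (cheap to train, long LLM context), while the cross-attention resampler compresses N patch tokens down to a fixed NUM_QUERIES, trading some detail for shorter sequences.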