ModelPool VLM detection optimization #61

sxy-trans-n · 2025-07-24T05:57:22Z

Performance & Bug Fix: ModelPool Optimization and VLM Detection Improvements

Changes

Performance: Unified VLM detection logic to eliminate redundant getContainer() calls
Bug Fix: Added special handling for Gemma DWQ: these quantized models lose vision capabilities and are now treated as LLMs
Feature: Added extraEOSTokens support to work around QAT tokenization bugs (e.g., <end_of_turn> in Gemma models)
Feature: Added support for Gemma 3n - Text Only (LM)
Optimization: Cached model type detection to avoid repeated registry lookups
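
To illustrate the detection changes listed above, here is a minimal sketch of how cached type detection with a Gemma DWQ override might look. It is written in Python purely for illustration and is not the code from this PR: `ModelTypeDetector`, `ModelKind`, `registry_lookup`, and the name-based DWQ check are all hypothetical stand-ins.

```python
from enum import Enum
from typing import Callable, Dict


class ModelKind(Enum):
    LLM = "llm"
    VLM = "vlm"


class ModelTypeDetector:
    """Hypothetical helper: resolves and caches whether a model id is an LLM or a VLM."""

    def __init__(self, registry_lookup: Callable[[str], bool]) -> None:
        # registry_lookup stands in for the expensive registry query (assumption);
        # it returns True when the registry lists the model as vision-capable.
        self._registry_lookup = registry_lookup
        self._cache: Dict[str, ModelKind] = {}
        self.lookups = 0  # counts real registry hits, for demonstration only

    def detect(self, model_id: str) -> ModelKind:
        # Serve repeated requests from the cache instead of hitting the registry.
        cached = self._cache.get(model_id)
        if cached is not None:
            return cached
        kind = self._detect_uncached(model_id)
        self._cache[model_id] = kind
        return kind

    def _detect_uncached(self, model_id: str) -> ModelKind:
        name = model_id.lower()
        # Assumed naming heuristic: Gemma DWQ builds are treated as text-only,
        # so the registry is never consulted for them.
        if "gemma" in name and "dwq" in name:
            return ModelKind.LLM
        self.lookups += 1
        return ModelKind.VLM if self._registry_lookup(model_id) else ModelKind.LLM
```

The point of the design is that a single detection entry point owns both the special case and the cache, so callers never need to repeat the lookup themselves.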

Testing

✅ All 44 unit tests pass
✅ Release build completes successfully (107.26s)
✅ CI validation completed

Impact

Faster model loading through reduced redundant operations
Correct model type detection for quantized variants
Better tokenization handling for models with EOS token bugs
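
As a rough illustration of the EOS handling mentioned above, the sketch below shows one way extra end-of-sequence tokens could be merged into the stop set used during generation. It is an assumption-laden Python example, not the project's implementation: `build_stop_ids`, `take_until_stop`, the vocabulary, and the token ids are all hypothetical.

```python
from typing import Dict, Iterable, Iterator, Set


def build_stop_ids(
    eos_token_id: int,
    extra_eos_tokens: Iterable[str],
    vocab: Dict[str, int],
) -> Set[int]:
    """Combine the tokenizer's EOS id with ids of any extra EOS token strings."""
    stop = {eos_token_id}
    for tok in extra_eos_tokens:
        tid = vocab.get(tok)
        # Tokens missing from the vocabulary are skipped rather than raising.
        if tid is not None:
            stop.add(tid)
    return stop


def take_until_stop(token_ids: Iterable[int], stop_ids: Set[int]) -> Iterator[int]:
    """Yield generated token ids until the first stop token (which is not yielded)."""
    for tid in token_ids:
        if tid in stop_ids:
            return
        yield tid
```

With a toy vocabulary such as `{"<eos>": 1, "<end_of_turn>": 106}`, passing `["<end_of_turn>"]` as the extra tokens makes generation stop at id 106 even though the tokenizer only declares id 1 as EOS.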

syh-trans-n

👏

sxy-trans-n added 2 commits July 24, 2025 14:37

update dep

458050e

feat: optimize ModelPool with VLM detection improvements

0999d41

syh-trans-n approved these changes Jul 24, 2025

sxy-trans-n merged commit 2b76c3a into main Jul 24, 2025
2 checks passed

sxy-trans-n deleted the modelpool-vlm-detection-optimization branch July 24, 2025 06:28
