Highlights
- Pro
Stars
[ICML 2026] Official Implementation of "See What Matters: Differentiable Grid Sample Pruning for Generalizable Vision-Language-Action Model"
[DEIMv2] Real Time Object Detection Meets DINOv3
SGLang is a high-performance serving framework for large language models and multimodal models.
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
OmX - Oh My codeX: Your codex is not alone. Add hooks, agent teams, HUDs, and so much more.
Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞
[ICLR 2026] Glance and Focus Reinforcement for Pan-cancer Screening
A Transparent Generalist Model towards Holistic Medical Vision-Language Understanding
MedEvalKit: A Unified Medical Evaluation Framework
【ICML 2025 Spotlight】 Official Repo for Paper ‘’HealthGPT : A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation‘’
🧠「大模型」2小时完全从0训练64M的小参数LLM!Train a 64M-parameter LLM from scratch in just 2h!
Apache ECharts is a powerful, interactive charting and data visualization library for browser
[COLING22] An End-to-End Library for Evaluating Natural Language Generation
Code for "DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation" (ACL 2024)
Evaluation code for various unsupervised automated metrics for Natural Language Generation.
[EMNLP 2024] RaTEScore: A Metric for Radiology Report Generation
Python Implementation of Apriori Algorithm for finding Frequent sets and Association Rules
Reference PyTorch implementation and models for DINOv3
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection
A powerful tool for creating datasets for LLM fine-tuning 、RAG and Eval
(AAAI-2025 oral) LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts
Official Code for "Surgformer++: Revisiting and Enhancing End-to-End Spatiotemporal Modeling for Surgical Phase Recognition"
A curated list of awesome neuroscience libraries, software and any content related to the domain.
TradingAgents: Multi-Agents LLM Financial Trading Framework