Highlights
- Pro
Stars
S-Chain: Structured Visual Chain-of-Thought For Medicine
A collection of vision-language-action model post-training methods.
[NeurIPS 2025] ExGra-Med: Medical Multi-Modal LLM with Extended Context Alignment
ViDRiP-LLaVA: A Dataset and Benchmark for Diagnostic Reasoning from Pathology Videos
A collection of sample agents built with Agent Development Kit (ADK)
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.
Awesome papers & datasets specifically focused on long-term videos.
Instruction tuning dataset generation inspired by LLaVA-Instruct-158k via any LLM, also for commercial use.
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…
Code associated to the publication: Scaling self-supervised learning for histopathology with masked image modeling, A. Filiot et al., MedRxiv (2023). We publicly release Phikon 🚀
Official Repository of NeurIPS 2023 - MedFM Challenge
[NeurIPS 2023 Oral] Quilt-1M: One Million Image-Text Pairs for Histopathology.
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
This Repository includes DGL tutorials and various information related to graph neural networks.
KBSMC colon cancer grading dataset repository
ECCV-MCV2022 paper: "IMPaSh: A Novel Domain-shift Resistant Representation for Colorectal Cancer Tissue Classification"
A python library for self-supervised learning on images.
Paper bank for Self-Supervised Learning
PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057
Creating bot to play Chrome Dinosaur game, with GA and hardcode version
📺 Discover the latest machine learning / AI courses on YouTube.
Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding b…
Collection of leetcode company tag problems. Periodically updating.
NLP 101: a resource repository for Deep Learning and Natural Language Processing
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"