Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
Making large AI models cheaper, faster and more accessible
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Janus-Series: Unified Multimodal Understanding and Generation Models
End-to-End Object Detection with Transformers
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Fully autonomous & self-evolving research from idea to paper. Chat an Idea. Get a Paper. 🦞
a state-of-the-art-level open visual language model | 多模态预训练模型
PyTorch implementations of deep reinforcement learning algorithms and environments
3D ResNets for Action Recognition (CVPR 2018)
A collection of loss functions for medical image segmentation
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…
Pytorch framework for doing deep learning on point clouds.
Unsupervised Learning for Image Registration
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
A PyTorch Library for Multi-Task Learning
Many studies have shown that the performance on deep learning is significantly affected by volume of training data. The MedicalNet project provides a series of 3D-ResNet pre-trained models and rela…
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
PyTorch implementation of Contrastive Learning methods
The largest open-source medical AI skills library for OpenClaw🦞.
PyTorch implementation of MoCo v3 https//arxiv.org/abs/2104.02057
Implementations of recent research prototypes/demonstrations using MONAI.