Stars
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
ConceptAttention: A method for interpreting multi-modal diffusion transformers.
Learn how to design, develop, deploy and iterate on production-grade ML applications.
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Schedule-Free Optimization in PyTorch
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…
The merlin dataloader lets you rapidly load tabular data for training deep leaning models with TensorFlow, PyTorch or JAX
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
[TPAMI 2019] The implementation for "Direction Concentration Learning: Enhancing Congruency in Machine Learning"
Official implementation of "DCT-Net: Domain-Calibrated Translation for Portrait Stylization", SIGGRAPH 2022 (TOG); Multi-style cartoonization
mkdocs + material + cool stuff
⚡ A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is …
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Project page of the paper "Learning Multi-Scale Photo Exposure Correction" (CVPR 2021).
geomagical / lama-with-refiner
Forked from advimman/lama🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Python implementation of "A New Image Contrast Enhancement Algorithm Using Exposure Fusion Framework", CAIP2017
The official repository for "Intermediate Layers Matter in Momentum Contrastive Self Supervised Learning" paper.
Build scripts and configuration for building CPython for Emscripten
Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation, NeurIPS 2021 Spotlight
🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022
Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.
A curated list of awesome data labeling tools