Starred repositories
A complete computer science study plan to become a software engineer.
All Algorithms implemented in Python
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
⚡ Dynamically generated stats for your github readmes
A latent text-to-image diffusion model
A playbook for systematically maximizing the performance of deep learning models.
A modified web browser that helps in responsive web development. A web developer's must have dev-tool.
A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
This repository contains the source code for the paper First Order Motion Model for Image Animation
📋 A list of open LLMs available for commercial use.
This repository contains demos I made with the Transformers library by HuggingFace.
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Deep Learning Specialization by Andrew Ng on Coursera.
Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"
🔥 Stay motivated and show off your contribution streak! 🌟 Display your total contributions, current streak, and longest streak on your GitHub profile README
A scikit-learn compatible neural network library that wraps PyTorch
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
List of Research Internships for Undergraduate Students
Go to https://github.com/pytorch/tutorials - this repo is deprecated and no longer maintained
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
MapAnything: Universal Feed-Forward Metric 3D Reconstruction