-
FAIR @ Meta
- Paris
- angelvillarcorrales.com
- @angelvillar96
Highlights
- Pro
Stars
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials,…
Python sample codes and textbook for robotics algorithms.
Open-Sora: Democratizing Efficient Video Production for All
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
Deezer source separation library including pretrained models.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Lets make video diffusion practical!
End-to-End Object Detection with Transformers
Convert Machine Learning Code Between Frameworks
An open source implementation of CLIP.
Minimal reproduction of DeepSeek R1-Zero
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Code for the paper "Jukebox: A Generative Model for Music"
A faster pytorch implementation of faster r-cnn
A Collection of Variational Autoencoders (VAE) in PyTorch.
OpenMMLab Pose Estimation Toolbox and Benchmark.
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Benchmarks of approximate nearest neighbor libraries in Python
openvla / openvla
Forked from TRI-ML/prismatic-vlmsOpenVLA: An open-source vision-language-action model for robotic manipulation.