travisddavies
🎯 修行 (training)
  • Brisbane, Australia

64 stars written in Python

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 73,221 10,044 Updated Mar 26, 2026

Inference code for Llama models

Python 59,274 9,827 Updated Jan 26, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,185 6,877 Updated Mar 28, 2026

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 32,201 6,671 Updated Sep 30, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 17,000 3,392 Updated Mar 27, 2026

A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations

Python 16,850 1,243 Updated Mar 27, 2026

StyleGAN - Official TensorFlow Implementation

Python 14,413 3,157 Updated Apr 10, 2024

Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.

Python 12,716 1,701 Updated Apr 7, 2025

Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 12,567 1,265 Updated Nov 4, 2025

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 10,396 1,025 Updated Jul 1, 2024

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Python 9,837 1,449 Updated Mar 18, 2026

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …

Python 9,319 1,710 Updated Feb 5, 2026

Fast Segment Anything

Python 8,296 754 Updated Jul 30, 2024

PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

Python 8,254 1,343 Updated Jul 23, 2024

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 7,805 798 Updated Mar 24, 2026

Google AI 2018 BERT pytorch implementation

Python 6,521 1,326 Updated Sep 15, 2023

High-resolution models for human tasks.

Python 5,306 314 Updated Nov 18, 2024

A PyTorch native platform for training generative AI models

Python 5,190 763 Updated Mar 28, 2026

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 5,033 338 Updated Mar 17, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,276 321 Updated Jan 5, 2026

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 3,647 371 Updated Feb 27, 2025

PyTorch code and models for VJEPA2 self-supervised learning from video.

Python 3,458 397 Updated Mar 23, 2026

Efficient vision foundation models for high-resolution generation and perception.

Python 3,273 235 Updated Sep 5, 2025

[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,844 198 Updated Dec 16, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,442 110 Updated Jan 19, 2026

CVNets: A library for training computer vision networks

Python 1,968 252 Updated Oct 30, 2023

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,657 154 Updated Jan 21, 2026

Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.

Python 1,589 261 Updated Jul 31, 2024

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,327 71 Updated Jan 27, 2026