-
xAI
- Palo Alto, CA
- http://ronghanghu.com/
- https://orcid.org/0000-0002-5060-9485
- @RonghangHu
- in/ronghanghu
Stars
An opinionated list of Python frameworks, libraries, tools, and resources.
An Open Source Machine Learning Framework for Everyone
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Microsoft PowerToys is a collection of utilities that supercharge productivity and customization on Windows
Models and examples built with TensorFlow
Protocol Buffers - Google's data interchange format
Making large AI models cheaper, faster and more accessible
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Library for fast text representation and classification.
A curated list of awesome computer vision resources
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
verl: Volcano Engine Reinforcement Learning for LLMs
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Ongoing research training transformer models at scale
End-to-End Object Detection with Transformers
An open source implementation of CLIP.
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy
A PyTorch implementation of the Transformer model in "Attention is All You Need".
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO
Unofficial Python API and agentic skill for Google NotebookLM. Full programmatic access to NotebookLM's features—including capabilities the web UI doesn't expose—via Python, CLI, and AI agents like…
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
Reading list for research topics in multimodal machine learning