- Carnegie Mellon University
- Pittsburgh, PA, USA
Stars
HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Awesome Search: all about search (e-commerce, but not only) and its awesomeness
Full text search that feels like a numpy array
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Generative Representational Instruction Tuning
Build and share delightful machine learning apps, all in Python. Star to support our work!
A Pythonic framework to simplify AI service building
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
An industrial deep learning framework for high-dimension sparse data
A high performance and generic framework for distributed DNN training
Navigating Spreading-out Graph For Approximate Nearest Neighbor Search
2020 MIND news recommendation first place solution
Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Alphabetical list of free/public domain datasets with text data for use in Natural Language Processing (NLP)
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
Distributed Peer-to-Peer Web Search Engine and Intranet Search Appliance
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index (DenSPI)
TextAttack is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
A comprehensive list of awesome contrastive self-supervised learning papers.
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Label Studio is a multi-type data labeling and annotation tool with standardized output format
60+ Implementations/tutorials of deep learning papers with side-by-side notes; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.
Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"