Stars
Slowdown prediction module of Echo: Simulating Distributed Training at Scale
OpenDiLoCo: An Open-Source Framework for Globally Distributed Low-Communication Training
My paper/code reading notes in Chinese
This is a workshop designed for Amazon Bedrock, a foundational model service.
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
RAG with LangChain using Amazon Bedrock and Amazon OpenSearch
Contriever: Unsupervised Dense Information Retrieval with Contrastive Learning
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites
A collection of AWESOME things about mixture-of-experts
alibaba / Megatron-LLaMA
Forked from NVIDIA/Megatron-LM
Best practice for training LLaMA models in Megatron-LM
An open-source framework for training large multimodal models.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
An open-source tool-augmented conversational language model from Fudan University
This is a repository for the paper on testing inductive bias with scaled-down RoBERTa models.
Examples and guides for using the OpenAI API
An unofficial Python wrapper for OpenAI's ChatGPT API
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
This is the repository for the resources in TACL 2022 Paper "Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference".
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
Natural Perturbation for Robust Question Answering
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).