Stars
The Quranic Arabic Corpus, an invaluable linguistic resource, is due for a revamp. We're calling on Linguistics, AI, and Tech volunteers to join us in this exciting journey. 🚀
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
aspctu / alpaca-lora
Forked from tloen/alpaca-loraInstruct-tuning LLaMA on consumer hardware
Extends scikit-learn with new models, transformers, metrics, plotting.
Scalable training for dense retrieval models.
Models supported: ResNet, ResNetV2, SE-ResNet, ResNeXt, SE-ResNeXt [layers: 18, 34, 50, 101, 152] (1D and 2D versions with DEMO for Classification and Regression).
PyTorch code for ETSformer: Exponential Smoothing Transformers for Time-series Forecasting
An open collection of implementation tips, tricks and resources for training large language models
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch
A collection of libraries to optimise AI model performances
A tool to help researchers and product teams understand datasets with the goal of improving data quality, and mitigating fairness and bias issues.
A playbook for systematically maximizing the performance of deep learning models.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…
A curated list of modern Generative Artificial Intelligence projects and services
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A semantic segmentation pipeline for custom image annotation
Stable Diffusion in TensorFlow / Keras
Repository containing notebooks of my posts on Medium
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Scalable and user friendly neural 🧠 forecasting algorithms.
Lightning ⚡️ fast forecasting with statistical and econometric models.
Metric learning and retrieval pipelines, models and zoo.