Stars
Python tool for translating subtitles using Google Gemini AI
Cloud-native, data onboarding architecture for Google Cloud Datasets
An autonomous agent that conducts deep research on any data using any LLM providers
A list of AI analytics tools (assistants, chat with data, text-to-sql, benchmarks, etc.)
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
General technology for enabling AI capabilities w/ LLMs and MLLMs
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
This project uses the open-source model Mistral Small, deployed in Amazon SageMaker or invoked via API on Amazon Bedrock, to enable users to chat with their database using natural language, without…
A framework for few-shot evaluation of language models.
LLM Workshop by Sourab Mangrulkar
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)
Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
Best Practices on Recommendation Systems
A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch
💫 Industrial-strength Natural Language Processing (NLP) in Python
Techniques for deep learning with satellite & aerial imagery