Starred repositories
12 Lessons to Get Started Building AI Agents
Integrate cutting-edge LLM technology quickly and easily into your apps
World's first open-source real-time end-to-end spoken dialogue model with personalized voice cloning.
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey".
Generate audiobooks from e-books, voice cloning & 1158+ languages!
PyTorch implementation of the Variational Recurrent Neural Network (VRNN).
Code and documentation to train Stanford's Alpaca models, and generate the data.
Large Language Model Text Generation Inference
The official Python library for the OpenAI API
Convert PDF to HTML without losing text or format.
pdf-translator translates English PDF files into Japanese, preserving the original layout.
Community-maintained fork of pdfminer - we fathom PDF
Convert PDF to HTML without losing text or format.
Python module for NAVER English-Korean and Korean-English dictionaries
👦 👧 Technical-Interview guidelines written for those who started studying programming. I wish you all the best. 👾
A quick guide (especially) for trending instruction finetuning datasets
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Awesome-LLM: a curated list of Large Language Model resources
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A Python package to build AI-powered real-time audio applications
Real-time transcription with OpenAI Whisper.