-
tada Public
Forked from HumeAI/tada
Open Source Speech Language Model
Jupyter Notebook MIT License Updated Mar 10, 2026
-
autoresearch Public
Forked from karpathy/autoresearch
AI agents running research on single-GPU nanochat training automatically
Python Updated Mar 9, 2026
-
pocket-tts Public
Forked from kyutai-labs/pocket-tts
A TTS that fits in your CPU (and pocket)
Python MIT License Updated Jan 29, 2026
-
kubernetes-with-python Public
Forked from Raihan-009/kubernetes-with-python
-
Qwen3-TTS Public
Forked from QwenLM/Qwen3-TTS
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice…
Python Apache License 2.0 Updated Jan 24, 2026
-
TTS-Training-Blueprint Public
Intuitive understanding of Autoregressive TTS Models
-
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming architecture for fluid conversations with immediate responses and…
-
supertonic Public
Forked from supertone-inc/supertonic
Lightning-fast, on-device TTS, running natively via ONNX.
Swift MIT License Updated Nov 19, 2025
-
awesome-llm-speech-to-speech Public
Forked from tleyden/awesome-llm-speech-to-speech
Awesome LLM speech-to-speech models and frameworks
Apache License 2.0 Updated Nov 18, 2025
-
reader3 Public
Forked from karpathy/reader3
Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.
Python Updated Nov 18, 2025
-
train-higgs-audio Public
Forked from JimmyMa99/train-higgs-audio
Text-audio foundation model from Boson AI
-
unmute Public
Forked from kyutai-labs/unmute
Make text LLMs listen and speak
Python MIT License Updated Jul 24, 2025
-
Orpheus-TTS-Local Public
Run Orpheus TTS locally.
-
Training-TTS Public
Train and fine-tune text-to-speech models for Bengali and many other languages!
-
csm Public
Forked from SesameAILabs/csm
A Conversational Speech Generation Model
Python Apache License 2.0 Updated Mar 14, 2025
-
Ollm-Bridge Public
Forked from Les-El/Ollm-Bridge
Easily access your Ollama models within LMStudio
-
This is a web application that lets you search for images using natural language, powered by modern Vision Transformer (ViT) models. It works by processing and indexing the images you upload, then …
-
Bangla-Llama Public
Fine-tuned Llama 3 models for context-based question answering in the Bengali language.
-
bangla-pdf-ocr Public
Bangla PDF to text converter that works on Windows, macOS, and Linux without any extra downloads or configurations.
-
API-Demo Public
This project is an API Key Manager application developed using Node.js, Express, and MongoDB. I created this demo project to practice API creation, handling, and Docker management as part of my web…
-
ChameleonAI Public
Introducing ChameleonAI, your very own customizable roleplaying AI chatbot. Powered by Google's state-of-the-art PaLM generative AI model, it lets you talk to anything, anyone, anywhere!
-
PoRAG Public
Forked from Bangla-RAG/PoRAG
Fully Configurable RAG Pipeline for Bengali Language RAG Applications. Supports both Local and Huggingface Models, Built with Langchain.
-
Fine-tune mBart 50 for Bengali Sentence Error Correction
-
Train an mBart model with your data.
Jupyter Notebook Apache License 2.0 Updated May 6, 2024
-
Scrape-Any-Sites Public
Scrape any website with a single script; all you need is the domain name!
-
Text Generation with TensorFlow from Scratch
-
Tired of struggling to find answers in your PDFs? Talk to Doc is here to help! Ask it questions, and it will use its advanced capabilities to extract the information you need quickly and easily.
-
Experiments-with-Gemma-2B Public
I’ll be testing different Gemma models and sharing the results here and on my Hugging Face space. Stay tuned for updates!