Computer Science Ph.D. - Artificial Intelligence
Stars
[Support 0.49.x] (Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor AI: automatically resets the machine ID and upgrades to Pro features for free: You've reached your trial request limit. / Too many free trial accounts used on this machi…
Fixes the following prompts that Cursor shows during the free subscription period: Your request has been blocked as our system has detected suspicious activity / You've reached your trial request limit. / Too many free trial accounts used on this machine.
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Repository for training models for music source separation.
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
Serve Ollama LLMs on Google Colab (free plan) using Ngrok
wip - running some training with overfitting - https://wandb.ai/snoozie/vasa-overfitting
Open-source Windows and Office activator featuring HWID, Ohook, TSforge, KMS38, and Online KMS activation methods, along with advanced troubleshooting.
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
This tool uses AI to evaluate your pronunciation.
MARS5 speech model (TTS) from CAMB.AI
Generate music based on natural language prompts using LLMs running locally
A curated list of awesome voice conversion projects and communities.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
Fine-tuning wav2vec2 for Pathological Speech Processing
HashLips Art Engine is a tool used to create multiple different instances of artworks based on provided layers.
A one-of-a-kind resume builder that keeps your privacy in mind. Completely secure, customizable, portable, open-source and free forever. Try it out today!
Custom Kaldi recipes for DNN feature extraction on public and non-public audio corpora, related to medical speech and computational paralinguistics.
This project attempts to provide the functionality offered by various libraries used for speech and audio processing in an 'all-in-one' fashion.