Starred repositories
Python tool for converting files and office documents to Markdown.
💫 Toolkit to help you get started with Spec-Driven Development
Seamless Agent - A VS Code extension that makes AI agents require user confirmation before actions are executed.
Sample code and notebooks for Generative AI on Google Cloud, with Gemini Enterprise Agent Platform
Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Community-contributed instructions, agents, skills, and configurations to help you make the most of GitHub Copilot.
Multi-platform SDK for integrating GitHub Copilot Agent into apps and services
📄 Awesome OCR toolkits for multiple programming languages, based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
Schema definition and other documents of eSDScom (formerly SDScom and ESCom), the standard for electronic exchange of Safety Data Sheets in a structured, processible way across Europe and other reg…
AI Agent Framework, the Pydantic way
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
A fast type checker and language server for Python
Breakthrough Method for Agile AI-Driven Development
WPF UI provides the Fluent experience in your known and loved WPF framework. Intuitive design, themes, navigation and new immersive controls. All natively and effortlessly.
AudioMog is a free all-in-one audio modding tool that allows users to unpack and repack supported games' audio binary files.
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
This is the official repository of "Scalable Neural Vocoder from Range-Null Space Decomposition", which has been submitted to TPAMI.
Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with synthetic captions.
PyTorch model for contextless phoneme prediction from speech audio
Extract phoneme-level timestamps from speech audio.