Lists (1)
Sort Name ascending (A-Z)
Stars
- All languages
- ActionScript
- ApacheConf
- Batchfile
- Blade
- C
- C#
- C++
- CSS
- Dart
- Dockerfile
- Erlang
- Gherkin
- Go
- Groovy
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- LLVM
- LilyPond
- Lua
- MDX
- Makefile
- Objective-C
- Objective-C++
- PHP
- PLpgSQL
- PowerShell
- Python
- QML
- Rich Text Format
- Ruby
- Rust
- Scala
- Shell
- Svelte
- Swift
- TLA
- TypeScript
- Vim Script
- Vue
- XSLT
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
Fast and accurate AI powered file content types detection
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A configurable set of panels that display various debug information about the current request/response.
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
The Open edX LMS & Studio, powering education sites around the world!
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
Accessible large language models via k-bit quantization for PyTorch.
A fast PostgreSQL Database Client Library for Python/asyncio.
Text-audio foundation model from Boson AI
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Very efficient backup system based on the git packfile format, providing fast incremental saves and global deduplication (among and within files, including virtual machine images). Please post prob…
Multilingual Voice Understanding Model
Community maintained fork of pdfminer - we fathom PDF
No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents
All-in-One Development Tool based on PaddlePaddle
Extract Keywords from sentence or Replace keywords in sentences.
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection