Skip to content
View ice6's full-sized avatar

Block or report ice6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
156 stars written in Python
Clear filter

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,471 732 Updated Sep 22, 2025

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Python 9,444 1,325 Updated Apr 24, 2024

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Python 9,424 1,593 Updated Aug 9, 2024

Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.

Python 9,106 825 Updated Nov 8, 2025

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,088 827 Updated Nov 3, 2025

Ollama Python library

Python 8,853 853 Updated Nov 13, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,736 760 Updated Nov 13, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 8,452 1,385 Updated Oct 14, 2025

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,371 735 Updated Aug 13, 2024

The awesome document factory

Python 8,345 766 Updated Nov 12, 2025

A configurable set of panels that display various debug information about the current request/response.

Python 8,307 1,069 Updated Nov 11, 2025

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 8,204 630 Updated Sep 22, 2025

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 8,004 697 Updated Feb 10, 2025

The Open edX LMS & Studio, powering education sites around the world!

Python 7,914 4,179 Updated Nov 13, 2025

MySQL client library for Python

Python 7,836 1,444 Updated Aug 24, 2025

Semantic cache for LLMs. Fully integrated with LangChain and llama_index.

Python 7,827 566 Updated Jul 11, 2025

Accessible large language models via k-bit quantization for PyTorch.

Python 7,742 793 Updated Nov 13, 2025

A fast PostgreSQL Database Client Library for Python/asyncio.

Python 7,679 434 Updated Oct 21, 2025

Text-audio foundation model from Boson AI

Python 7,621 565 Updated Sep 15, 2025

用文本编辑器剪视频

Python 7,462 773 Updated Oct 5, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 7,343 666 Updated Nov 10, 2025

Very efficient backup system based on the git packfile format, providing fast incremental saves and global deduplication (among and within files, including virtual machine images). Please post prob…

Python 7,272 424 Updated Aug 30, 2025

Multilingual Voice Understanding Model

Python 6,938 646 Updated Aug 15, 2025

Community maintained fork of pdfminer - we fathom PDF

Python 6,781 1,014 Updated Nov 7, 2025
Python 6,727 1,136 Updated Nov 3, 2025

No-code LLM Platform to launch APIs and ETL Pipelines to structure unstructured documents

Python 5,936 565 Updated Nov 13, 2025

All-in-One Development Tool based on PaddlePaddle

Python 5,886 1,110 Updated Nov 13, 2025

Extract Keywords from sentence or Replace keywords in sentences.

Python 5,683 605 Updated Apr 13, 2025

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...

Python 4,499 1,590 Updated Oct 29, 2025

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Python 4,415 396 Updated Nov 13, 2025