A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
-
Updated
Nov 27, 2025 - Python
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
将 GeminiCLI 和 antigravity 转换为 OpenAI 和 GEMINI API 接口
Co-create PowerPoint slide decks with AI
DarkGPT Chat Explorer is an interactive web application that allows users to engage in conversations with various GPT (Generative Pre-trained Transformer) models in real-time. This repository contains the source code for the application.
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
Bot on aiogram powered by AI
A completely free python discord chat bot which uses Google's Gemini API.
Curated collection of AI agents built with Google’s Agent Development Kit (ADK): templates, best practices, and production-ready examples for research, business, automation, education, and more.
Google Gemini Vision Web application with Speech and Text
AI-powered YouTube Shorts automation suite that handles content discovery, downloading, metadata optimization, uploading, scheduling, and performance tracking with self-improvement capabilities.
Gemini Pro: An AI-powered Telegram bot script for generating text and image-based responses using Gemini AI
This project enables real-time streaming of audio (and optionally video or screen captures) from your local device to Google Gemini using the Live API. It allows you to interact with Gemini through both text and voice, supporting conversational AI responses.
Python Server for Mivro
TelegramBot Tool to Interact with Google's Gemini AI Chatbot
A telegram bot that uses Google's Gemini Pro Vision API to convert image to text
An AI-powered suite to automate job scraping, resume parsing, job-to-resume scoring, and application tracking, designed to run via GitHub Actions.
Enhanced Voice Assistant, Works with both terminal and API, Multimodal, Multilingual, Modular design. Supports Voice ID, Face recognition, and configurable tools. Built-in Chatgpt, Claude, Deepseek, Gemini, Grok, and Ollama. Explore the possibilities of Human-AI interactions
An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text and image-based interactions
AI agent for creating personalized digests of research papers
Add a description, image, and links to the gemini-ai topic page so that developers can more easily learn about it.
To associate your repository with the gemini-ai topic, visit your repo's landing page and select "manage topics."