Skip to content
View Data-drone's full-sized avatar
😎
😎

Highlights

  • Pro

Block or report Data-drone

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🎨 NeMo Data Designer: A general library for generating high-quality synthetic data from scratch or based on seed data.

Python 468 35 Updated Dec 19, 2025

High performance self-hosted photo and video management solution.

TypeScript 86,819 4,571 Updated Dec 19, 2025

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈你的AI数据分析师,自动分析大量数据,一键生成专业分析报告!

Python 3,232 476 Updated Dec 15, 2025

Plug-and-play document AI with zero-shot models.

Python 120 8 Updated Dec 18, 2025

SOTA search powered LLM

Python 3,743 343 Updated Apr 4, 2025

An OpenSource Deep Research library with reasoning

TypeScript 170 22 Updated Dec 4, 2025

An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.

Python 27,728 2,510 Updated Sep 30, 2025

A lightweight LMM-based Document Parsing Model

Python 6,374 441 Updated Dec 8, 2025

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 24,735 2,202 Updated Dec 19, 2025

Simple UI for debugging correlations of text embeddings

HTML 305 23 Updated May 28, 2025

Build Real-Time Knowledge Graphs for AI Agents

Python 21,212 2,050 Updated Dec 18, 2025

A TTS model capable of generating ultra-realistic dialogue in one pass.

Python 18,976 1,651 Updated Nov 19, 2025

Composable building blocks to build LLM Apps

Python 8,199 1,225 Updated Dec 18, 2025

A flexible, adaptive classification system for dynamic text classification

Python 516 36 Updated Oct 7, 2025

The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨

TypeScript 2,694 247 Updated Dec 19, 2025

AdalFlow: The library to build & auto-optimize LLM applications.

Python 3,920 357 Updated Dec 11, 2025

The LLM's practical guide: From the fundamentals to deploying advanced LLM and RAG apps to AWS using LLMOps best practices

Python 4,495 1,046 Updated Dec 15, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 5,698 602 Updated Dec 14, 2025

A Kubernetes deployable instance of GroundX for document parsing, storage, and search.

Smarty 802 91 Updated Dec 16, 2025

A playbook for effectively prompting post-trained LLMs

896 39 Updated Jan 21, 2025

Everything about the SmolLM and SmolVLM family of models

Python 3,454 243 Updated Nov 20, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,427 323 Updated Dec 19, 2025

🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data

TypeScript 70,120 5,510 Updated Dec 19, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 57,342 5,802 Updated Dec 18, 2025

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python 5,068 509 Updated Dec 18, 2025

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 6,165 568 Updated Aug 22, 2025
Jupyter Notebook 693 85 Updated Apr 30, 2025

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 18,711 4,447 Updated Dec 17, 2025

Synthetic data generation for tabular data

Python 3,359 409 Updated Dec 18, 2025
Next