Stars
Production-ready platform for agentic workflow development.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale.
Integration between Lance and Ray for distributed data processing
A data collection and processing pipeline for animal video. Annotations include mask, keypoint, depth, occlusion, etc. Suitable for 3D/4D reconstruction, tracking, pose prediction, etc.
A super great audio/video source and FFmpeg wrapper
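Cyber Doctor project "Cyber Huatuo": a multifunctional agent based on multimodal large models, with one-click setup of a local multimodal large model. Once connected to medical and health knowledge graphs and knowledge bases, it can perform preliminary disease diagnosis, medical record analysis, professional Q&A, and more, serving as your personal doctor. The Cyber Huatuo project can help spread medical resources across regions, letting more people improve their health with the help of large models. "Cyber Huatuo" - Easy to build a personal doctor agent based o…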
A High-performance cross-platform Video Processing Python framework powerpacked with unique trailblazing features 🔥
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
Video Duplicate Finder - Cross-platform
A machine learning image inpainting project that removes watermarks from images so that the output is indistinguishable from the ground-truth image.
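MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. Trains medical large language models, implementing incremental pre-training (PT), supervised fine-tuning (SFT), RLHF, DPO, ORPO, and GRPO.
A medical text processing system based on a multi-agent architecture that enables intelligent processing of medical text. It mainly includes text summarization, research analysis, and PHI (protected health information) redaction, and is equipped with an intelligent validation mechanism to ensure output quality.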
Batch LLM Inference with Ray Data LLM: From Simple to Advanced
A lightweight data processing framework built on DuckDB and 3FS.
SympCheck Helper is a modern healthcare consultation chatbot that leverages the power of DeepSeek V3 API and Supabase to provide intelligent health-related assistance. Built with cutting-edge techn…
This demo uses ultralytics-YOLO8 to train a model that detects watermark positions, then uses IOPaint to remove the detected watermarks.
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
Model for watermark classification implemented with PyTorch
Framework for processing and filtering datasets
The largest-scale Chinese medical QA dataset, with 26,000,000 question-answer pairs.
An AI copilot that reads research papers, clinical trials, and drug trials for you, summarizes them, and lets you chat with the knowledge base.