Skip to content

repomesh/pipeshub-ai

 
 

PipesHub

The Open-Source Workplace AI Platform

Website · Docs · Discord · Roadmap

PipesHub on Trendshift

License Release Docker Pulls Discord Last commit Commits per month GitHub issues GitHub pull requests GitHub Repo stars
Twitter LinkedIn

Translations: Français · Deutsch · 简体中文 · 日本語 · Русский · עברית · 한국어 · Español · Português · Türkçe · Tiếng Việt · Italiano

PipesHub - Explainable & Extensible

PipesHub is an open-source, self-hosted AI-native execution layer that connects enterprise knowledge, delivers explainable search with citations, and automates workflows across your systems.

Features

  • 📝 Explainable Answers: PipesHub delivers grounded answers with precise block citations to the original documents.
  • 🔒 Permission-Aware Search: Enforces source-level access controls so users only see what they're authorized to.
  • 🕸️ Knowledge Graph Retrieval: Graph-backed retrieval that captures relationships across enterprise data.
  • 🔌 Enterprise Connectors: 30+ connectors with real-time and scheduled indexing out of the box.
  • 🔍 Unified Search, Deep Research, and Agents: Search, Q&A, deep research, web search, and AI agents on one context layer.
  • 📊 Artifacts and Code Execution: Generate reports, charts, and dashboards in a safe execution sandbox.
  • 🎙️ Multimodal Support: Image, diagram, and scanned-file understanding plus voice-based interaction.
  • 🤖 No-Code Agents and Actions: Build agents visually and execute actions across enterprise tools.
  • 🧠 Bring Your Own Model, Fully Self-Hostable: Any LLM provider, deployed in your VPC — data never leaves your infrastructure.
  • 🛠️ Developer-First and Extensible: APIs, SDKs, MCP tools, custom connectors, and independently scalable services.

PipesHub in Action

Connectors

Connectors

Citations

Citations

All Records

All Records

Knowledge Search

Knowledge Search

Connectors

PipesHub Connectors

File Formats Supported

Format Details
PDF Including scanned PDFs
Docx / Doc Microsoft Word
XLSX / XLS Microsoft Excel
PPTX / PPT Microsoft PowerPoint
CSV Comma-separated values
Markdown .md files
HTML Web pages
Text Plain text files
Google Docs, Sheets, Slides Google Workspace formats
Images PNG, JPG, etc.
Audio Audio files
Video Video files

Tech Stack

Frontend

Technology Description
Next.js App Router UI (client-rendered React)
TypeScript Strongly typed JavaScript superset
Radix UI Themes Accessible component primitives and styling
Zod Schema validation and parsing
React Hook Form Flexible form state management

Backend

Technology Description
FastAPI High-performance Python web framework
LangChain Framework for LLM pipelines
LangGraph State graph for LLM workflows
Qdrant Vector similarity search engine
Neo4j / ArangoDB Graph database
Kafka / Redis Streams Distributed event streaming platform
Redis Caching
Redis / etcd3 Distributed key-value configuration store
Celery Distributed task queue system
Docling Document parsing and extraction toolkit
PyMuPDF PDF processing library
pandas Data analysis and manipulation

🚀 Deployment Guide

PipesHub (the Workplace AI Platform) can be run locally or deployed on the cloud using Docker Compose. Note: If you are deploying PipesHub on a cloud server, make sure you are using an HTTPS endpoint. PipesHub enforces stricter security checks, and browsers will block certain requests when the application is served over HTTP. You can use a reverse proxy like Cloudflare, Nginx, or Traefik to terminate SSL/TLS and provide a valid HTTPS certificate. If you see a white screen after deploying PipesHub while accessing it over HTTP, this is likely the cause. The frontend will refuse to load due to stricter security checks.


📦 Production Deployment

# Clone the repository
git clone https://github.com/pipeshub-ai/pipeshub-ai.git

# 📁 Navigate to the deployment folder
cd pipeshub-ai/deployment/docker-compose

# Set Environment Variables
> 👉 Set Environment Variables for secrets, passwords, and the public URLs of the **Frontend** and **Connector** services
> _(Required for webhook notifications and real-time updates)_
> Refer to env.template

# 🚀 Start the production deployment
docker compose -f docker-compose.prod.yml -p pipeshub-ai up -d

# 🛑 To stop the services
docker compose -f docker-compose.prod.yml -p pipeshub-ai down

📦 Developer Deployment Build

# Clone the repository
git clone https://github.com/pipeshub-ai/pipeshub-ai.git

# 📁 Navigate to the deployment folder
cd pipeshub-ai/deployment/docker-compose

# Set Optional Environment Variables
> 👉 Set Environment Variables for secrets, passwords, and the public URLs of the **Frontend** and **Connector** services
> _(Required for webhook notifications and real-time updates)_
> Refer to env.template

# 🚀 Start the local build deployment
docker compose -f docker-compose.build.neo4j.yml -p pipeshub-ai up --build -d

# 🛑 To stop the services
docker compose -f docker-compose.build.neo4j.yml -p pipeshub-ai down

The main Dockerfile pulls pre-built layers from pipeshubai/pipeshub-ai-base:python-deps and pipeshubai/pipeshub-ai-base:runtime (see Dockerfile.base in the repo root for build/push commands). To use local tags instead, set PYTHON_DEPS_IMAGE and RUNTIME_BASE_IMAGE in the environment or in compose build args.

MCP Server

Use PipesHub with any MCP-compatible client to bring your enterprise context into AI workflows. Check the README for setup and usage.

Repository: pipeshub-ai/mcp-server

SDKs

PipesHub provides developer SDKs for Python, TypeScript, and Go to help you integrate quickly. Check the respective SDK repository README for setup and usage details.

Name Description Link
Python SDK Python SDK for PipesHub pipeshub-ai/pipeshub-sdk-python
TypeScript SDK TypeScript SDK for PipesHub pipeshub-ai/pipeshub-sdk-typescript
Go SDK Go SDK for PipesHub pipeshub-ai/pipeshub-sdk-go

Need an SDK in another language? Reach out to us at developer@pipeshub.com

RoadMap

We ship in the open. Here's what's done and what's next:

  • ✅ 🤖 Workplace AI agents: first-class no-code agent builder
  • ✅ 🔗 MCP (Model Context Protocol) support, both server and client
  • ✅ 🧰 Developers SDKs
  • ✅ 🔍 Code search across GitHub, GitLab, and Bitbucket
  • ⬜ 👤 Personalized search based on team, role, and history
  • ✅ ☸️ Production Kubernetes deployment with HA defaults
  • ⬜ 📈 PageRank-augmented relevance across the knowledge graph

👉 View the full product roadmap on Notion


👥 Contributing

Want to join our community of developers? Please check out our Contributing Guide for more details on how to set up the development environment, our coding standards, and the contribution workflow.

Where to go for what

Ask a question or get helpDiscord
Report a bug or request a featureGitHub Issues
Report a security issueReport Security Issue
Read the docsPipeshub Docs

FAQ

What is PipesHub?

PipesHub is an open-source, self-hosted AI-native execution layer that connects enterprise knowledge, delivers explainable search with citations, and automates workflows across your systems. It provides a unified context layer for search, Q&A, deep research, web search, and AI agents.

How is PipesHub different from other workplace AI tools?

PipesHub is fully open-source (Apache 2.0) and self-hostable — your data never leaves your infrastructure. It features permission-aware search that enforces source-level access controls, and delivers explainable answers with precise block citations to original documents.

What connectors does PipesHub support?

PipesHub has 30+ enterprise connectors with real-time and scheduled indexing. It supports file formats like PDF, Docx, XLSX, PPTX, CSV, Markdown, HTML, Google Docs/Sheets/Slides, images, audio, and video.

How do I deploy PipesHub?

# Clone the repository
git clone https://github.com/pipeshub-ai/pipeshub-ai.git
cd pipeshub-ai/deployment/docker-compose

# Set Environment Variables (refer to env.template)
# Start production deployment
docker compose -f docker-compose.prod.yml -p pipeshub-ai up -d

Note: Use HTTPS for cloud deployments. HTTP may cause frontend security blocks.

What LLM providers does PipesHub support?

PipesHub is "Bring Your Own Model" — you can use any LLM provider. Deploy in your VPC with your preferred models. The tech stack includes LangChain and LangGraph for LLM pipelines and workflows.

What is the Knowledge Graph Retrieval feature?

PipesHub uses graph-backed retrieval that captures relationships across enterprise data. It uses Neo4j or ArangoDB as graph databases, combined with Qdrant for vector similarity search.

Does PipesHub have an MCP server?

Yes. PipesHub provides an MCP server for integration with any MCP-compatible client. Repository: pipeshub-ai/mcp-server.

What SDKs are available?

PipesHub provides SDKs for:

Can I build AI agents without coding?

Yes. PipesHub has a no-code agent builder. You can build agents visually and execute actions across enterprise tools without writing code.

What is the multimodal support?

PipesHub supports image, diagram, and scanned-file understanding, plus voice-based interaction. It uses Docling and PyMuPDF for document parsing, and Azure Document Intelligence or a multimodal LLM (VLM) for scanned PDF OCR.

How do I troubleshoot deployment issues?

  1. Ensure HTTPS is configured for cloud deployments
  2. Check Docker compose logs: docker compose logs
  3. Verify environment variables in env.template
  4. Consult docs.pipeshub.com for detailed guides

Where can I get help?


⭐ Star us on GitHub!

It helps the project reach the teams who need it.

Star PipesHub

Star History

Built with ❤️ by the PipesHub team and contributors around the world.

About

The OpenSource Alternative to Glean's Workplace AI

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 78.1%
  • TypeScript 21.7%
  • Makefile 0.1%
  • CSS 0.1%
  • JavaScript 0.0%
  • Shell 0.0%