KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration

This is the GitHub repository for our IEEE VIS 2024 paper,
“KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration” (Pre-Print).
This paper received the Best Paper Honorable Mention at IEEE VIS 2024.

Demo: https://maurilaparva.github.io/KNOWNET/

Documentation: https://visual-intelligence-umn.github.io/KNOWNET/

Quick Start

1. Clone the repository

git clone https://github.com/visual-intelligence-umn/KNOWNET.git
cd KNOWNET

2. Set up a Python virtual environment

python3 -m venv venv
source venv/bin/activate   # (on Windows: venv\Scripts\activate)

3. Install frontend dependencies

KNOWNET uses React + TypeScript + Vite.

pnpm install

(If you don’t have pnpm installed, run npm install -g pnpm first.)

4. Run the development server

pnpm run dev

The app will be served locally at:
http://localhost:5173

Development Setup

If you are modifying the codebase:

Install Frontend Dev Dependencies

pnpm install
pnpm run dev

Install Backend Dev Dependencies

cd api
python3 index.py

Both servers must run concurrently in development mode.

Backend Data and Database Setup

The KNOWNET backend relies on two primary data resources:

Embedding Files — Precomputed text embeddings used for entity and relation verification.
These embeddings are stored on the server and accessed through embedding_utils.py and embeds.py.
Developers reproducing the system locally can use existing OpenAI or local embedding models to regenerate these vectors from the source entity/relation files.
Knowledge Graph Database (Neo4j) —
The deployed version of KNOWNET connects to a Neo4j database hosted on AWS EC2, which contains the curated biomedical knowledge graph used for verification and recommendation.
The hosted instance supports the public demo and is read-only for external users.

Production vs. Local Neo4j Setup

KNOWNET uses two different Neo4j environments, each serving a different purpose:

1. Production Neo4j (AWS EC2) — Used by the Public Demo

Hosts the curated, final biomedical knowledge graph used in the VIS 2024 paper
Contains data imported from node_data.csv and rel_data_filtered.csv, plus several rounds of filtering and manual verification
Read-only for all external users
Connected directly to the deployed Flask backend
Includes precomputed embeddings and optimized indexes
Intended only for running the public demo, not for developer experimentation

2. Local Neo4j (Developer Setup)

Used when developers want to:
- extend KNOWNET
- test custom knowledge graphs
- run experiments offline
Requires creating your own copies of node_data.csv and rel_data_filtered.csv based of ADInt_CUI_embeddings.parquet (can be downloaded below).
Requires computing or providing your own embeddings (see next section)
Credentials and database URI must be configured locally
Fully editable and meant for development

Setting Up a Local Neo4j Instance (Developer Setup)

To reproduce or extend KNOWNET locally with a custom knowledge graph:

Install Neo4j Community or AuraDB
- Local setup: https://neo4j.com/download/
- Cloud setup (recommended for collaboration): https://neo4j.com/cloud/aura/
Create a New Database
- Configure a database (e.g., knownet_db) and note your connection credentials:
```
NEO4J_URI=bolt://localhost:7687
NEO4J_USER=neo4j
NEO4J_PASSWORD=<your-password>
```
- Update these credentials in your local .env or configuration file (used by verify.py and recommend.py).
Load Graph Data The biomedical knowledge graph used for KNOWNET is constructed from two ADInt-derived files:
- node_data.csv - list of biomedical entities
- rel_data_filtered.csv - filtered relations between entities These files were used to build both the AWS Neo4j database and the local development setup. Note: KNOWNET does not include these files in the repository.
```
 LOAD CSV WITH HEADERS FROM 'file:///node_data.csv' AS row
 CREATE (:Entity {
 cui: row.CUI,
 name: row.name,
 type: row.type
 });

 LOAD CSV WITH HEADERS FROM 'file:///rel_data_filtered.csv' AS row
 MATCH (a:Entity {cui: row.Source}), (b:Entity {cui: row.Target})
 CREATE (a)-[:RELATION {type: row.Relation}]->(b);
```
- If you are using your own biomedical dataset, you may adapt this format to match your schema.
Generate or Import Embeddings
- Use embedding_utils.py to compute embeddings for all entities and relations:
```
python3 api/embedding_utils.py
```
- Store resulting vectors locally or in a connected S3 bucket.

Downloading Precomputed Embeddings

KNOWNET uses precomputed concept embeddings stored in:

ADInt_CUI_embeddings.parquet

A copy of the embeddings file can be downloaded here:

Download ADInt_CUI_embeddings.parquet

After downloading the file, place it in the api/ directory (or your preferred location), and update the path in embeds.py:

EMBEDDING_FILE = "api/ADInt_CUI_embeddings.parquet"

AWS Deployment (for Reference)

The hosted KNOWNET backend runs on AWS EC2 with a connected Neo4j instance configured for read access.
Developers replicating this deployment can follow a similar setup:

Launch an EC2 instance (Ubuntu 22.04 or later).
Install Neo4j, Flask, and dependencies listed in requirements.txt.
Configure security groups to allow bolt:// (7687) and HTTPS (443) traffic.
Use Nginx or a similar reverse proxy for routing and SSL termination.

Project Structure

The KNOWNET repository follows a modular full-stack architecture composed of a Python-based backend and a React + TypeScript frontend.
Below is an overview of the top-level structure:

KNOWNET/
├── api/                     # Flask backend (embedding, verification, recommendation)
│   ├── embedding_utils.py
│   ├── embeds.py
│   ├── index.py
│   ├── recommend.py
│   └── verify.py
│
├── src/                     # React + TypeScript frontend
│   ├── assets/
│   ├── components/
│   ├── lib/
│   ├── main.tsx
│   ├── App.css
│   └── index.css
│
├── docs/                    # Documentation and setup guides
├── requirements.txt         # Python dependencies
├── package.json             # Frontend dependencies
├── vite.config.js           # Vite build configuration
├── tailwind.config.js       # TailwindCSS configuration
├── postcss.config.js        # PostCSS configuration
├── run_flask.sh             # Flask startup script
└── README.md

`src/components/` — Component Description

This directory contains the primary interface modules that enable chat-based interaction, visualization, and dynamic user feedback.

Component	Description
`App.tsx`	The top-level container that integrates the chat interface, visualization panel, and overall application layout.
`chat.tsx`, `chat-panel.tsx`, `chat-list.tsx`	Manage chat session state, message rendering, and conversational layout structure.
`chat-message.tsx`, `chat-message-action.tsx`	Define the appearance and behavior of individual chat messages, including user interactions such as feedback or copy actions.
`chat-slider.tsx`, `chat-scroll-anchors.tsx`	Control streaming message presentation, auto-scrolling, and smooth viewport transitions during chat updates.
`prompt-form.tsx`	Handles user prompt input and manages message dispatch to the backend API.
`recommendation_tray.tsx`	Displays suggested follow-up queries, related entities, or recommended exploration paths generated by the backend.
`empty-screen.tsx`	Defines the placeholder state and introductory UI shown before a conversation begins.
`vis-flow/`	Implements the interactive visualization of knowledge graph entities and relations using React Flow.
`markdown.tsx`	Parses and renders model responses using Markdown and syntax highlighting for structured readability.
`external-link.tsx`, `footer.tsx`, `providers.tsx`, `tailwind-indicator.tsx`	Supporting components that manage layout, theming, developer utilities, and external linking behavior.
`ui/`	Reusable user interface primitives and styled components that provide consistent visual design across the application.

Citation

If you use or reference this project, please cite:

Youfu Yan, Yu Hou, Yongkang Xiao, Rui Zhang, and Qianwen Wang. 2025.
KNowNet: Guided Health Information Seeking from LLMs via Knowledge Graph Integration.
IEEE Transactions on Visualization and Computer Graphics, 31(1), 547–557.
https://doi.org/10.1109/TVCG.2024.3456364

KNOWNET v2: Guided Health Information Seeking via Web Search Verification & Edge Uncertainty

A follow-up version of KNOWNET explores web search–based validation as an alternative to static knowledge-graph integration.
This prototype introduces edge-level uncertainty quantification to assess the reliability of model-generated relations in real time.

View KNOWNET v2 here: https://maurilaparva.github.io/prototype/

Name		Name	Last commit message	Last commit date
Latest commit History 195 Commits
api		api
docs		docs
public		public
src		src
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
README.md		README.md
dist.tgz		dist.tgz
eslint.config.js		eslint.config.js
index.html		index.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.js		postcss.config.js
requirements.txt		requirements.txt
run_flask.sh		run_flask.sh
tailwind.config.js		tailwind.config.js
tsconfig.app.json		tsconfig.app.json
tsconfig.app.tsbuildinfo		tsconfig.app.tsbuildinfo
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.js		vite.config.js
vite.config.js.bak		vite.config.js.bak

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration

Quick Start

1. Clone the repository

2. Set up a Python virtual environment

3. Install frontend dependencies

4. Run the development server

Development Setup

Install Frontend Dev Dependencies

Install Backend Dev Dependencies

Backend Data and Database Setup

Production vs. Local Neo4j Setup

1. Production Neo4j (AWS EC2) — Used by the Public Demo

2. Local Neo4j (Developer Setup)

Setting Up a Local Neo4j Instance (Developer Setup)

Downloading Precomputed Embeddings

AWS Deployment (for Reference)

Project Structure

`src/components/` — Component Description

Citation

KNOWNET v2: Guided Health Information Seeking via Web Search Verification & Edge Uncertainty

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

Visual-Intelligence-UMN/KNOWNET

Folders and files

Latest commit

History

Repository files navigation

KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration

Quick Start

1. Clone the repository

2. Set up a Python virtual environment

3. Install frontend dependencies

4. Run the development server

Development Setup

Install Frontend Dev Dependencies

Install Backend Dev Dependencies

Backend Data and Database Setup

Production vs. Local Neo4j Setup

1. Production Neo4j (AWS EC2) — Used by the Public Demo

2. Local Neo4j (Developer Setup)

Setting Up a Local Neo4j Instance (Developer Setup)

Downloading Precomputed Embeddings

AWS Deployment (for Reference)

Project Structure

src/components/ — Component Description

Citation

KNOWNET v2: Guided Health Information Seeking via Web Search Verification & Edge Uncertainty

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

`src/components/` — Component Description

Packages