5 TH
5 TH
sciences
Article
A Visual Analytics Framework for Inter-Hospital Transfer
Network of Stroke Patients
Kyuhan Kwak, Jinu Park and Hyunjoo Song *
School of Computer Science and Engineering, Soongsil University, Seoul 06978, Republic of Korea
* Correspondence: hsong@ssu.ac.kr; Tel.: +82-2-820-0676
Abstract: Effective inter-hospital coordination is crucial in improving the stroke treatment process and
outcomes. The introduction of endovascular thrombectomy (EVT) further emphasized the importance
of coordination. Although previous studies considered various clinical data besides stroke in terms
of the network structure between hospitals, a majority of these studies performed only quantitative
analyses instead of topological analyses. This study proposes a new framework (PatientFlow) for
constructing a network based on stroke patient transfer data and performing exploratory analysis.
The proposed framework can visualize the network structure among hospitals at the national level
and analyze the detailed structure through dynamic queries. The hub-and-spoke structure for each
cluster derived through community detection can be compared visually and analyzed quantitatively
using network measures. Further, the relationship between regions can be analyzed by aggregating
the transfer of patients by province. PatientFlow allows medical researchers to perform an exploratory
analysis to understand the network at the national, provincial, and community levels with multiple
coordinated views.
1. Introduction
Citation: Kwak, K.; Park, J.; Song, H.
Improving the process and outcomes of stroke treatment requires effective inter-
A Visual Analytics Framework for
hospital coordination [1]. The introduction of endovascular thrombectomy (EVT) high-
Inter-Hospital Transfer Network of
Stroke Patients. Appl. Sci. 2023, 13,
lighted the significance of coordination even further [2]. The effectiveness of EVT was
5241. https://doi.org/10.3390/
proved in several studies conducted in 2015 [3,4]; however, it was challenging to introduce
app13095241 it in all hospitals. Since only some major hospitals can afford the environment, smaller hos-
pitals transfer patients who need EVT treatment to a capable hospital. Consequently, given
Academic Editors: Il Dong Yun and
this interest in transferring stroke patients, recent research focused on how the network
Walid Abdullah Al
between hospitals is structured [1,5].
Received: 14 February 2023 Despite the interest and importance, information on network structure for stroke treat-
Revised: 16 April 2023 ment is unknown and still needs to be studied [6]. In the Republic of Korea, 70 medium
Accepted: 19 April 2023 catchment areas have been established for inter-hospital coordination, and they are man-
Published: 22 April 2023 aged along with 17 large catchment areas centered on tertiary hospitals [7]. However,
there could be differences between the catchment area established by the government
and the service area in which healthcare institutions occupy the market through mutual
competition. Thus, there are ongoing discussions between policymakers of central and
Copyright: © 2023 by the authors. local governments about determining the catchment areas and inspecting whether the
Licensee MDPI, Basel, Switzerland.
hospitals within the areas are coordinating with each other [7].
This article is an open access article
Although previous studies considered various clinical problems besides stroke in
distributed under the terms and
terms of the network structure between hospitals, most of these studies performed only
conditions of the Creative Commons
quantitative analyses on the related clinical outcomes instead of topological analysis. For
Attribution (CC BY) license (https://
example, studies on hospital-acquired infections (HAIs) analyzed national hospital net-
creativecommons.org/licenses/by/
4.0/).
works in the United States [8] and France [9]. The network structure was used to develop
a method for rapidly detecting disease prevalence [8] or a method to construct an opti-
mized network for HAIs from patient transfer data [9]. Although stroke-related network
studies [1,5] used geospatial data for the analysis, they focused on changes in the stroke
treatment outcomes following the introduction of EVT or the increase or decrease in the
number of patients transferred between hospitals rather than deriving and analyzing a
hub-and-spoke structure of communities.
Therefore, deriving clear quantitative criteria for well-formed clusters from existing
studies was challenging. As a result, it was necessary to perform clustering on the given
data and then visually explore the well/poorly formed cases based on domain knowledge
by looking at the topology of each cluster. Based on the results of this exploratory analysis,
one could examine the network metrics of well/poorly formed clusters and heuristically
derive quantitative commonalities for each case. However, there are difficulties in dy-
namically analyzing partial data of interest (e.g., clusters showing similar network metric
values) with the abovementioned approaches since they did not focus on performing
exploratory analysis.
This study proposes an interactive framework, PatientFlow, for network analysis of
stroke care hospitals. It introduces a community-detection algorithm for patient transfer
data at the national level to create an environment that facilitates an exploratory analysis
by reflecting the geospatial characteristics of hospitals. The contributions of this study are
summarized as follows:
1. Introduction of an integrated visual analysis environment supporting data prepro-
cessing, cleansing, and filtering, allowing users to apply them dynamically;
2. Introduction of three-level exploratory analysis of patient transfers networks
(i.e., national, provincial, and community levels) with multiple coordinated views;
3. Introduction of a comparative analysis of commonly used network measures between
communities in clinical research;
4. Evaluation of efficacy through case studies with clinical researchers and comparison
with previous studies.
The rest of this paper is organized as follows. Section 2 reviews the cases of analyzing
stroke networks and visually analyzing network data in the healthcare field. We summarize
the data covered in this study, describe the data types and preprocessing procedures, and
provide an abstraction of the network analysis tasks performed by medical researchers
(Section 3). Then, we introduce the visual analysis framework and summarize how the
visualization has been constructed to reflect the task analysis results (Section 4). The content
and results of the case study conducted with domain experts are described, and this is
followed by further discussions. Finally, we conclude with Section 6, recapitulating the
main findings and introducing future work.
2. Related Work
The patient transfer was visually analyzed with node-link diagrams for various clinical
purposes. Fernández-Gracia et al. [8] analyzed the spread of pathogens in the patient
transfer network in the United States. The network structure constructed based on patient
transfers for two years was plotted on a map with the number of transferred patients
indicated by the color of the link. The analysis results were used to introduce a method
for selecting sensor hospitals based on a network measure (i.e., in-degree) and for quickly
detecting disease prevalence based on temporal, geographical, and topological properties.
Nekkab et al. [9] divided the patient group (1) with HAI, (2) with suspected HAI, and
(3) all patients, and they derived the network connection structure based on the patient
transfer pattern for determining the optimal network for studying HAIs. The transfer
pattern was also presented on a map, and the authors identified notable transfer patterns
that could help find a critical hospital at the regional and county level in France. The
nodes in the diagram were colored according to the community derived from the detection
algorithm [10]. Dong et al. [11] used a community detection algorithm [12], which extends
the previous approach [10] for large networks, to analyze the clustering patterns of hospitals
Appl. Sci. 2023, 13, 5241 3 of 21
in China. The authors neglected geographic information in analyzing the network and
plotting the nodes and links. We presented a hospital network using both geographical
layout and force-directed layout algorithms for analysis of geographical relationships and
topological analysis within each cluster, respectively.
A study on patient transfers was conducted regarding treatments for stroke patients.
Adeoye et al. [2] studied the geographic access of stroke patients in the United States.
They argued that a more efficient stroke treatment system was required by analyzing
country-level data on the accessibility to hospitals and the number of patients who received
treatments from the hospitals. Accessibility to treatment facilities was visualized using
a choropleth map. Zachrison et al. [7] argued that the study of networks could help im-
prove stroke treatment systems, and they introduced examples of community-detection
algorithms employed in other medical studies. It was followed by studies that grafted a
network analysis on various regions. For example, Zachrison et al. [5] analyzed stroke
patient transfer data in the northeastern United States and visualized the network us-
ing ArcGIS. Each network constructed based on patient transfer data at two time-points
(2007 and 2011) was expressed as a node-link diagram on a map; hospitals that sent a
significant number of patients and hospitals that received them were analyzed. Another
network analysis of the stroke treatment system was conducted for California [1]. The
two networks constructed based on patient transfer data (between 2013 and 2014; and
2016 and 2017) were visualized on a map to examine changes from the introduction of
EVT; the rate of progression of the EVT treatment and the rate of patient transfer were ana-
lyzed. Meanwhile, a study on telestroke care analyzed the hub-and-spoke structures and
investigated how the network changed over time [13]. Our study analyzed the community
in the network based on patient transfer data using the Louvain algorithm for directed
networks [14]. We provided tailored visualization without commercial tools by identifying
tasks through collaboration with medical researchers.
Prior work explored methods to visualize networks based on the data size (i.e., number
of nodes and links) and the purpose of visualization. Komarek et al. [15] classified the
layout method of placing nodes and links on a screen into seven types: force-directed,
hive plot, adjacency matrix, arc diagram, Sankey diagram, chord diagram, and pivot
graph. They compared the number of nodes and links suitable for network visualization.
Vehlow et al. [16] conducted a survey and analyzed previous studies that visualized the
group structure of graphs. According to the criteria from prior work [15], the number of
nodes (i.e., about 1000 hospitals) and the number of links (i.e., about 2000 patient transfers)
in our study were in the range for a force-directed layout. Nonetheless, we placed the
nodes reflecting their actual locations on a map as it was necessary to support the analysis
considering the geographical location of hospitals. Further, grouping was visualized
through superimposition among the methods summarized in a previous study [16] to
demonstrate the group structure derived through community detection. The map was
designed to reduce the scope of analysis through various interactions. There were many
overlapping nodes with many groups, which cannot be distinguished by color when
visualizing them on the map.
Andrienko et al. [17] surveyed visual analytic research of movement and transporta-
tion systems. Patient transfers in this study are origin-destination (OD) travel data without
start and end times. According to the survey, one commonly aggregates OD data into
matrices or flows for visual analysis. While visualizing flows as lines on a map has been
widely used [18], the following research worked on resolving visual clutters for more
significant numbers of flow. For instance, Wood et al. [19] proposed an OD map in which
each cell represents an OD vector as in an OD matrix. It preserved the spatial layout by
introducing a spatial treemap, and reduced visual clutters caused by intersecting links.
Andrienko et al. [20] worked on long-term flow data and proposed OD flow aggrega-
tion by direction and distance ranges. They used the aggregated flow to cluster time
intervals and presented the flow with diagram maps. Von Landesberger et al. [21] also
adopted spatial simplification of locations to avoid visual clutter in analyzing mass mo-
Appl. Sci. 2023, 13, x FOR PEER REVIEW 4 of 21
3.3.Analysis
Analysisof of Data
Data and
and User
User Task
Task
As
Asthis
thisstudy
study aims
aims to
to propose
propose aa visual
visual analysis framework for
analysis framework for medical
medicalresearchers
researchers
interested
interestedin instroke
stroke patients,
patients, we
we collaborated with two
collaborated with two researchers
researchers (one
(oneneurologist
neurologistandand
one
oneclinical
clinicalstatistician)
statistician) from
from aa quaternary hospital. We
quaternary hospital. We met
met monthly
monthlyfor forthree
threeyears,
years,and
and
they participated in data and task analysis. We independently searched for relevant
they participated in data and task analysis. We independently searched for relevant prior prior
work
workand andreached
reacheda consensus
a consensus ononrelevance in the
relevance monthly
in the monthly meetings.
meetings.They shared
They insights
shared in-
over the literature review, provided domain knowledge to aid in determining
sights over the literature review, provided domain knowledge to aid in determining net- network
metrics, and summarized
work metrics, and summarizedtheir analytic requirements.
their analytic The overall
requirements. framework
The overall frameworkfromfrom
data
and
datauser
andtask
useranalysis results results
task analysis is summarized in Figure
is summarized in1,Figure
and the1, following section describes
and the following section
the results.the results.
describes
3.1.Hospital
3.1. Hospitaland
and Patient
Patient Transfer
Transfer Data
Data
InInthe
theRepublic
RepublicofofKorea,
Korea,thetheHIRA
HIRA (Health
(Health Insurance
Insurance Review
Review andand Assessment
Assessment ser-
service)
vice) accumulates and manages a list of hospitals registered as treatment institutions
accumulates and manages a list of hospitals registered as treatment institutions for specific for
specific diseases, as well as the data related to patient transfers between hospitals,
diseases, as well as the data related to patient transfers between hospitals, their treatmenttheir
treatment and
processes, processes, and outcomes.
outcomes. The data
The data used used byresearchers
by medical medical researchers who conducted
who conducted the joint
study consisted of the transfers of 19,113 stroke patients among 1009 hospitals from 1 July
to 31 December 2016. The data used in this study are summarized in Table 1.
Appl. Sci. 2023, 13, 5241 5 of 21
Some data needed to be preprocessed before analysis, and parts of the provided data
were anonymized to prevent the hospital’s or personal information from being identified.
For example, the detailed address information that can specify a hospital in the hospital list
data is not provided, and only the regional administrative district (e.g., cities and provinces)
and second-level administrative district (e.g., cities, counties, or districts) information are
provided. The hospital ID is also randomly assigned to prevent the name of the hospital
from being identified. Accordingly, the GPS coordinates of the government office (e.g., city
hall, county office, or ward office) of the relevant administrative districts are used instead
of the actual location of the hospital to display the location of the hospital on a map. In this
case, random jittering is applied in the process of rendering on the screen to distinguish
them on a map because multiple hospitals within the same administrative district can
overlap. The distance between hospitals is calculated as a straight-line distance based on
the GPS coordinates before jittering is applied.
Other data about the hospital include classification based on the size of the hos-
pital (e.g., A = quaternary hospital, B = tertiary hospital, C = secondary hospital, and
D = primary hospital). One can also classify the hospital based on the stroke care hospital
(SCH) designation status. An SCH is a hospital equipped with healthcare professionals and
an environment that can provide intensive treatment, including EVT, for stroke patients;
the SCH designation status is considered important in the network analysis process in
previous related studies. Based on the data received from the HIRA, all quaternary hos-
pitals are designated as SCHs, and some tertiary hospitals are designated as SCHs. None
of the secondary or primary hospitals are designated as SCHs. Accordingly, the interface
is configured to facilitate the classification of B-type hospitals depending on their SCH
designation status in the analysis process (Section 4.4).
We complied with restrictions on the confidentiality of data: patient transfer data only
provides the sum of the number of patients transferred between hospitals to prevent the
identification of individual patients. The data contains the ID number of each hospital and
the number of patients transferred in and out. Among the types of patient transfer, those
discharged after receiving treatment at the first hospital visited (stay), and those transferred
to another hospital without receiving treatment at the first hospital visited and discharged
after receiving treatment at the transferred hospital (transfer and stay) are examined. In
practice, there are other cases where patients receive treatment at the first hospital and
are discharged from the transferred hospital. The last case involves three hospitals where
patients are transferred from the first hospital to the second hospital to receive treatment
and then transferred to a third hospital to get discharged. As such cases account for about
Appl. Sci. 2023, 13, 5241 6 of 21
0.43% of the total number of patients, this study only examined the two transfer types that
accounted for the most (99.57%).
of differences in the cluster; ‡ Eij = number of undirected edges from cluster i to j, gi = number of nodes in
the ith cluster; ¶ l = number of clusters in the network, pintra = number of patients transferred within a cluster;
§ p
inter = number of patients transferred from other clusters, psend = number of patients transferred to other
clusters; || m = number of edges in the network, Aij = weight between the nodes ni and n j , Cni = cluster to which
node ni belongs; # | E| = number of undirected edges, N = number of nodes in the network.
The degree centrality of the ith node (ni ) in a cluster is the number of hospitals that sent
patients to the node (i.e., in-degree) out of the total number of hospitals within the cluster,
excluding the respective node ( g − 1). Among three options (i.e., in-degree, out-degree,
total degree) to calculate the degree centrality, the use of the in-degree in the calculation
was determined after a review of network measures and confirmed by the collaborating
Appl. Sci. 2023, 13, 5241 7 of 21
medical researchers who also participated in user task analysis (Section 3.4). They wanted
to focus on the role of the hub hospital in receiving and treating patients from nearby spoke
hospitals. We tested various thresholds of degree centrality to determine a hub hospital.
In this study, heuristically, a node with a value of 0.5 or higher is classified as a hub node.
The collaborating researchers used our tool to analyze 93 clusters from Louvain clustering
and found that 86 groups had a node located in the cluster’s center with a degree centrality
higher than 0.5. Three groups only had one node, and the remaining four had a central
node with a centrality between 0.3 and 0.5. As a result, the collaborators determined the
threshold as 0.5 in this research.
Measures calculated for each cluster include group degree centralization, intra-cluster
density, and inter-cluster density, which are used for community analysis of networks
in previous studies [25,26]. Regarding group-degree-centralization, the previous study
calculated the value for the undirected network. Thus, we adjusted the denominator from
[(g − 1)(g − 2)] to [(g − 1)(g − 1)] to accommodate the characteristics of the directed net-
work (i.e., minimum in-degree is 0). Furthermore, three types of patient transfer rates were
calculated for each cluster based on feedback from medical researchers. The proportion
of patients transferred within each cluster, the proportion of patients transferred out to
other clusters, and the proportion of patients transferred in from other clusters are calcu-
lated using the number of patients who visited the hospital in each cluster at least once
(i.e., excluding stay patients) as the denominator.
Indices calculated for the entire network included modularity [14], which is calculated
for community detection, and three densities (i.e., global, mean intra-cluster, and mean inter-
cluster), which are used to evaluate community detection results in a previous study [26].
In this study, the adequacy of community detection is examined by determining if three
density values satisfied K inter < K < K intra .
investigate patient distribution by transfer types with exploratory analysis. Moreover, they
also showed interest in investigating the distribution over the whole nation, provinces,
and communities.
Figure 2. PatientFlow interface. (a) A list of interactive charts presents network characteristics. One
Figure 2. PatientFlow interface. (a) A list of interactive charts presents network characteristics. One
can filter or highlight items by brushing or pointing. Patient transfer distances less than 150 km are
can filter or highlight items by brushing or pointing. Patient transfer distances less than 150 km are
selected, and one can confirm the applied filter with chips on top of the screen. (b) The topology
selected, and one
view presents can confirm
hospitals the applied
and patient transfersfilter with chips
between them on on top
a map. of the
(c) screen.
The bar (b) The
chart topology
depicts the
view
number presents hospitals
of patients andhospital
for each patient in transfers between (i.e.,
three categories themtransfer-in,
on a map.transfer-out,
(c) The bar and
chartstay).
depicts
(d)
the number
A list of patients
of clusters yieldedfor each
from hospital in three
a community categories
detection (i.e., is
algorithm transfer-in,
presentedtransfer-out, and stay).
in three visualizations
(node-link
(d) A list of diagram, chord diagram,
clusters yielded and bar chart).
from a community Eachalgorithm
detection is designed to supportinanalyzing
is presented topology,
three visualizations
patient transfer
(node-link patterns,
diagram, chordand the number
diagram, and bar of chart).
patientsEach
for each hospital.
is designed to The totalanalyzing
support number oftopology,
clusters
and selected
patient clusters
transfer are and
patterns, shownthe on top ofofthe
number view (e.g.,
patients 5 out
for each of 95 clusters
hospital. The total are selected)
number (e) The
of clusters
and selected clusters are shown on top of the view (e.g., 5 out of 95 clusters are selected) (e) Thea
comparison view supports quantitative comparison between the selected clusters. One can select
network measure from the list on the left to make a comparison. (f) The province view presents
comparison view supports quantitative comparison between the selected clusters. One can select
patient transfers between provincial-level regions with a directed node-link diagram and a diverg-
a network measure from the list on the left to make a comparison. (f) The province view presents
ing bar chart. Depending on the transfer direction, the number of patients is colored in blue or or-
patient
ange. Atransfers
table andbetween provincial-level
a bar chart on the bottom regions
show with a directed
the hub hospitals node-link
within thediagram
region.and a diverging
One can click
bar
on achart. Depending
hospital on the
in any chart transfer
to locate direction,
it in the number
the topology view. of patients is colored in blue or orange.
A table and a bar chart on the bottom show the hub hospitals within the region. One can click on a
hospital
Forineach
any chart
clusterto locate
derived it infrom
the topology view.
the community-detection results, we used bubble sets
[28] to show the assumed coverage. (T4) It could show the geographical size of the cluster
For each cluster derived from the community-detection results, we used bubble
more clearly than simply coloring the node markers and could also aid in estimating the
sets [28] to show the assumed coverage. (T4) It could show the geographical size of
catchment area. However, the patient’s address or point of departure when visiting the
the cluster more clearly than simply coloring the node markers and could also aid in
hospital was missing from our data. Thus, it does not represent accurate catchment areas
estimating the catchment area. However, the patient’s address or point of departure when
as in [2], where Adeoye et al. assessed the accessibility of hospitals by region and visual-
visiting the hospital was missing from our data. Thus, it does not represent accurate
ized it. Still, it is possible to indirectly identify regions not participating in the treatment
catchment areas as in [2], where Adeoye et al. assessed the accessibility of hospitals by
network for stroke patients. Moreover, any patient transfer between distant hospitals can
region and visualized it. Still, it is possible to indirectly identify regions not participat-
be confirmed immediately through more giant bubbles spanning several areas.
ing in the treatment network for stroke patients. Moreover, any patient transfer between
distant hospitals can be confirmed immediately through more giant bubbles spanning
several areas.
The total number of patients that passed through each hospital was another aspect that
collaborating researchers wanted to examine besides topology (T5). For each hospital, there
are patients transferred to another hospital (transfer-out), transferred from other hospitals
(transfer-in), and received treatment without being transferred (stay). The researchers
wanted to examine the distribution of all three kinds of patients. Therefore, we included
a view showing the number of patients per hospital with a stacked bar chart under the
topology view (Figure 2c). The color encoding is kept consistent throughout the system,
with blue, orange, and red indicating transfer-in, transfer-out, and stay, respectively.
Appl. Sci. 2023, 13, 5241 11 of 21
During data analysis, we found far more stay cases than the other types (i.e., transfer-in
or transfer-out). As a result, the number of stay cases hindered the visual examination
of patient distribution for transferred patients. Thus, we placed a legend at the top right
corner, which the user could toggle the visibility of each patient type by clicking an item
in the legend. We also enabled grouping bars by provinces and changing the order of
hospitals by the number of patients: ascending/descending order of stay, transfer-in, or
transfer-out patients. One can also zoom in to narrow the analysis scope by brushing the
chart. As there are more than 1000 hospitals, one might have to drill down several times by
consecutive brushings. When a user brushes several times, buttons appear next to the title
on top, indicating each drill-down step. One can click one of them to roll back to that state.
If one finds a hospital of interest after brushing and browsing, one can locate it in another
view by clicking on the bar. A user can change the mouse interaction mode (i.e., clicking or
brushing) by toggling the button in the bottom right corner.
Task analysis results indicated that the researchers demanded multi-perspective analy-
sis with quantitative measures as well as analysis on topology and geographical distribution
(T1). We placed several interactive charts on the left side of the overview area to comply
with this demand (Figure 2a). With a histogram of link distance, a user can analyze the
distribution and filter links by brushing the chart. A boxplot of link distance for each
province depicts the distribution and outliers (Figure 3). Regarding inter-province transfers,
we count them based on the origin province. We assumed that the origin of long-distance
transfer would be more interesting than the destination, as it might indicate a poor local
network at the origin province. For instance, KAW province has a comparably significant
deviation. When one clicks the outlying circle, the corresponding link becomes highlighted
in red (Figure 3). One can also perform a similar examination at the cluster level with a
brushable bar chart. As some clusters only have a couple of hospitals as members, visu-
alizing with a boxplot was not applicable in some cases. Thus, we introduced a bar chart
showing each cluster’s longest link distance (Figure 4a). We also prepared other bar charts
depicting network measures of hub nodes and clusters (i.e., measures in Table 2). For
example, there is a bar chart for degree centrality (Figure 4b), where the color strip on the
right shows the value range for the primary hub (red), secondary hub (orange and yellow),
and spoke hospital. It could aid in selecting each class of hospital. We also included a bar
Appl. Sci. 2023, 13, x FOR PEER REVIEW chart (Figure 2a) to visualize the number of hospitals by size and SCH designation status 12 of 21
(e.g., A-SCH, B-SCH, B, C, and D).
Figure 3. Box
Figure plot
3. Box of of
plot the patient
the patienttransfer
transfer distance aggregatedbyby
distance aggregated province.
province. OneOne
can can
clickclick the outlier
the outlier
dotsdots
in the box plot to highlight the transfer link in the map.
in the box plot to highlight the transfer link in the map.
Figure 3. Box plot of the patient transfer distance aggregated by province. One can click the outlier
Appl. Sci. 2023, 13, 5241 12 of 21
dots in the box plot to highlight the transfer link in the map.
(a) (b)
Figure 4. Brushable bar charts
Figure for (a) the
4. Brushable longest
bar link
charts for (a)within eachlink
the longest cluster
withinand
each(b) the degree
cluster and (b) centrality
the degree centrality
of each hospital. Darker gray background indicates the brushed area (i.e., selected range).
of each hospital. Darker gray background indicates the brushed area (i.e., selected The colorThe color
range).
strip on the right sidestrip
of the degree
on the centrality
right side chart centrality
of the degree shows the value
chart range
shows for arange
the value primary
for a hub (red),
primary hub (red), a
a secondary hub (orange and yellow),
secondary andand
hub (orange a spoke hospital
yellow), (blue).
and a spoke hospital (blue).
Figure 6. Analysis of provincial-level patient transfers. (a) The province of interest (i.e., KWJ) lies in
Figure 6. Analysis of provincial-level patient transfers. (a) The province of interest (i.e., KWJ) lies in
the center, and other linked provinces encircle it. One can notice that KWJ receives many patients
the center, and other linked provinces encircle it. One can notice that KWJ receives many patients
from CLN province, judging from a thick link on the left and the longest bar on the right. (b) As
for CLN province, it mostly sends patients to others except for three provinces. (c) The table on the
left shows the complete list of hubs in the area. One can identify the hub with the most significant
number of patients in the stacked bar chart on the right (i.e., hospital ID 425). When one clicks the
row or bar, the cluster with the hub becomes highlighted, and one can locate them in the cluster view.
(d) The chord diagram shows that hospital ID 425 mostly receives patients from hospitals within the
cluster but sends patients to hospitals outside the provinces judging from the bar chart in (c). (e) One
can confirm it in the topology view by selecting the cluster and hovering a mouse on the hub. There
are connected dark dray nodes outside the province.
There are two visualizations in the bottom row to display information on SCHs in the
province (Figure 6c). The table on the left shows a complete list of SCHs. The columns
contain a district, hospital type, and the number of patients who transferred in/out and
stayed. Since tabular visualization is challenging to observe the overall trend, we put a
stacked bar chart on the right that shows the three types of patient counts. We linked the
two visualizations with the others (T2) and clicking a specific hospital in the table or the
stacked bar chart highlights it in every linked visualization (e.g., topology view and cluster
view). Clicking the column title sorts the items in the table and the stacked bar chart. As in
the cluster view, we added a button on top to toggle whether to include stayed patients or
not in the visualizations.
5. Evaluation
5.1. Case Study
We conducted a case study with two neurologists and a clinical statistician to evaluate
the effectiveness of the proposed visualization tool. Since these researchers intended to
derive clinical insights such as EVT outcome and mortality rate based on the findings
through this framework, they attempted to use the tool in their own space. Moreover, due
to the COVID-19 pandemic, we had to conduct the study online. Thus, we deployed the
tool to the experts and encouraged to use it for a couple of weeks and then interviewed
them for an hour.
One of the common requests of the participants was verifying whether community
detection was successful. They could validate the results by comparing the three density
values in Table 2 and checking the modularity calculated from the Louvain algorithm. The
validation result indicated that the communities were well derived. Then, they tried to ex-
amine the derived structure visually. At first, they used a node-link diagram, but thousands
of nodes caused excessive visual clutters. Thus, they switched to a geographical layout
to analyze distributions on a map and used the bubble sets to examine the community’s
coverage (Figure 2b). Since the bubbles were semi-transparent, the participants could notice
that the east and southwest regions had a relatively small number of clusters. They found
Appl. Sci. 2023, 13, 5241 16 of 21
several outlying clusters with long-distance transfers by hovering a cursor on the bubbles.
After a few filtering interactions in the distance bar chart (Figure 2b), they concluded that
excluding links over 150 km of transfer distance would be appropriate for their dataset.
A priori, neurologists expected that patient transfer in a specific province would show
different patterns compared to other regions. They tried to verify this widely known
expectation. In the province view, they browsed patient transfers for each region and
noticed singularities in some province pairs. Most provinces showed mutual patient
transfers between them, even if the number of patients for each direction were somewhat
biased toward one side. However, visual analysis with a node-link diagram and diverging
bar chart identified some province pairs (e.g., CLN and KWJ) with dominant transfer
directions (Figure 6a,b). In order to take a closer look, they clicked hospital ID 425 in
the table (Figure 6c), which showed the highest number of patients among the SCHs in
the CLN province. As they checked the belonging cluster, they found that the hospital
received patients from the rest of the cluster, resulting in a degree centrality of 1.0. In the
geographical map, they could find a reason for this discrepancy hospitals sent patients to
the hospitals in neighboring provinces.
The case study participants observed the potential of applying a network analysis
(i.e., analysis of community detection results and network measures) to the cases of hospitals
in the Republic of Korea. The preliminary result from the case study led to a clinical research
opportunity. In this section, we summarized additional use cases where PatientFlow could
aid in finding notable patient transfer patterns. With limited access to medical data, we
could not draw clinical implications from the following cases at the time of writing. Thus,
we asked the case study participants to review them with their domain knowledge, and
they regarded the cases as prominent patterns for future research.
In the bar chart of hospital types (Figure 2a), one could notice that all quaternary
hospitals (i.e., A) are SCHs, about 75% of tertiary hospitals (i.e., B) are SCHs; and none of
the secondary (i.e., C) and primary hospitals (i.e., D) are SCHs. One could assume that
SCH is designated to lead the care of stroke patients among hospitals nearby. Thus, the
user searched for uncommon hospitals with a degree centrality smaller than 0.5 in A-SCH
hospitals. The user clicked the A-SCH bar in the bar chart of hospital types to filter out
other types. Then, the user brushed the lower area in the degree centrality bar chart to
examine hospitals with a degree centrality less than 0.5. Four hospitals in KAW, CCB, CCN,
and CLN provinces matched the conditions. The user clicked the node in the geographical
topology view, and he/she could notice that two of them were secondary hubs, but the
rest were spokes in the cluster (highlighted with red outlines in Figure 5a,b). In the case of
hospital ID 83, it showed a tendency to receive patients from the others within the cluster.
However, the hub hospital in the cluster (i.e., hospital ID 88) received significantly more
patients even though it was not an SCH (Figure 5a). Regarding hospital ID 434, it did not
receive any patients from other hospitals. Instead, it sent most of the visited patients to
one of the hub hospitals in the cluster, which is also A-SCH. Considering that the other
30 A-SCHs acted as a primary or a secondary hub, it might require further examination
with additional clinical data.
During the case study, the participants used group degree centralization and the
number of hubs in the cluster as criteria for classifying clusters. While such classification
sufficed the demand, a user took a step forward by analyzing additional metrics for network
topology. When a hospital receives patients from all the others in the cluster, group degree
centralization becomes 1.0. Thus, the user selected five clusters (Figure 7a) with identical
group degree centralization (i.e., 1.0) and star topology (i.e., cluster without inter-spoke
links). Within the cluster, all five of them seemed similar to the user. Then, he/she selected
inter-cluster density to investigate connectivity between clusters to find differences between
the five clusters (Figure 7b). He/she wanted to investigate the cause and selected three
transfer rates to analyze them in a stacked bar chart (Figure 7c). As a result, the user could
identify two distinctive patterns: sending more patients to other clusters (e.g., cluster ID 1
and 3), and vice versa (e.g., cluster ID 5). When analyzing topology within a cluster, all
identical group degree centralization (i.e., 1.0) and star topology (i.e., cluster without in-
ter-spoke links). Within the cluster, all five of them seemed similar to the user. Then,
he/she selected inter-cluster density to investigate connectivity between clusters to find
differences between the five clusters (Figure 7b). He/she wanted to investigate the cause
Appl. Sci. 2023, 13, 5241 and selected three transfer rates to analyze them in a stacked bar chart (Figure 7c).17As a
of 21
result, the user could identify two distinctive patterns: sending more patients to other
clusters (e.g., cluster ID 1 and 3), and vice versa (e.g., cluster ID 5). When analyzing topol-
ogy within a cluster, all five clusters would be similar. However, if the goal of the analysis
five clusters would be similar. However, if the goal of the analysis is related to inter-cluster
is related to inter-cluster transfers, the user might need another perspective to investigate
transfers, the user might need another perspective to investigate them properly.
them properly.
6. Conclusions
This study proposed a framework for deriving and analyzing a community from
patient transfer data, geographical location, and hospital type (i.e., size and whether
designated as SCH). The framework adopted the methodologies and suggested network
measures in prior work [1,7]. Unlike cases with a clear hypothesis to be tested or the
aim of making a predictive model for fixed variables, medical researchers performed
an exploratory analysis to understand the topology at the time data were collected. We
designed PatientFlow for a national-level overview and interactive analysis of inter-hospital
coordination. Users can also examine patients’ transfer patterns at the cluster and provincial-
level administrative district units. To this end, PatientFlow supports visual and quantitative
analyses with multiple-coordinated views.
Our approach fulfilled the requirement of the collaborating medical researchers. It
led to clinical implications with additional medical data and inspired follow-up research.
However, at the time of research, we could not access the clinical records of the medical
research institution. Since the patient outcome at the hospital played an essential role in
prior work [7], the researchers had to rely on a separate tool to analyze them with the
findings from our tool. Regarding quantitative analyses, we are working on integrating
the clinical records into our framework and adopting additional measures. For instance,
Appl. Sci. 2023, 13, 5241 19 of 21
additional network measures, such as eigen centrality or page rank, could be introduced to
analyze the topology from another perspective. Moreover, additional data could introduce
a subgroup analysis as in the prior work [9] on HAIs. We could divide patients into
subgroups by gender, age, diabetes, or other meaningful factors for stroke treatment.
We are also working on adopting another community detection algorithm. The
Louvain algorithm identified a total of 93 communities in this work. Even though we
introduced interactivity (i.e., brushing-and-linking, zooming, and filtering) in our tool,
communities located near the metropolitan area yielded visual clutter from occlusions.
Jittering nodes resolved the problem to some extent, but overlapping bubble sets made it
challenging to analyze the clusters in the topology view. Additional efforts on grouping
adjacent clusters using district information and the current patient transfer information
might help mitigate the occlusion problem. A visual comparison of multiple community
detection results would also help determine the appropriate algorithm heuristically. Such
an approach would lead to examining the changes in network structure over time, which
has been a subject of interest in prior work [1,5]. It could also aid in comparing the derived
communities to planned catchment areas for policy making.
One can use our research to analyze hospital networks for cases where interhospital
transfers are inevitable. For instance, patient transfers are prevalent in the emergency
department [31], but some studies reported challenges in arranging interhospital trans-
fers [32,33]. The COVID-19 outbreak has also introduced the need for network analysis.
Negative pressure rooms were proven effective for treatment [34], but the limited number
of rooms in each hospital might cause patient transfers to other hospitals. In such cases,
PatientFlow could aid clinical researchers in analyzing hospital networks and finding
communities with a low level of cooperation.
Author Contributions: Conceptualization, K.K., J.P. and H.S.; methodology, K.K. and H.S.; software,
K.K., J.P. and H.S.; validation, K.K., J.P. and H.S.; formal analysis, K.K. and H.S.; investigation, K.K.
and H.S.; resources, H.S.; data curation, K.K., J.P. and H.S.; writing—original draft preparation,
K.K. and H.S.; writing—review and editing, H.S.; visualization, K.K., J.P. and H.S.; supervision,
H.S.; project administration, H.S.; funding acquisition, H.S. All authors have read and agreed to the
published version of the manuscript.
Funding: This work was support by the Soongsil University Research Fund (New Professor Support
Research) of 2020.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: The data presented in this study are available on request from the
corresponding author with the permission of HIRA (Health Insurance Review and Assessment
service). The data are not publicly available due to privacy.
Acknowledgments: We thank Hee-Joon Bae, Jihoon Kang, and Seong Eun Kim (Department of
Neurology at Seoul National University Bundang Hospital) for their help and valuable comments on
this research.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Zachrison, K.S.; Hsia, R.Y.; Li, S.; Reeves, M.J.; Camargo, C.A.; Yan, Z.; Onnela, J.; Schwamm, L.H. Ischemic Stroke Systems of
Care in California: Evolution in the Organization During the Mechanical Thrombectomy Era. Stroke Vasc. Interv. Neurol. 2022,
2, e000206. [CrossRef]
2. Adeoye, O.; Albright, K.C.; Carr, B.G.; Wolff, C.; Mullen, M.T.; Abruzzo, T.; Ringer, A.; Khatri, P.; Branas, C.; Kleindorfer, D.
Geographic Access to Acute Stroke Care in the United States. Stroke 2014, 45, 3019–3024. [CrossRef] [PubMed]
3. Berkhemer, O.A.; Fransen, P.S.S.; Beumer, D.; van den Berg, L.A.; Lingsma, H.F.; Yoo, A.J.; Schonewille, W.J.; Vos, J.A.; Nederkoorn,
P.J.; Wermer, M.J.H.; et al. A Randomized Trial of Intraarterial Treatment for Acute Ischemic Stroke. N. Engl. J. Med. 2015, 372,
11–20. [CrossRef]
Appl. Sci. 2023, 13, 5241 20 of 21
4. Goyal, M.; Demchuk, A.M.; Menon, B.K.; Eesa, M.; Rempel, J.L.; Thornton, J.; Roy, D.; Jovin, T.G.; Willinsky, R.A.;
Sapkota, B.L.; et al. Randomized Assessment of Rapid Endovascular Treatment of Ischemic Stroke. N. Engl. J. Med. 2015, 372,
1019–1030. [CrossRef] [PubMed]
5. Zachrison, K.S.; Onnela, J.-P.; Hernandez, A.; Reeves, M.J.; Camargo, C.A.; Cox, M.; Matsouaka, R.A.; Metlay, J.P.; Goldstein, J.N.;
Schwamm, L.H. Ischemic Stroke Transfer Patterns in the Northeast United States. J. Stroke Cerebrovasc. Dis. 2019, 28, 295–304.
[CrossRef] [PubMed]
6. Zachrison, K.S.; Dhand, A.; Schwamm, L.H.; Onnela, J.-P. A Network Approach to Stroke Systems of Care. Circ Cardiovasc. Qual.
Outcomes 2019, 12, e005526. [CrossRef]
7. Lee, H.H. Significance and Limitations of the 2nd Basic Public Health Care Plan from the Perspective of Expanding Public Health
Care. Public Health Aff. 2021, 5, e3. [CrossRef]
8. Fernández-Gracia, J.; Onnela, J.-P.; Barnett, M.L.; Eguíluz, V.M.; Christakis, N.A. Spread of Pathogens in the Patient Transfer
Network of US Hospitals. In Social, Cultural, and Behavioral Modeling; Lee, D., Lin, Y.-R., Osgood, N., Thomson, R., Eds.; Springer
International Publishing: Cham, Switzerland, 2017; Volume 10354, pp. 271–280.
9. Nekkab, N.; Astagneau, P.; Temime, L.; Crépey, P. Spread of Hospital-Acquired Infections: A Comparison of Healthcare Networks.
PLoS Comput. Biol. 2017, 13, e1005666. [CrossRef]
10. Clauset, A. Finding Local Community Structure in Networks. Phys. Rev. E 2005, 72, 026132. [CrossRef]
11. Dong, X.; Tang, D.; Tang, C. Social Network Analysis in China’s Hospital Healthcare. Phys. A Stat. Mech. Appl. 2021, 565, 125546.
[CrossRef]
12. Blondel, V.D.; Guillaume, J.-L.; Lambiotte, R.; Lefebvre, E. Fast Unfolding of Communities in Large Networks. J. Stat. Mech. 2008,
2008, P10008. [CrossRef]
13. Sharma, R.; Zachrison, K.S.; Viswanathan, A.; Matiello, M.; Estrada, J.; Anderson, C.D.; Etherton, M.; Silverman, S.; Rost, N.S.;
Feske, S.K.; et al. Trends in Telestroke Care Delivery: A 15-Year Experience of an Academic Hub and Its Network of Spokes. Circ.
Cardiovasc. Qual. Outcomes 2020, 13, e005903. [CrossRef]
14. Dugué, N.; Perez, A. Direction Matters in Complex Networks: A Theoretical and Applied Study for Greedy Modularity
Optimization. Phys. A Stat. Mech. Appl. 2022, 603, 127798. [CrossRef]
15. Komarek, A.; Pavlik, J.; Sobeslav, V. Network Visualization Survey. In Computational Collective Intelligence; Núñez, M., Nguyen,
N.T., Camacho, D., Trawiński, B., Eds.; Springer International Publishing: Cham, Switzerland, 2015; Volume 9330, pp. 275–284.
16. Vehlow, C.; Beck, F.; Weiskopf, D. Visualizing Group Structures in Graphs: A Survey: Visualizing Group Structures in Graphs.
Comput. Graph. Forum 2017, 36, 201–225. [CrossRef]
17. Andrienko, G.; Andrienko, N.; Chen, W.; Maciejewski, R.; Zhao, Y. Visual Analytics of Mobility and Transportation: State of the
Art and Further Research Directions. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2232–2249. [CrossRef]
18. Wood, J.; Slingsby, A.; Dykes, J. Visualizing the Dynamics of London’s Bicycle-Hire Scheme. Cartographica 2011, 46, 239–251.
[CrossRef]
19. Wood, J.; Dykes, J.; Slingsby, A. Visualisation of Origins, Destinations and Flows with OD Maps. Cartogr. J. 2010, 47, 117–129.
[CrossRef]
20. Andrienko, G.; Andrienko, N.; Fuchs, G.; Wood, J. Revealing Patterns and Trends of Mass Mobility Through Spatial and Temporal
Abstraction of Origin-Destination Movement Data. IEEE Trans. Vis. Comput. Graph. 2017, 23, 2120–2136. [CrossRef]
21. von Landesberger, T.; Brodkorb, F.; Roskosch, P.; Andrienko, N.; Andrienko, G.; Kerren, A. MobilityGraphs: Visual Analysis of
Mass Mobility Dynamics via Spatio-Temporal Graphs and Clustering. IEEE Trans. Vis. Comput. Graph. 2016, 22, 11–20. [CrossRef]
22. Vrotsou, K.; Fuchs, G.; Andrienko, N.; Andrienko, G. An Interactive Approach for Exploration of Flows Through Direction-Based
Filtering. J. Geovisualization Spat. Anal. 2017, 1, 1. [CrossRef]
23. Yang, C.; Zhang, Z.; Fan, Z.; Jiang, R.; Chen, Q.; Song, X.; Shibasaki, R. EpiMob: Interactive Visual Analytics of Citywide Human
Mobility Restrictions for Epidemic Control. IEEE Trans. Vis. Comput. Graph. 2022, in press. [CrossRef]
24. Newman, M.E.J. Modularity and Community Structure in Networks. Proc. Natl. Acad. Sci. USA 2006, 103, 8577–8582. [CrossRef]
[PubMed]
25. Wasserman, S.; Faust, K. Social Network Analysis: Methods and Applications; Structural Analysis in the Social Sciences; Cambridge
University Press: Cambridge, UK, 1994.
26. Miasnikof, P.; Shestopaloff, A.Y.; Bonner, A.J.; Lawryshyn, Y.; Pardalos, P.M. A Density-Based Statistical Analysis of Graph
Clustering Algorithm Performance. J. Complex Netw. 2020, 8, cnaa012. [CrossRef]
27. Bostock, M.; Ogievetsky, V.; Heer, J. D3 Data-Driven Documents. IEEE Trans. Vis. Comput. Graph. 2011, 17, 2301–2309. [CrossRef]
28. Collins, C.; Penn, G.; Carpendale, S. Bubble Sets: Revealing Set Relations with Isocontours over Existing Visualizations. IEEE
Trans. Visual. Comput. Graph. 2009, 15, 1009–1016. [CrossRef] [PubMed]
29. Plique, G. Graphology, a Robust and Multipurpose Graph Object for JavaScript. Zenodo 2022. [CrossRef]
30. Dkaki, T.; Mothe, J. An Energy-Based Model to Optimize Cluster Visualization. In Proceedings of the 2014 IEEE Eighth
International Conference on Research Challenges in Information Science (RCIS), Marrakech, Morocco, 28–30 May 2014; pp. 1–11.
31. Dunn, M.J.G.; Gwinnutt, C.L.; Gray, A.J. Critical Care in the Emergency Department: Patient Transfer. Emerg. Med. J. 2007, 24,
40–44. [CrossRef]
32. Hains, I.M.; Marks, A.; Georgiou, A.; Westbrook, J.I. Non-Emergency Patient Transport: What Are the Quality and Safety Issues?
A Systematic Review. Int. J. Qual. Health Care 2011, 23, 68–75. [CrossRef]
Appl. Sci. 2023, 13, 5241 21 of 21
33. Craig, S.S. Challenges in Arranging Interhospital Transfers from a Small Regional Hospital: An Observational Study. Emerg. Med.
Australas. 2005, 17, 124–131. [CrossRef]
34. Al-Benna, S. Negative Pressure Rooms and COVID-19. J. Perioper. Pract. 2021, 31, 18–23. [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.