Curran23 Supp
Curran23 Supp
HCM participants
In total 710 patients with a clinical diagnosis of HCM, either seen in the inherited cardiomyopathy service or referred for
CMR imaging, were consecutively enrolled into a prospective registry at the National Institute for Health Research (NIHR)
Royal Brompton Hospital Cardiovascular Biobank project between 2009-2015, of whom 436 were included in this study.
All participants provided written informed consent and the study was approved by the National Research Ethics Service
(19/SC/0257). HCM diagnosis was independently adjudicated by a cardiomyopathy specialist based on established clinical
and CMR criteria where all patients met the American Heart Association criteria for diagnosis.10 This was defined as a wall
thickness of 15mm or greater, or 13–14mm if there was a first degree relative with HCM, not explained by another cardiac
or systemic disease-causing abnormal loading conditions, or had disproportionate apical wall thickness and tapering in
keeping with an apical HCM phenotype.35
Patients were excluded from analysis based on age (< 16 years at time of CMR), missing demographic or clinical data,
contraindication to CMR, previous history of septal ablation, cardiac transplantation or myectomy at baseline. A history of
hypertension or diabetes was documented, as well as current medication at time of enrolment to the study. The cohort un-
derwent detailed clinical, imaging and genetic assessment. All patients underwent CMR for assessment of cardiac chamber
volumes and function (1.5T, Siemens Sonata or Avanto, Siemens Medical Systems, Erlangen, Germany). Variables reported
were collected at enrolment to the study. The CMRs selected for analysis were those closest to the date of enrolment
or the first diagnostic study available. Where present, left ventricular outflow tract obstruction (LVOTO) was confirmed
through stress echocardiography. Unrelated Singaporean patients with a diagnosis of HCM (n = 60) were prospectively
recruited from the National Heart Centre Singapore. Patients gave written informed consent to participate which was ap-
proved by the Singhealth Centralised Institutional Review Board (2020/2353) and Singhealth Biobank Research Scientific
Advisory Executive Committee (SBRSA 2019/001v1). Singaporean subjects underwent an equivalent CMR protocol at 1.5T
(Aera, Siemens, Erlangen, Germany) or 3T (Ingenia, Philips, Best, Netherlands).
Conventional CMR analysis was undertaken by accredited operators using semi-automated software (CMRtools, Car-
diovascular Imaging Solutions, London, UK).
UK Biobank participants
The UKB study recruited 500,000 participants aged 40 to 69 years old from across the United Kingdom between 2006 and
2010 (National Research Ethics Service, 11/NW/0382).36 This study was conducted under terms of access approval number
40616. In each case, written informed consent was provided.
A sub-study of UKB invited participants for CMR for assessment of cardiac chamber volumes and function using a
standard protocol (1.5T, Siemens Aera, Siemens Medical Systems, Erlangen, Germany).37 As a reference population, we
selected 16,691 participants that did not meet criteria for left ventricular hypertrophy and were classified as genotype neg-
ative (SARC-NEG) by having no variants in genes that may cause or mimic HCM (see Sequencing and variant categorisation).
External validation
Wall thicknesses were residualized by linear regression using sex and age at scan as covariates. In this way, we could
compare the similarities between the intra-cohort statistical properties of WT values. The first 5 principal components were
estimated from the development cohort and used to project the adjusted Singaporean WT values. The development cohort
principal component scores were used to fit two random forest models aimed at predicting the x and y tree coordinates.
Performances of the models were evaluated using a 10-fold cross-validation, repeated 3 times. The fitted models were
therefore used to predict the tree coordinates of the Singaporean individuals, from their principal component scores.
Faithfulness of the tree mapping for the Singaporean cohort was evaluated by the “trustworthiness” M1 measure, which
estimates how observations that are similar in the original high dimensional space are placed close to each other in the
low dimensional space. It ranges from 0 to 1, with larger values corresponding to a better representative low dimensional
mapping.13
In order to evaluate the consistency of the local statistical patterns between the development tree and the projected Sin-
gaporean individuals, we considered Spearman’s correlation between nearest neighbour points in the tree. We estimated
the correlations between the adjusted WT of the closest points in the development tree, and compared their distribution
with that of the correlations between the Singaporean WT and their closest points in the development tree. In order to
test differences in the distributions, we used a Wilcoxon test for difference of medians, and a Kolmogorov Smirnov test for
difference of distributions.
Supplementary Table I. Cluster stability at end diastole. Results from the analysis of the cluster stability for end diastolic wall thickness
(ED WT) in 1000 subsets. The possible values range between 0 and 1, where 0 means that the cluster is not stable, and 1 that the clusters
are identical in all repetitions. Between a resolution of 0.4 and 0.5, determined from the clustree plot (Supplementary Fig. II). We chose
the resolution of 0.5 because it is also characterized by more stable clusters. Cluster 2 is found as an intermediate state between cluster 0
and 1, and it is less stable than the other two.
Supplementary Table III. Pathogenic/likely pathogenic variant prediction from tree coordinates. Fitted parameters for the GAM
model used to predict individuals with P/LP variants using the 2 tree coordinates. 1 OR, Odds ratio; CI, Confidence interval
ED ES
Characteristic log(OR)1 95% CI1 p-value log(OR)1 95% CI1 p-value
(Intercept) -1.1 -1.4, -0.93 <0.001 -1.2 -1.4, -0.96 <0.001
s(Z1) 0.008 0.013
s(Z2) 0.2 0.062
Supplementary Table IV. Singaporean branch assignment. Singaporean HCM patients were assigned to the tree branches of their
nearest neighbours in the development tree. In both end diastole (ED) and end systole (ES), most of the individuals were assigned to
branches 1 and 4.
Branch ED ES
1 23 21
2 13 1
3 4 2
4 20 35
5 - 1
Supplementary Table V. Association between ES tree coordinates and genotype status. Coefficients of the logistic regression
between the ES tree coordinates and the genotype status (genotype == "P/LP"). The results for the whole cohort tree (left) and the
unrelated subjects tree are consistent. 1OR = Odds Ratio, CI = Confidence Interval
Supplementary Table VI. Association between ES tree coordinates and PRS. Coefficients of the logistic regression between the ES tree
coordinates and PRS (PRS == "high"). The results for the whole cohort tree (left) and the unrelated subjects tree are consistent. 1OR = Odds
Ratio, CI = Confidence Interval
Branch 1 2 3 4
1.32
Beta Coefficients
0
-1.32
B
Weight Body Surface Area ED Volume ACE/ARB ASA/Clopi Beta Blocker
1 1 1
1
* 1
** 1
Branch
Branch
No No No
3 Yes 3 Yes 3 Yes
2 2 2
** * **
Branch
Branch
Branch
4 4 4
***
***
***
***
***
***
70 80 90 100 1.7 1.8 1.9 2.0 2.1 100 120 140 160
1 1
** 1
*** Value
Weight BSA EDV 2 Value 2 Value 2
Branch
Branch
Branch
1
No No 2
3 Yes 3 Yes 3 3
Left Ventricle Mass LV Max Wall Thickness SV 4
4
*** 4 4
***
2 2 2
Branch
Branch
Branch
***
***
***
***
***
1
** 1
*
***
***
***
3 3 3 Value
2
*** Value 2
**
Branch
Branch
Apex
***
**
No Base
3 Yes 3
Mid
4 4 4
4
** 4
150 200 250 15.0 17.5 20.0 22.5 25.0 80 90 100 110 120 0 50 100150 0 50 100150
LVM LV Max WT SV Num. subjects Num. subjects
Supplementary Figure I. Phenotypic tree from 3D end-diastolic wall thickness. a. The projection of patients’ 3D end-diastolic (ED) wall thickness (WT) by the
DDRTree dimensionality reduction reveal the presence of four main branches that are associated to specific morphological changes of the myocardium. Each branch is
represented by the decimated ED atlas mesh, coloured accordingly to the beta coefficients resulting from testing the average difference between each branch individual
and the other subjects. The yellow contour denotes the areas with a beta significantly different from zero. b. The continuous and discrete phenotypic variables found to
be significantly associated to at least one branch. For left ventricular (LV) Gadolinium, labels are as follows: 1: None, 2: Minimal, 3: Moderate and 4: Severe. The
significance for the enrichment of discrete variables is reported within the bars. ACE, Angiotensin-converting enzyme inhibitors; Aff, affected; ARB, Angiotensin receptor
blockers; ASA, aspirin; Clopi, clopidogrel; LVOTO, Left ventricular outflow tract obstruction; SV, stroke volume. Only the significant pairs are reported with the symbols:
∗ 𝑃 ≤ 0.05; ∗∗ 𝑃 ≤ 0.01; ∗∗∗ 𝑃 ≤ 0.001; ∗∗∗∗ 𝑃 ≤ 0.0001, 𝑛 = 436.
4 0 1
0 1
3
0 1
UMAP2
2 0 2 1
0 2 1
1
0 2 1 3
0 1 2 0 3
1 2 0 3
−1 2 1 0 3
0 2 4 6 1 2 3 0 4
UMAP1
B D
Genotype NEG PLP VUS
Number of subjects
150
4
Genotype
3
100 NEG
UMAP2
2 P/LP
VUS
1 50
0
0
−1
0 1 2
0 2 4 6 Cluster
UMAP1
Supplementary Figure II. Selection of optimal resolution for clustering of end diastolic wall thickness. The optimal resolution for the Louvain partitioning is found
by inspecting the clustree plot (c.). In this case, the value of 0.5 was chosen, corresponding to the resolution with stable branching before any assignment mixing
(diagonal arrows), and the largest bootstrapping stability (Supplementary Table I). The UMAP projections in a. and b. show the parallelism between the clusters and the
genotypes. d. Genotype proportions by cluster.
0 1
8
0 1
0 1
UMAP2
0 1
6
0 1
0 1
4 0 2 1 3
0 2 1 3
0 2 4 1 3
B D
Genotype NEG PLP VUS
200
Number of subjects
8 150
Genotype
NEG
UMAP2
100 P/LP
6
VUS
50
4
0
0 1
0.0 2.5 5.0 7.5 10.0 Cluster
UMAP1
Supplementary Figure III. Selection of optimal resolution for clustering of end systolic wall thickness. The optimal resolution for the Louvain partitioning is found
by inspecting the clustree plot (c.). In this case, the value of 0.1 was chosen, corresponding to the resolution with stable branching before any assignment mixing
(diagonal arrows), and the largest bootstrapping stability (Supplementary Table II). The UMAP projections in a. and b. show the parallelism between the clusters and the
genotypes. d. Genotype proportions by cluster.
ES Branch 2
ES Branch 3
ES Branch 4
ES Branch 5
0.4
0.36
ED Branch 1 0.37 0.09 0.05 0.14 0.09
0.32
0.28
Jaccard Index
0.2
0.16
ED Branch 3 0.02 0.08 0.22 0.18 0.03
0.12
0.08
Supplementary Figure IV. Co-occurrence between end diastolic and end systolic tree branches. The Jaccard index of the subjects membership for the branches in
the DDRTree from ED and ES WT shows that branches 1 to 4 have the largest co-occurrence and they can be considered capturing a similar phenotypic subpopulation.
Branch 5 in end systolic DDRTree is not found in the end diastolic DDRTree and consists of an average sub-type of the cohort.
Frame ES
0.950
Survival probability (69 yrs)
0.925
0.900
0.875
Supplementary Figure V. Survival probability in end systolic branch 4. The more distal regions of branch 4 correspond to lower probability of survival at a
chronological age of 69 years. The OR between the distal and base points of the branch is 0.9773.
0
Z2
Z2
−4
−5
−8
Supplementary Figure VI. Predicted tree coordinates for the Singaporean cohort. The coordinates predicted by the two random forest models for the Singaporean
cohort follow the original spatial distribution of the development cohort, with few points falling outside the main structure of the tree, in both end diastole (a.) and end
systole (b.). Sing, Singaporean patients; RBH, Royal Brompton Hospital patients.
A ED B ES
0.5 0.5
Spearman's rho
Spearman's rho
dataset dataset
RBH RBH
Sing. Sing.
0.0 0.0
−0.5
−0.5
RBH Sing. RBH Sing.
Set Set
Supplementary Figure VII. Similarity between nearest tree points. Spearman’s correlation between the adjusted wall thickness of the nearest RBH points in the tree
follow the same distribution of the nearest Singaporean and RBH points, in both end diastole (a.) and end systole (b.). Sing, Singaporean patients; RBH, Royal Brompton
Hospital patients.
3 6 9 12 15
5
4
5 2
4 5
Component 2
2 63
Component 2
5 0
63
0
1
1 −5 7
−5 7
colour 10 16 18 21 State_merged 10 16 18 21
5
4 6
3
2
2
5
Component 2
63
0
1
1
−5 7 7
−20 −10 0 10
Component 1
Supplementary Figure VIII. Branch merging for ED phase. Intermediate results of the branch merging process for the ED phase data. The graph at the top-left shows
the branch labels determined by ‘monocle‘ that were progressively merged: a) short leaf branches, b) branches that did not have bifurcation into two different states. The
final branches qualitatively correlate with the hierarchical structure of the tree, where the root state correspond to the center of the tree.
1 4 7 10 1 3 7 10
colour
colour 2 5 8 11 2 5 8 11
3 6 9
4
4 5
5 4
2 1
Component 2
0
4
Component 2
0 2 1 3
3
−4
−4
−8 −8
−20 −10 0 10 20 −20 −10 0 10 20
Component 1 Component 1
State_merged 1 2 11 12 14
colour 1 2 11 12 14
4
1
5
4
2 1
Component 2
0 2
4
3 3
−4 5
−8
−20 −10 0 10 20
Component 1
Supplementary Figure IX. Branch merging for ES phase. Intermediate results of the branch merging process for the ES phase data. The graph at the top-left shows the
branch labels determined by ‘monocle‘ that were progressively merged: a) short leaf branches, b) branches that did not have bifurcation into two different states. The
final branches qualitatively correlate with the hierarchical structure of the tree, where the root state correspond to the center of the tree.
−8
−20 −10 0 10 20
Component 1 20
B
Pseudotime
0 10 20 30 10
5.0
2.5 1 0
5 3 8
2
Component 2
0.0 0 10 20 30 40
4 6
Pseudotime original tree
−2.5 7
−5.0
−7.5
−20 −10 0 10 20
Component 1
Supplementary Figure X. Relative positions of subjects in the original and unrelated subjects tree. The pseudotime calculated from the whole cohort tree (A) and
that without unrelated subjects (B) shows that the subjects relative positions are highly correlated (Spearman’s rho = 0.89) (C). This suggests that the tree generated after
removing the related subjects was consistent with the one from the whole cohort.
Supplementary Table VIII. Characteristics of Singaporean HCM cohort. BSA, body surface area; SBP, systolic blood pressure.
Supplementary Table IX. Cumulative hazard model. All cause mortality in individuals with hypertrophic cardiomyopathy carrying
pathogenic or likely pathogenic sarcomeric variants (SARC-P/LP) compared to those without variants in genes that may cause or mimic
HCM (SARC-NEG) and those with variants of uncertain significance (SARC-VUS), adjusted for Age, Sex and Race. n = 436; P = 0.002. 1 HR =
Hazard Ratio, CI = Confidence Interval
Stroke volume (ml) LVESV (ml) LV mass (g) SBP (mmHg) BSA (m2)
LVEDV (ml)
Supplementary Figure XI. Unsupervised clustering of clinical features and feature importance. a. Significant pairs of associations between identified clusters and
numerical features from the initial set of data. The line represents the interquartile range and median value. b. Significant one-vs-rest associations between clusters and
categorical features from the initial set of data grouped by significance level. The height of the curved bars illustrates the significance level (−𝑙𝑜𝑔10 𝑃 ). Only the significant
pairs are reported with the symbols: ∗ 𝑃 ≤ 0.05; ∗∗ 𝑃 ≤ 0.01; ∗∗∗ 𝑃 ≤ 0.001; ∗∗∗∗ 𝑃 ≤ 0.0001, 𝑛 = 436.