0% found this document useful (0 votes)
122 views31 pages

LLM Usecase

SHIELD is a predictive analytics system designed to assess risks in the electric vehicle (EV) battery supply chain by integrating Large Language Models (LLMs) with domain expertise. It employs schema learning for knowledge representation and disruption analysis for event extraction and prediction, outperforming traditional methods in accuracy and interpretability. The framework enhances decision-making through an interactive interface that incorporates expert feedback, ultimately improving supply chain risk management strategies.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
122 views31 pages

LLM Usecase

SHIELD is a predictive analytics system designed to assess risks in the electric vehicle (EV) battery supply chain by integrating Large Language Models (LLMs) with domain expertise. It employs schema learning for knowledge representation and disruption analysis for event extraction and prediction, outperforming traditional methods in accuracy and interpretability. The framework enhances decision-making through an interactive interface that incorporates expert feedback, ultimately improving supply chain risk management strategies.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

SHIELD: LLM-Driven Schema Induction for Predictive Analytics in EV

Battery Supply Chain Disruptions


Zhi-Qi Cheng1 Yifei Dong2 Aike Shi3 Wei Liu4 Yuzhi Hu5
Jason O’Connor1 Alexander G. Hauptmann1 Kate S. Whitefoot1
1
Carnegie Mellon University 2 Columbia University 3 Georgia Institute of Technology
4
University of Michigan, Ann Arbor 5 Boston University
Project Page: https://f1y1113.github.io/MFI/

Abstract
The electric vehicle (EV) battery supply
chain’s vulnerability to disruptions necessi-
tates advanced predictive analytics. We present
arXiv:2408.05357v2 [cs.AI] 21 Oct 2024

SHIELD (Schema-based Hierarchical Induc-


tion for EV supply chain Disruption), a system
integrating Large Language Models (LLMs)
with domain expertise for EV battery sup-
ply chain risk assessment. SHIELD com-
bines: (1) LLM-driven schema learning to con-
struct a comprehensive knowledge library, (2) a
disruption analysis system utilizing fine-tuned
language models for event extraction, multi- Figure 1: SHIELD’s process for EV battery supply
dimensional similarity matching for schema chain disruption prediction. The framework integrates
matching, and Graph Convolutional Networks LLM-driven schema learning with expert curation, en-
(GCNs) with logical constraints for predic- abling robust event extraction and prediction from di-
tion, and (3) an interactive interface for visu- verse news sources. This approach uniquely combines
alizing results and incorporating expert feed- LLM capabilities with domain expertise, enhancing both
back to enhance decision-making. Evaluated predictive accuracy and interpretability for proactive
on 12,070 paragraphs from 365 sources (2022- supply chain risk management.
2023), SHIELD outperforms baseline GCNs
and LLM+prompt methods (e.g. GPT-4o) in Traditional supply chain risk management ap-
disruption prediction. These results demon- proaches, which rely on rule-based reasoning and
strate SHIELD’s effectiveness in combining agent-based simulations (Gallab et al., 2019; Pino
LLM capabilities with domain expertise for en- et al., 2010; Giannakis and Louis, 2011, 2016; Blos
hanced supply chain risk assessment.
et al., 2015), often fall short in predictive accu-
1 Introduction racy and adaptability to dynamic market conditions.
The expected widespread adoption of electric ve- While machine learning (ML) and deep learning
hicles (EVs) is threatened by risks associated with (DL) techniques have enhanced predictive perfor-
the geographic and economic concentration of crit- mance (Hegde and Rokseth, 2020; Ruz et al., 2020;
ical battery minerals, such as lithium, cobalt, and Aljohani, 2023; Silva et al., 2017; Garvey et al.,
nickel. To enhance the resilience of the EV battery 2015; Carbonneau et al., 2008), they frequently
supply chain, manufacturers must anticipate disrup- sacrifice interpretability, limiting their practical ap-
tions caused by natural disasters and geopolitical plication. Recent studies employing large language
tensions. Proactive strategies and supply diversifi- models (LLMs) in supply chain management (Ray,
cation are essential to mitigate these risks1 . 2023; Wang et al., 2022a; Du et al., 2022; Shi
et al., 2024; Dror et al., 2022; Li et al., 2023) have
Completed by Y. Dong and Y. Hu during remote vis- focused on improving predictions but struggle to
its, and A. Shi and W. Liu during CMU internships. Z.
Cheng, Y. Dong, A. Shi, W. Liu, and Y. Hu contributed fully grasp complex domain-specific supply chain
equally. J. O’Connor, A. Hauptmann, and K. Whitefoot pro- knowledge. This limitation often leads to hallucina-
vided guidance. See Appx. L for details. Correspondence: tions and inaccuracies which, coupled with limited
zhiqic,alex@cs.cmu.edu, kwhitefoot@andrew.cmu.edu.
1
https://nncta.org/_files/documents/ interpretability, hinder the generation of actionable
chapter4-energy-critical-materials.pdf insights crucial for effective risk management.
Figure 2: Overview of the supply chain schema construction process, illustrating the collection of diverse sources,
schema extraction using large language models, and the integration into a unified schema library.

To address these challenges, we introduce SHIELD butions include: (1) an LLM-expert integration
(Schema-based Hierarchical Induction for EV sup- methodology for accurate, interpretable predic-
ply chain Disruption), a two-stage framework that tions; (2) a schema learning and news evaluation
integrates LLMs and domain expertise for predic- dataset spanning the EV battery lifecycle; (3) an
tive analytics in EV battery supply chains (Fig. 1): interactive schema curation system; and (4) ad-
vanced analytical techniques for supply chain anal-
1. Schema Learning (Sec. 3): We leverage LLMs
ysis. SHIELD offers a promising approach in sup-
(GPT-4o, Llama3-3b, Llama3-70b) to con-
ply chain risk management, addressing evolving
struct a comprehensive schema library—a
challenges across the EV industry and beyond.
structured representation of supply chain com-
ponents and their relationships—from diverse
2 Related Work
sources. An interactive system integrates ex-
pert knowledge, distilling supply chain ex- Supply Chain Risk Management. AI has been
pertise from specialized documents. This ap- increasingly applied to predict and manage supply
proach ensures analyses align with domain chain risks (Ganesh and Kalpana, 2022). Agent-
knowledge, capturing EV battery supply chain based approaches (Pino et al., 2010; Giannakis
complexities for accurate, interpretable predic- and Louis, 2011, 2016; Blos et al., 2015) facili-
tions that adapt to industry dynamics through tate inter-agent communication for forecasting but
continuous refinement. often suffer from limited predictive power and pa-
2. Disruption Analysis (Sec. 4): Building on our rameter constraints. Rule-based methods (Gal-
schema learning, we develop a comprehen- lab et al., 2019; Behret et al., 2012; Paul, 2015;
sive disruption prediction system. This sys- Paul et al., 2017; Awasthi et al., 2018; Camarillo
tem integrates fine-tuned RoBERTa (Liu et al., et al., 2018) offer decision frameworks with mini-
2019) for event detection, multi-dimensional mal quantitative insights. Machine Learning (ML)
similarity for matching, and Graph Convo- and Deep Learning (DL) techniques have improved
lutional Networks (GCNs) with logical con- forecasting and disruption prediction (Silva et al.,
straints for impact analysis. The resulting end- 2017; Hegde and Rokseth, 2020; Garvey et al.,
to-end system enables precise event extraction 2015; Ruz et al., 2020; Aljohani, 2023; Carbonneau
and reliable predictions in complex supply et al., 2008), yet many focus on predictive perfor-
chains, mitigating LLM hallucination risks mance at the expense of interpretability (Hendrik-
while maintaining efficiency. This approach sen, 2023; Makridis et al., 2023). Recent work has
offers a scalable solution for real-time supply explored large language models (LLMs) in supply
chain risk assessment and mitigation. chain management (Ray, 2023), but interpretabil-
ity remains a challenge. Our approach integrates
Evaluated on 12,070 paragraphs from 365 sources LLMs to enhance both predictive accuracy and in-
(2022-2023) (Sec. 5), SHIELD outperforms base- terpretability by extracting hierarchical knowledge-
line GCNs and LLM+prompt methods (e.g., GPT- graph structures to forecast disruptions.
4o) in disruption prediction. By integrating LLM
capabilities with domain expertise, this framework Schema Induction & Learning. Building on early
enhances supply chain risk assessment. Key contri- schema induction work (Anderson et al., 1979;
Evans, 1967), LLMs (Brown et al., 2020; Rae et al., the EV battery lifecycle, enabling our methods to
2021) have shown strong schema-learning abili- acquire expert knowledge for accurate, real-world
ties with minimal supervision. Recent strategies, predictions. More details are in Appx. B.1.
such as contextual explanations (Wei et al., 2021;
Schema Generation & Merging. Building upon
Lampinen et al., 2022), rationale-augmented mod-
our collected dataset, our Schema Learning Sys-
els (Wang et al., 2022b), and incremental prompt-
tem facilitates the extraction, visualization, and
ing (Li et al., 2023), have further refined schema
management of schemas from the 125 diverse tex-
induction. Transformer-based methods (Li et al.,
tual sources (Fig. 2). The process begins with data
2020, 2021) excel at schema representation through
cleaning using regular expressions and a locally
graph structures, with human feedback playing a
deployed Llama3-8b model. Subsequently, we em-
vital role in improving model accuracy (Mondal
ploy GPT-4o, Llama3-3b, and Llama3-70b with
et al., 2023; Yang et al., 2024; Zhang et al., 2023).
specific prompts to extract hierarchical structures
Our method leverages these advancements, com-
(H) capturing main events (E) and sub-events
bining human feedback with LLM-driven schema
(Esub ). More details are in Appx. C.
induction to improve accuracy and relevance in
disruption prediction. The extracted structures are then converted into in-
dividual schemas (Si ) and visualized as graphs,
Event Extraction & Analysis. Event extraction
demonstrating the hierarchical nature of the
has evolved from handcrafted features (Ahn, 2006)
schemas and the relationships between main events
to neural models like recurrent (Nguyen et al.,
and sub-events. These schemas are then integrated
2016; Sha et al., 2018), convolutional (Chen et al.,
into a singleSlibrary (Sfinal ), aggregating contexts
2015), graph (Zhang and Ji, 2021), and transformer- n
based networks (Liu et al., 2020). Advances in Snfinal = i=1 Ci ), merging events (Efinal =
(C
argument extraction (Wang et al., 2019), zero-shot i=1 Ei ), S
and updating event IDs for relations
(Rfinal = ni=1 Ri ). The detailed schema genera-
learning (Huang et al., 2018), and weak supervi-
tion and merging algorithm is provided in Appx. D.
sion (Chen et al., 2015) have boosted performance.
Our approach enhances event extraction by using To ensure efficient retrieval and updates, a dedi-
fine-tuned RoBERTa models and graph convolu- cated Database & Storage module manages schema
tional networks (GCNs) to capture complex event storage, while the Schema Management System in-
relationships and cascading effects, offering deeper corporates a Schema Viewer, Editor, collaboration
insights into supply chain disruptions compared to tools, and AI-driven suggestions are built to man-
traditional methods. age and annotate schemas (Appx. E). This human-
in-the-loop curated framework streamlines schema
3 Schema Learning for Supply Chain extraction and management, enabling interactive
Disruptions knowledge extraction from structured documents,
Schema Learning Dataset. Our dataset comprises leveraging supply chain experts’ insights.
239 diverse sources: 200 academic papers, 22 in- 4 Dynamic Analysis of Supply Chain
dustry reports, and 17 Wikipedia entries (Fig. 5 and
Disruptions
Fig. 6). This collection provides an up-to-date view
of the EV battery supply chain, covering advanced Supply Chain News Dataset. We developed an EV
battery technologies (e.g. LFP, NiMH), produc- Supply Chain News Dataset (January 2022 - De-
tion processes, and six key raw materials. We cat- cember 2023) to evaluate our system’s real-world
egorized events into 8 categories, three with long- performance (Appx. B.2). The dataset comprises
term impacts, subdivided into 18 subcategories. 247 articles from major news outlets and 118 en-
Our analysis includes five-year price trends for terprise reports from EV battery-related companies
all materials, correlated with 39 significant sup- (Fig. 7 and Fig. 8). After preprocessing—including
ply chain events. Industry expert feedback refined text extraction, language standardization, and noise
our categorization into 11 main categories with reduction—we obtained Meta data with 3,022 para-
27 subcategories, each illustrated with 1-2 real- graphs. We then fused international news with con-
world events (Tab. 6). The academic dataset was temporaneous corporate stories in the meta data,
distilled from 239 sources to 125 highly relevant creating 354 diverse documents comprising 12,070
entries. This dataset of over 1,000 events spans paragraphs. The final dataset contains approxi-
Figure 3: Overview of the supply chain disruption prediction pipeline, illustrating the integration of GCN-based
predictions, constrained prediction refinement, and argument coreference resolution.

mately 660K words (Table 9), providing a robust and score each event’s impact as:
foundation for evaluating supply chain disruption ImpactScore(ei ) = Centrality(ei )+Magnitude(ei )
detection and analysis. Comprehensive replication (5)
details, including the full dataset and preprocessing This scoring mechanism balances two crucial fac-
pipeline, are provided to ensure reproducibility. tors. Centrality(ei ) represents the event’s impor-
Event Extraction. Our pipeline extracts multi- tance within the network, reflecting its centrality or
faceted events from textual data, focusing on their influence in the supply chain context. Meanwhile,
impact on the EV battery supply chain. We begin Magnitude(ei ) quantifies the event’s impact inten-
with custom-trained SpaCy models2 for tokeniza- sity, indicating its severity or significance.
tion, sentence segmentation, named entity recogni- Finally, we apply logical constraints and argument
tion, and dependency parsing (Appx. F). coreference to ensure robustness:
Building on this, we deploy a fine-tuned RoBERTa LogicCoref(PC ) → PF (6)
model for cross-sentence event detection: producing a refined, logically consistent set of
event parameters PF . More implementation details
EventDetectmulti-sentence (T) → EC (1) are in Appx. F.
where T represents the input text and EC the de-
Event Matching & Instantiation. We link ex-
tected events. These events are then enriched with
tracted events with schema library to detect supply
contextual information using BERT:
chain disruption patterns using a multi-dimensional
approach of semantic and structural similari-
BERTcontext (EC ) → CE (2)
ties. We align each extracted event Eext ∈ Eext (ex-
generating contextual embeddings CE . To enhance tracted events) with each schema event Eschema ∈
analytical coherence, we implement coreference Eschema (schema events) using a composite similar-
resolution and event linking: ity:
Sim(Eext , Eschema ) = α · SemSim(Eext , Eschema )
CorefLink(EC ) → EL (3)
+ β · StrSim(Eext , Eschema )
This critical step, yielding linked events EL , main- (7)
tains contextual continuity across documents. Sub- where SemSim captures contextual meaning using
sequently, Conditional Random Fields (CRFs) ex- BERT embeddings, and StrSim assesses structural
tract event parameters PC : similarity. Specifically, semantic similarity mea-
sures contextual alignment using cosine similarity
CRF(EL ) → PC (4)
between BERT embeddings:
vext · vschema
Leveraging Graph Convolutional Networks SemSim(Eext , Eschema ) = (8)
(GCNs), we model complex event relationships ∥vext ∥∥vschema ∥
where vext and vschema are BERT embeddings of
2
https://spacy.io/models extracted and schema events. Similarly, structural
similarity evaluates parameter overlap using Jac- Disruption Prediction. Building on the extracted
card similarity: and matched events, we employ Graph Convolu-
tional Networks (GCNs), logical constraints, and
|Pext ∩ Pschema |
StrSim(Eext , Eschema ) = (9) argument coreference resolution to predict supply
|Pext ∪ Pschema | chain disruptions. Note that the events are repre-
where Pext and Pschema are the parameter sets for sented as nodes and interactions as edges using
the extracted and schema events. GCNs with the propagation rule:
Following the calculation of semantic and struc-
tural similarities, we refine matching using heuris- H(l+1) = σ(AH(l) W(l) ) (12)
tic rules from annotated datasets. Successful
where H(l) is the hidden state at layer l, A is the ad-
matches lead to event instantiation, enriching the
jacency matrix, W(l) is the weight matrix, and σ is
event representation with schema attributes:
a non-linear activation function. We optimize using
Instantiate(Ematched , Sschema ) → Einst (10) mean squared error loss with L2 regularization:
where Ematched represents the matched event,
N
Sschema yields the schema library, and Einst refers 1 X
L= (yi − ŷi )2 + λ∥W∥2 (13)
to the instantiated event with enriched attributes. N
i=1
To ensure logical adherence to schema constraints, where yi and ŷi are actual and predicted disruption
we perform consistency checks. These checks vali- scores, and λ is a regularization parameter. This
date the instantiated events against the schema li- approach balances prediction accuracy and model
brary, ensuring they conform to predefined logical complexity, preventing overfitting.
and structural constraints:
To ensure consistency with domain knowledge, we
ConsistencyCheck(Einst , Sschema ) (11) apply logical constraints, refining initial predictions
This step is crucial for maintaining the integrity of (ŷ) to produce final predictions (ŷ ′ ) that adhere to
the schema and the reliability of the predictions. known rules:
Finally, we incorporate a continuous improvement
ŷ ′ = arg min Constrain(ŷ)
process through manual review and feedback. Feed- ŷ ′ ∈Y (14)
back from domain experts is used to update and subject to C(ŷ ′ ) = true
refine the models, ensuring they adapt to new pat-
terns and maintain high performance. The complete where C represents constraint sets. For example, a
process is summarized in Algorithm 3. More im- constraint might ensure that a major supplier’s dis-
plementation details are in Appx. G. ruption increases risk for dependent manufacturers.
Algorithm 1 Supply Chain Disruption Prediction To further enhance the model’s contextual under-
1: Input: Historical supply chain events E, adjacency matrix standing, we incorporate argument coreference:
A, initial predictions ŷ
2: Output: Refined predictions ŷ ′
3: GCN-based Prediction ▷ Initial prediction using GCN Rij = arg, max; Coref(Ei , Ej )
4: for l = 1 to L do Ei ,Ej ∈E (15)
5: H(l+1) = σ(AH(l) W(l) ) ▷ Refer to Eq. 12
6: end for subject to Coref(Ei , Ej ) = true
7: ŷ ← H(L)
8: Constrained Prediction ▷ Apply logical constraints where (Ei , Ej ) denotes each event pair and Rij rep-
9: for each prediction ŷi do
10: ŷi′ ← Constrain(ŷi ) resents their relation. This AllenNLP-based model
11: such that C(ŷi′ ) = true ▷ Refer to Eq. 14 links entities across event mentions, recognizing
12: end for
13: Coreference Resolution ▷ Link related events
when different descriptions refer to the same inci-
14: for each pair of events (Ei , Ej ) do dent, thereby improving prediction accuracy and
15: Rij ← Coref(Ei , Ej ) ▷ Refer to Eq. 15 context comprehension. Algorithm 1 outlines our
16: if Rij is coreferential then
17: Link Ei and Ej
approach, combining GCN-based predictions, log-
18: end if ical constraints, and argument coreference resolu-
19: end for tion. Detailed examples and implementation guide-
20: Return: Refined predictions ŷ
lines are provided in Appx. H.
Table 1: Performance comparison of different LLMs on schema learning in stage 1.
ChatGPT4o Llama3-3b Llama3-70b
Individual Schemas Integrated Library Individual Schemas Integrated Library Individual Schemas Integrated Library
Precision 0.637 0.184 0.198 0.018 0.353 0.019
Recall 0.695 0.336 0.047 0.014 0.133 0.022
F-score 0.652 0.238 0.068 0.016 0.175 0.020

Table 2: Subjective evaluation by domain experts. erage F-score 0.687 vs 0.683 in 2022) indicates
Model Consistency Accuracy Completeness potential refinement in our model’s ability to adapt
GPT-4o 4.5 4.3 4.6 to evolving supply chain dynamics.
Llama3-3b 1.8 1.5 1.9
Llama3-70b 3.0 2.7 3.1 Disruption Detection. Our advanced GCNs model,
augmented with logical constraints and corefer-
ence resolution, was rigorously evaluated against
5 Experiments ablation versions and LLM+prompt methods. Ta-
Our evaluation comprises two parts: (1) Schema bles 4 and 5 present the comparative performance
Learning Assessment and (2) Supply Chain Disrup- metrics. The full system achieved the highest F-
tion Prediction. We assess learned schemas against score (0.732), significantly outperforming both ab-
expert knowledge and evaluate our schema induc- lation versions (GCNs+Logical Constraints: 0.707,
tion process’s effectiveness in predicting supply GCNs only: 0.685) and LLM+prompt methods
chain events. Detailed experimental setup and eval- (GPT-4o: 0.624). However, the incremental im-
uation metrics are in Appx. I. provement from the GCNs-only model to our full
system (0.685 to 0.732) suggests that while the
5.1 Schema Learning Performance additional components significantly enhance per-
formance, there remains substantial potential for
We evaluate GPT-4o, Llama3-3b, and Llama3-70b
further optimization in the future.
for schema learning, comparing individual schema
extraction and integrated library generation. Ta-
5.3 Qualitative Analysis & User Interface
bles 1 and 2 present quantitative metrics and sub-
jective evaluations by domain experts. GPT-4o out- Our qualitative analysis of SHIELD’s disruption
performs Llama models, achieving F-scores of predictions, focusing on real-world case studies
0.652 and 0.238 for individual schemas and in- (detailed in Appx. J), complements the quantitative
tegrated library generation, respectively. All mod- findings and further illuminates the system’s practi-
els perform better in individual schema extraction cal utility. A particularly salient example emerged
than integrated library generation, indicating chal- in SHIELD’s accurate prediction of a semiconduc-
lenges in schema integration. Subjective assess- tor shortage resulting from geopolitical tensions,
ments align with quantitative metrics, with GPT- made three weeks prior to widespread reporting.
4o scoring highest across all criteria (consistency: This early insight enabled proactive adjustments
4.5, accuracy: 4.3, completeness: 4.6). Individual to procurement strategies, thereby demonstrating
schemas show strong consistency and complete- the system’s considerable potential in mitigating
ness but slightly lower accuracy, suggesting a trade- complex supply chain risks. We have developed
off between comprehensive coverage and precise an interactive user interface (Fig. 4) for online dis-
detail representation. ruption analysis. This interface allows users to up-
load news report texts (Fig. 4a), evaluate prediction
5.2 Disruption Detection Performance scores, and edit visualization results for the final
Event Extraction & Matching. Table 3 presents disruption analysis (Fig. 4b). More details can be
quarterly results for 2022 and 2023 on event ex- found in Appx. K.
traction and matching using a supply chain news
dataset. Our system maintains consistent perfor- 5.4 Disruption Prediction Case Studies
mance across quarters, with F-scores ranging from Our system effectively forecasted key supply chain
0.671 to 0.700. This stability suggests robust gener- disruptions, providing insights that enabled stake-
alization across different time periods and varying holders to take proactive actions. For example, fol-
event types. The slight improvement in 2023 (av- lowing the passage of the Inflation Reduction Act
(a) User interface for inputting news reports. (b) Visualization and editing of final prediction results.
Figure 4: User interface for online disruption analysis in stage 2, showing the process from news report input to the
visualization and editing of prediction results. More examples are in Appx. K.
Table 3: Event extraction and matching in supply chain news dataset.
Year 2022 2023
Quarter Q1 (Jan-Mar) Q2 (Apr-Jun) Q3 (Jul-Sep) Q4 (Oct-Dec) Q1 (Jan-Mar) Q2 (Apr-Jun) Q3 (Jul-Sep) Q4 (Oct-Dec)
Precision 0.714 0.692 0.705 0.689 0.712 0.698 0.703 0.690
Recall 0.675 0.662 0.678 0.655 0.688 0.670 0.681 0.657
F-score 0.694 0.677 0.691 0.671 0.700 0.684 0.692 0.673

Table 4: Performance comparison of different models on disruption prediction.


Model Precision Recall F-score
Our System (GCNs only) 0.701 0.670 0.685
Our System (GCNs + Logical Constraints) 0.724 0.691 0.707
Our System (GCNs + Logical Constraints + Coreference) 0.754 0.712 0.732

Table 5: Performance comparison of direct human inter- decision-making in a volatile global market.
action with LLMs on disruption prediction.
Model Precision Recall F-score 6 Conclusion
GPT-4o 0.641 0.608 0.624
We present SHIELD, a two-stage framework that
Llama3-3b 0.522 0.489 0.505 integrates Large Language Models (LLMs) with
Llama3-70b 0.557 0.523 0.540 domain expertise, yielding promising results in EV
Our Method 0.754 0.712 0.732
battery supply chain analytics and risk assessment.
While demonstrating particular strength in early
(2022), which incentivized domestic EV battery disruption detection and event prediction for criti-
production, our system predicted potential material cal battery materials, significant challenges remain
shortages. By analyzing shifts in global material in schema integration, real-time adaptability, and
flows and the effects of policy changes on supply- error reduction. Future research will systematically
demand dynamics, it enabled early interventions address these limitations, enhance the system’s ro-
to minimize risks. Similarly, during geopolitical bustness, and explore broader applications across
tensions between Australia and China in 2023, our diverse industries and supply chain ecosystems.
system identified vulnerabilities in the lithium sup-
Acknowledgments
ply chain by monitoring export data and geopoliti-
cal developments, helping stakeholders adapt their This research was supported by the Carnegie Mel-
sourcing strategies in time. In another instance, lon Manufacturing Futures Institute and the Man-
the system anticipated cobalt supply issues result- ufacturing PA Innovation Program. The authors
ing from labor strikes and regulatory changes in also thank the School of Computer Science (SCS)
the Democratic Republic of Congo (2023), allow- at Carnegie Mellon University, particularly the
ing companies to diversify sources and increase High Performance Computing (HPC), for provid-
inventory buffers. These cases, detailed further in ing essential computational resources. The views
Appendix J, illustrate how data-driven predictions expressed are those of the authors and do not nec-
enhance supply chain resilience and support timely essarily reflect those of the funding agencies.
References Rotem Dror, Haoyu Wang, and Dan Roth. 2022. Zero-
shot on-the-fly event schema induction. arXiv preprint
David Ahn. 2006. The stages of event extraction. In arXiv:2210.06254.
Proceedings of the Workshop on Annotating and Rea-
soning about Time and Events, pages 1–8. Xinya Du, Zixuan Zhang, Sha Li, Pengfei Yu, Hongwei
Wang, Tuan Lai, Xudong Lin, Ziqi Wang, Iris Liu, Ben
Abeer Aljohani. 2023. Predictive analytics and machine Zhou, et al. 2022. Resin-11: Schema-guided event pre-
learning for real-time supply chain risk mitigation and diction for 11 newsworthy scenarios. In Proceedings of
agility. Sustainability, 15(20):15088. the 2022 Conference of the North American Chapter of
John R Anderson, Paul J Kline, and Charles M the Association for Computational Linguistics: Human
Beasley Jr. 1979. A general learning theory and its Language Technologies: System Demonstrations, pages
application to schema abstraction1. In Psychology of 54–63.
learning and motivation, volume 13, pages 277–318.
Selby H Evans. 1967. A brief statement of schema
Elsevier.
theory. Psychonomic Science, 8(2):87–88.
Anjali Awasthi, Kannan Govindan, and Stefan Gold.
2018. Multi-tier sustainable global supplier selection Maryam Gallab, Hafida Bouloiz, Youssef Lamrani
using a fuzzy ahp-vikor based approach. International Alaoui, and Mohamed Tkiouat. 2019. Risk assessment
Journal of Production Economics, 195:106–117. of maintenance activities using fuzzy logic. Procedia
computer science, 148:226–235.
Hülya Behret, Başar Öztayşi, and Cengiz Kahraman.
2012. A fuzzy inference system for supply chain risk A Deiva Ganesh and P Kalpana. 2022. Future of ar-
management. In Practical Applications of Intelligent tificial intelligence and its influence on supply chain
Systems: Proceedings of the Sixth International Confer- risk management–a systematic review. Computers &
ence on Intelligent Systems and Knowledge Engineer- Industrial Engineering, 169:108206.
ing, Shanghai, China, Dec 2011 (ISKE2011), pages Myles D Garvey, Steven Carnovale, and Sengun
429–438. Springer. Yeniyurt. 2015. An analytical framework for supply net-
Maurício F Blos, Robson M Da Silva, and Paulo E work risk propagation: A bayesian network approach.
Miyagi. 2015. Application of an agent-based sup- European Journal of Operational Research, 243(2):618–
ply chain to mitigate supply chain disruptions. IFAC- 627.
PapersOnLine, 48(3):640–645.
Mihalis Giannakis and Michalis Louis. 2011. A multi-
Tom Brown, Benjamin Mann, Nick Ryder, Melanie agent based framework for supply chain risk manage-
Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind ment. Journal of Purchasing and Supply Management,
Neelakantan, Pranav Shyam, Girish Sastry, Amanda 17(1):23–31.
Askell, et al. 2020. Language models are few-shot
learners. Advances in neural information processing Mihalis Giannakis and Michalis Louis. 2016. A multi-
systems, 33:1877–1901. agent based system with big data processing for en-
hanced supply chain agility. Journal of Enterprise In-
Shu Cai and Kevin Knight. 2013. Smatch: an evaluation formation Management, 29(5):706–727.
metric for semantic feature structures. In Proceedings
of the 51st Annual Meeting of the Association for Com- Jeevith Hegde and Børge Rokseth. 2020. Applica-
putational Linguistics (Volume 2: Short Papers), pages tions of machine learning methods for engineering risk
748–752. assessment–a review. Safety science, 122:104492.

Alvaro Camarillo, José Ríos, and Klaus-Dieter Althoff. Christian Hendriksen. 2023. Artificial intelligence for
2018. Knowledge-based multi-agent system for manu- supply chain management: Disruptive innovation or
facturing problem solving process in production plants. innovative disruption? Journal of Supply Chain Man-
Journal of manufacturing systems, 47:115–127. agement, 59(3):65–76.

Real Carbonneau, Kevin Laframboise, and Rustam Vahi- Lifu Huang, Heng Ji, Kyunghyun Cho, Ido Dagan, Se-
dov. 2008. Application of machine learning techniques bastian Riedel, and Clare R Voss. 2018. Zero-shot trans-
for supply chain demand forecasting. European journal fer learning for event extraction. In 56th Annual Meet-
of operational research, 184(3):1140–1154. ing of the Association for Computational Linguistics,
ACL 2018, pages 2160–2170. Association for Computa-
Angel X Chang and Christopher D Manning. 2012. Su- tional Linguistics (ACL).
time: A library for recognizing and normalizing time
expressions. In Lrec, volume 12, pages 3735–3740. Andrew K Lampinen, Ishita Dasgupta, Stephanie CY
Chan, Kory Matthewson, Michael Henry Tessler, Anto-
Yubo Chen, Liheng Xu, Kang Liu, Daojian Zeng, and nia Creswell, James L McClelland, Jane X Wang, and
Jun Zhao. 2015. Event extraction via dynamic multi- Felix Hill. 2022. Can language models learn from expla-
pooling convolutional neural networks. In Proceedings nations in context? arXiv preprint arXiv:2204.02329.
of the 53rd Annual Meeting of the Association for Com-
putational Linguistics and the 7th International Joint Manling Li, Sha Li, Zhenhailong Wang, Lifu Huang,
Conference on Natural Language Processing (Volume Kyunghyun Cho, Heng Ji, Jiawei Han, and Clare Voss.
1: Long Papers), pages 167–176. 2021. The future is not one-dimensional: Complex
event schema induction by graph modeling for event the Seventeenth Conference on Computational Natural
prediction. arXiv preprint arXiv:2104.06344. Language Learning, pages 143–152.
Manling Li, Qi Zeng, Ying Lin, Kyunghyun Cho, Heng Jack W Rae, Sebastian Borgeaud, Trevor Cai, Katie Mil-
Ji, Jonathan May, Nathanael Chambers, and Clare Voss. lican, Jordan Hoffmann, Francis Song, John Aslanides,
2020. Connecting the dots: Event graph schema induc- Sarah Henderson, Roman Ring, Susannah Young, et al.
tion with path language modeling. In Proceedings of 2021. Scaling language models: Methods, analy-
the 2020 Conference on Empirical Methods in Natural sis & insights from training gopher. arXiv preprint
Language Processing (EMNLP), pages 684–695. arXiv:2112.11446.
Sha Li, Ruining Zhao, Manling Li, Heng Ji, Chris Partha Pratim Ray. 2023. Leveraging deep learning
Callison-Burch, and Jiawei Han. 2023. Open- and language models in revolutionizing water resource
domain hierarchical event schema induction by incre- management, research, and policy making: A case for
mental prompting and verification. arXiv preprint chatgpt. ACS ES&T Water, 3(8):1984–1986.
arXiv:2307.01972.
Gonzalo A Ruz, Pablo A Henríquez, and Aldo Mas-
Jian Liu, Yubo Chen, Kang Liu, Wei Bi, and Xiaojiang careño. 2020. Sentiment analysis of twitter data during
Liu. 2020. Event extraction as machine reading com- critical events through bayesian networks classifiers.
prehension. In Proceedings of the 2020 conference Future Generation Computer Systems, 106:92–104.
on empirical methods in natural language processing
(EMNLP), pages 1641–1651. Alexander Schrijver. 1998. Theory of linear and integer
programming. John Wiley & Sons.
Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Man-
dar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Lei Sha, Feng Qian, Baobao Chang, and Zhifang Sui.
Zettlemoyer, and Veselin Stoyanov. 2019. Roberta: A 2018. Jointly extracting event triggers and arguments
robustly optimized bert pretraining approach. arXiv by dependency-bridge rnn and tensor-based argument
preprint arXiv:1907.11692. interaction. In Proceedings of the AAAI conference on
artificial intelligence, volume 32.
Georgios Makridis, Philip Mavrepis, and Dimosthenis
Kyriazis. 2023. A deep learning approach using natural Xiaoming Shi, Siqiao Xue, Kangrui Wang, Fan Zhou,
language processing and time-series forecasting towards James Zhang, Jun Zhou, Chenhao Tan, and Hongyuan
enhanced food safety. Machine Learning, 112(4):1287– Mei. 2024. Language models can improve event pre-
1313. diction by few-shot abductive reasoning. Advances in
Neural Information Processing Systems, 36.
Ishani Mondal, Michelle Yuan, Aparna Garimella, Fran-
cis Ferraro, Andrew Blair-Stanek, Benjamin Van Durme, Nathalie Silva, Luís Miguel DF Ferreira, Cristóvão
Jordan Boyd-Graber, et al. 2023. Interactiveie: To- Silva, Vanessa Magalhães, and Pedro Neto. 2017. Im-
wards assessing the strength of human-ai collaboration proving supply chain visibility with artificial neural net-
in improving the performance of information extraction. works. Procedia Manufacturing, 11:2083–2090.
arXiv preprint arXiv:2305.14659.
Jannik Strötgen and Michael Gertz. 2013. Multilin-
Thien Huu Nguyen, Kyunghyun Cho, and Ralph Grish- gual and cross-domain temporal tagging. Language
man. 2016. Joint event extraction via recurrent neural Resources and Evaluation, 47:269–298.
networks. In Proceedings of the 2016 conference of the
North American chapter of the association for computa- Naushad UzZaman, Hector Llorens, James Allen,
tional linguistics: human language technologies, pages Leon Derczynski, Marc Verhagen, and James Puste-
300–309. jovsky. 2012. Tempeval-3: Evaluating events, time
expressions, and temporal relations. arXiv preprint
Sanjoy Kumar Paul. 2015. Supplier selection for man- arXiv:1206.5333.
aging supply risks in supply chain: a fuzzy approach.
The International Journal of Advanced Manufacturing Hongwei Wang, Zixuan Zhang, Sha Li, Jiawei Han,
Technology, 79:657–664. Yizhou Sun, Hanghang Tong, Joseph P Olive, and Heng
Ji. 2022a. Schema-guided event graph completion.
Sanjoy Kumar Paul, Ruhul Sarker, and Daryl Essam. arXiv preprint arXiv:2206.02921.
2017. A quantitative model for disruption mitigation
in a supply chain. European Journal of Operational Sijia Wang, Mo Yu, Shiyu Chang, Lichao Sun, and Lifu
Research, 257(3):881–895. Huang. 2021. Query and extract: Refining event extrac-
tion as type-oriented binary decoding. arXiv preprint
Raúl Pino, Isabel Fernández, David de la Fuente, José arXiv:2110.07476.
Parreño, and Paolo Priore. 2010. Supply chain mod-
elling using a multi-agent system. Journal of Advances Xiaozhi Wang, Ziqi Wang, Xu Han, Zhiyuan Liu, Juanzi
in Management Research, 7(2):149–162. Li, Peng Li, Maosong Sun, Jie Zhou, and Xiang Ren.
2019. Hmeae: Hierarchical modular event argument
Sameer Pradhan, Alessandro Moschitti, Nianwen Xue, extraction. In Proceedings of the 2019 Conference on
Hwee Tou Ng, Anders Björkelund, Olga Uryupina, empirical methods in natural language processing and
Yuchen Zhang, and Zhi Zhong. 2013. Towards robust the 9th international joint conference on natural lan-
linguistic analysis using ontonotes. In Proceedings of guage processing (EMNLP-IJCNLP), pages 5777–5783.
Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc
Le, Ed Chi, and Denny Zhou. 2022b. Rationale-
augmented ensembles in language models. arXiv
preprint arXiv:2207.00747.
Jason Wei, Maarten Bosma, Vincent Y Zhao, Kelvin
Guu, Adams Wei Yu, Brian Lester, Nan Du, An-
drew M Dai, and Quoc V Le. 2021. Finetuned lan-
guage models are zero-shot learners. arXiv preprint
arXiv:2109.01652.
Diyi Yang, Sherry Tongshuang Wu, and Marti A Hearst.
2024. Human-ai interaction in the age of llms. In Pro-
ceedings of the 2024 Conference of the North American
Chapter of the Association for Computational Linguis-
tics: Human Language Technologies (Volume 5: Tuto-
rial Abstracts), pages 34–38.
Tianyi Zhang, Isaac Tham, Zhaoyi Hou, Jiaxuan Ren,
Liyang Zhou, Hainiu Xu, Li Zhang, Lara J Martin,
Rotem Dror, Sha Li, et al. 2023. Human-in-the-loop
schema induction. arXiv preprint arXiv:2302.13048.
Zixuan Zhang and Heng Ji. 2021. Abstract meaning
representation guided graph encoding and decoding for
joint information extraction. In Proc. The 2021 Confer-
ence of the North American Chapter of the Association
for Computational Linguistics-Human Language Tech-
nologies (NAACL-HLT2021).
A Extended Related Work approach leverages these advancements by employ-
ing an LLM-driven framework that integrates hu-
Supply Chain Risk Management. AI techniques
man feedback and expert knowledge into a human-
have been increasingly applied to predict and mit-
in-loop system, thereby enhancing the practical
igate supply chain risks (Ganesh and Kalpana,
accuracy and relevance of induced schemas.
2022). While agent-based approaches (Pino et al.,
2010; Giannakis and Louis, 2011, 2016; Blos et al., Event Extraction & Analysis. Event extraction
2015) enable inter-agent communication for fore- has evolved from manually crafted features (Ahn,
casting, they often lack robust predictive capabil- 2006) to neural models, including recurrent net-
ities and have limited parameter sets. Rule-based works (Nguyen et al., 2016; Sha et al., 2018), con-
reasoning methods (Gallab et al., 2019; Behret volutional networks (Chen et al., 2015), graph net-
et al., 2012; Paul, 2015; Paul et al., 2017; Awasthi works (Zhang and Ji, 2021), and transformers (Liu
et al., 2018; Camarillo et al., 2018) offer decision- et al., 2020). Recent research has focused on event
making frameworks but provide minimal quantita- argument extraction (Wang et al., 2019) and ex-
tive insights. To address these limitations, Machine plored zero-shot learning (Huang et al., 2018) and
Learning (DL) and Deep Learning (DL) techniques weak supervision (Chen et al., 2015) to enhance
have been employed (Silva et al., 2017; Hegde and performance. Our approach incorporates various
Rokseth, 2020; Garvey et al., 2015; Ruz et al., event extraction techniques, utilizing fine-tuned
2020; Aljohani, 2023; Carbonneau et al., 2008), RoBERTa models and graph convolutional net-
enhancing demand forecasting and disruption pre- works (GCNs) to capture and analyze complex
diction (Hendriksen, 2023; Makridis et al., 2023). event relationships and their cascading impacts.
Recent studies have begun exploring the potential This approach enables a deeper understanding of
of large language models (LLMs) in supply chain supply chain disruptions, distinguishing our system
management (Ray, 2023). However, most current from traditional extraction techniques.
works prioritize predictive performance over inter-
pretability, hindering practitioners’ ability to make B Dataset
informed decisions. Our approach addresses this
gap by integrating LLMs for schema induction, ex- B.1 Schema Learning Dataset
tracting hierarchical knowledge-graph structures Our research began by examining the current state
from academic resources to predict supply chain of EV batteries, focusing on the predominant types
disruptions, thereby enhancing both predictive per- in use, such as lithium iron phosphate and nickel
formance and interpretability. lithium batteries. We analyzed the battery produc-
Schema Induction & Learning. Building on foun- tion process and identified key raw materials, in-
dational works (Anderson et al., 1979; Evans, cluding lithium, cobalt, nickel, and graphite. Sub-
1967), recent advancements in language model- sequently, we investigated the primary sources and
ing have revolutionized schema induction. Large- production volumes of these materials. Through an
scale language models (LLMs) (Brown et al., 2020; extensive review of literature and statistical data,
Rae et al., 2021) have demonstrated remarkable we categorized significant supply chain events into
capabilities in learning and generating schemas eight groups, three of which have long-term im-
with minimal supervision. Researchers have ex- pacts. Each category was further divided into sub-
plored various strategies to enhance these mod- categories, and real-world events were identified to
els, including contextual explanations (Wei et al., illustrate their impact on raw material supplies.
2021; Lampinen et al., 2022), rationale-augmented We also analyzed price trends for key raw mate-
ensembles (Wang et al., 2022b), and incremental rials over the past five years, using data from the
prompting (Li et al., 2023). Transformer-based ap- London Metal Exchange (LME)3 , to assess how
proaches (Li et al., 2020, 2021) have proven partic- news events influenced these prices. This research
ularly effective in managing schema generation for produced an initial scenario document listing the
complex scenarios, representing schemas as graphs. primary raw materials for EV batteries, their price
Integrating human feedback (Mondal et al., 2023; trends, and an analysis of events causing supply
Yang et al., 2024; Zhang et al., 2023) has been chain issues and price fluctuations. Each category
crucial in refining schema induction processes, ad-
3
dressing the limitations of automated methods. Our https://www.lme.com/en/
included at least one real-world example to demon- 2023. We initially developed a Python crawler us-
strate its impact. ing the requests and BeautifulSoup packages
to scrape news titles and summaries from multiple
The initial document was then submitted for re- websites, such as Google News4 and Infoplease5 .
view by a supply chain expert. Based on the ex- This resulted in a collection of 643 records. To filter
pert’s feedback, we refined the events affecting the out news unrelated to the supply chain, we designed
EV battery supply chain into 11 main categories, a prompt leveraging GPT-4o’s language capabili-
three with long-term impacts, and subdivided them ties. Using the summaries from the list, GPT-4o
into 27 subcategories. Each subcategory was il- helped categorize events into various types, such as
lustrated with 1-2 real-world events, and raw ma- natural disasters, wars, trade policy, and political
terials were further subdivided, such as different issues, tagging the relevant countries and regions.
grades of nickel and types of lithium. Categories
with minimal impact were removed, resulting in a Subsequently, we employed large language models
comprehensive and refined scenario document. (LLMs) to evaluate the relevance of each news
event to the EV battery supply chain based on the
Based on the scenario document, we identified following criteria, each worth 25 points:
the raw materials and events related to the EV
battery supply chain and began collecting an aca- 1. Whether natural disasters or humanitarian
demic document dataset. Our data sources included crises occurred in raw material production
Wikipedia entries, supply chain-related papers, and areas, such as China, Australia, Indonesia,
industrial reports. After obtaining the raw data, Congo, Chile, Canada, or in EV production
we manually removed redundant information and countries, such as China, Japan, South Korea,
noise, retaining only the paragraphs most relevant and the United States.
to the EV battery supply chain. Through meticu-
lous organization, we compiled an academic doc- 2. Whether the event could affect trade relations
ument dataset consisting of 125 entries, distilled in the aforementioned countries, including
from 239 diverse sources: 200 academic papers, trade issues, sanctions, or wars.
22 industry reports, and 17 Wikipedia entries. This 3. Whether the event could potentially disrupt
curated dataset provides a focused knowledge base international shipping routes due to conflicts
essential for analyzing and understanding the com- or natural disasters near these routes.
plexities of the EV battery supply chain.
4. Whether the event is directly related to inter-
The resulting dataset encompasses a wealth of national trade.
knowledge related to the EV battery supply chain,
covering aspects such as raw material procurement, Events scoring below 25 points were initially elim-
manufacturing processes, supply chain logistics, inated, followed by a manual review of the remain-
and market dynamics. Table 6 presents the event ing events, resulting in a refined list of 247 sup-
categories and example events. Events marked with ply chain-related news events. The text data was
* indicate potential long-term impacts, highlighting sourced from reputable media outlets, including
the various types of disruptions and their implica- Reuters6 , BBC7 , and CNN8 . Additionally, to gather
tions for the supply chain. Fig. 5 and 6 illustrate the contemporaneous supply chain status information,
sources of the academic papers, Wikipedia entries, we scraped company news and analysis reports
and industry reports used in compiling the dataset, from EV battery-related companies like Ford, Volk-
demonstrating the breadth and diversity of our data swagen, and CATL, as well as supply chain-related
sources. By synthesizing this information, we aim websites, totaling 118 reports. The raw data, in-
to provide a robust foundation for understanding cluding titles, publication dates, and content, was
the complexities and challenges associated with the organized chronologically.
EV battery supply chain.
The raw data contained invalid information and
B.2 Supply Chain News Dataset 4
https://news.google.com/
5
https://www.infoplease.com/
To comprehensively test our system, we con- 6
https://www.reuters.com/
structed an EV Supply Chain News Dataset cov- 7
https://www.bbc.com/
8
ering the period from January 2022 to December https://edition.cnn.com/
Table 6: Event Categories and Example Events. Events marked with * indicate potential long-term impacts.
Event Category Subcategory Example
Investment from U.S. or Ally U.S. invests in EV battery industry in Canada
Acquisition and Investment∗
Investment from Other Country China invests in cobalt mines in DRC
Demand Change Demand for ore from the Philippines increases
Changes in Supply and Demand
Supply Change Tight supplies of nickel ore in Indonesia
Production Halt or Reduction Katanga halts cobalt mining
Enterprise Issue Enterprise Crisis Katanga faces an equity crisis
Production Plan Adjustment Kellyton Graphite increases production by 15%
Macroeconomy The U.S. and EU face continued inflation
Economic Environment
Competition and Market Structure Competition from China’s low-priced EVs
Product Upgrading CATL releases Kirin battery
EV Battery Technology Progress∗
Production Technology Progress Development of graphene batteries
Forced Labor Forced labor in production
Humanitarian and Ethical Crisis Use of Child Labor Child labor in cobalt mining in DRC
Human Rights Issue Large numbers of refugees enter Europe
Production Affected by Disaster Australia floods affect lithium mining
Natural Disaster
Transportation Affected by Disaster Tsunami destroys ports, disrupts shipping
Regional Tension Tensions between North and South Korea
∗ Changes in International Relations China’s relations with the West deteriorate
Political Issue
Industry Nationalization Nationalization of the lithium industry in Chile
Government Intervention Europe promotes EVs for environmental reasons
Sign a Supply Agreement Sign a Supply Agreement PE signs EV battery supply agreement with Tesla
Export Controls China restricts graphite exports
Trade Policy Tax and Duties China’s tax rebates to EV companies
Trade Barriers US tariffs on Chinese EV batteries
Internal Disorder or Rebellion Civil unrest in Yemen
War and Conflict War Between Nations Russo-Ukrainian War
Geopolitical Crisis Houthi rebels attack merchant ship

advertisements, which were cleaned using regular the extensive scope of the collected data. Fig. 7
expressions to remove most invalid information. presents the categories and examples of news ar-
We deployed Llama3-8b to filter out embedded ad- ticles in the dataset, while Fig. 8 shows the distri-
vertisements, ensuring the dataset’s purity and accu- bution of sources in the news dataset, emphasizing
racy. After cleaning, irrelevant content was reduced the dataset’s diversity and comprehensiveness. The
by 15%, and all data was systematically stored in a dataset covers global events that could impact the
database, resulting in a refined meta dataset of 365 supply chain, such as natural disasters, trade issues,
news documents. The metadata contains approxi- wars, enterprise issues, etc.
mately 152,000 words and 3,000 paragraphs. To
validate our system’s ability to detect connections C Hierarchical Structure Extraction
between events, we randomly merged international
news with contemporaneous corporate stories from We utilize large language models (LLMs) to extract
the same quarter, creating 354 fused documents hierarchical structures (H) that capture main events
for a more diverse and challenging dataset. The fi- (E) and sub-events (Esub ) based on our prompt, as
nal fusion dataset contains approximately 660,000 illustrated in Fig. 9.
words and 12,000 paragraphs. In a hierarchical structure (H):
Upon completing dataset collection, we conducted • An event (E) refers to anything that happens
preliminary statistics and analysis on the news related to the EV battery supply chain. There
dataset. Table 8 shows the number of event types can be multiple events ⟨E1 , E2 , . . . , En ⟩ in
included in each quarter in the news dataset, provid- one hierarchical structure H.
ing a comprehensive overview of the various events
tracked over time. Table 9 details the number of • An event_id is a unique identifier code as-
words and paragraphs in the dataset, highlighting signed for each specific event.
◼ The Electric Vehicle Supply Chain Ecosystem: Changing Roles of Automotive Suppliers
• Academic Paper ◼ Electric Vehicle Battery Supply Chain and Critical Materials: A Brief Survey of State of the Art
◼ Electric vehicle battery supply chain and critical materials: a brief survey of state of the art ◼ Estimating the environmental impacts of global lithium-ion battery supply chain: A temporal, geographical, and
◼ Challenges and recent developments in supply and value chains of electric vehicle batteries: A sustainability perspective technological perspective
◼ Cost-effective supply chain for electric vehicle battery remanufacturing ◼ A SWOT Analysis of the UK EV Battery Supply Chain
◼ The supply chain for electric vehicle batteries ◼ The Electric Vehicle Supply Chain Ecosystem: Changing Roles of Automotive Suppliers
◼ Critical issues in the supply chain of lithium for electric vehicle batteries ◼ Building a North American electric vehicle supply chain
◼ Building a Robust and Resilient U.S. Lithium Battery Supply Chain ◼ Friend-shoring battery supply chains
◼ At the mining or extraction stage, major risks include the location of the deposit, cost, geopolitical environment, and ◼ Estimating the environmental impacts of global lithium-ion battery supply chain: A temporal, geographical, and
mining regulations technological perspective
◼ A SWOT Analysis of the UK EV Battery Supply Chain ◼ Global Supply Chains of EV Batteries
◼ Lithium-ion battery supply chain: enabling national electric vehicle and renewables targets ◼ Building a Sustainable Electric Vehicle Battery Supply Chain
◼ Sustainable electric vehicle batteries for a sustainable world: perspectives on battery cathodes, environment, supply ◼ INVESTIGATING THE U.S. BATTERY SUPPLY CHAIN AND ITS IMPACT ON ELECTRIC VEHICLE COSTS THROUGH 2032
chain, manufacturing, life cycle, and policy ◼ GLOBAL STATE OF Sustainable ELECTRIC VEHICLE BATTERIES
◼ Developing pricing strategy to optimise total profits in an electric vehicle battery closed loop supply chain ◼ Collection and recycling decisions for electric vehicle end-of-life power batteries in the context of carbon emissions
◼ Optimising quantity of manufacturing and remanufacturing in an electric vehicle battery closed-loop supply chain reduction
◼ Life-cycle implications and supply chain logistics of electric vehicle battery recycling in California ◼ The Paradox of Green Growth: Challenges and Opportunities in Decarbonizing the Lithium-Ion Supply Chain
◼ Analyzing challenges for sustainable supply chain of electric vehicle batteries using a hybrid approach of Delphi and ◼ Implications of the Electric Vehicle Manufacturers’ Decision to Mass Adopt Lithium-Iron Phosphate Batteries
Best-Worst Method ◼ Reducing new mining for electric vehicle battery metals: responsible sourcing through demand reduction strategies and
◼ Determining requirements and challenges for a sustainable and circular electric vehicle battery supply chain: A mixed- recycling
methods approach ◼ Automated assembly of Li-ion vehicle batteries: A feasibility study
◼ Graphite resources, and their potential to support battery supply chains, in Africa ◼ Field Study and Multimethod Analysis of an EV Battery System Disassembly
◼ Lithium-ion battery supply chain considerations: analysis of potential bottlenecks in critical metals ◼ COVID-19 disrupts battery materials and manufacture supply chains, but outlook remains strong
◼ The case for recycling: Overview and challenges in the material supply chain for automotive li-ion batteries ◼ An overview of global power lithium-ion batteries and associated critical metal recycling
◼ Global Value Chains: Graphite in Lithium-ion Batteries for Electric Vehicles ◼ Does China's new energy vehicles supply chain stock market have risk spillovers? Evidence from raw material price
◼ Traceability methods for cobalt, lithium, and graphite production in battery supply chains effect on lithium batteries
◼ The global cycle of Graphite - A dynamic Material Flow Analysis (2020-2050) of the natural, synthetic and recycled ◼ Deep-sea nodules versus land ores: A comparative systems analysis of mining and processing wastes for battery-metal
graphite value chains to understand the supply of LIB anodes supply chains
◼ Natural graphite demand and supply—Implications for electric vehicle battery requirements ◼ A game theoretic approach for analyzing electric and gasoline-based vehicles’ competition in a supply chain under
◼ Mateerial System Analysis of five battery-related raw materials: Cobalt, Lithium, Manganese, Natural Graphite, Nickel government sustainable strategies: A case study of South Korea
◼ Life cycle assessment of natural graphite production for lithium-ion battery anodes based on industrial primary data ◼ China's lithium supply chain: Security dynamics and policy countermeasures
◼ Sustainability challenges throughout the electric vehicle battery value chain ◼ Exploring recycling options in battery supply chains – a life cycle sustainability assessment
◼ Electric vehicle battery chemistry affects supply chain disruption vulnerabilities ◼ Lithium-Ion Battery Recycling in the Circular Economy: A Review
◼ Global competition in the lithium-ion battery supply chain: a novel perspective for criticality analysis ◼ Implications of circular production and consumption of electric vehicle batteries on resource sustainability: A system
◼ Supply risks of lithium-ion battery materials: An entire supply chain estimation dynamics perspective
◼ Vulnerable links in the lithium-ion battery supply chain ◼ On the influence of second use, future battery technologies, and battery lifetime on the maximum recycled content of
◼ The global battery arms race: lithium-ion battery gigafactories and their supply chain future electric vehicle batteries in Europe
◼ Towards the lithium-ion battery production network: Thinking beyond mineral supply chains ◼ Assessing batteries supply chain networks for low impact vehicles
◼ Challenges and opportunities in lithium-ion battery supply ◼ Mapping a circular business opportunity in electric vehicle battery value chain: A multi-stakeholder framework to create
◼ Identifying supply risks by mapping the cobalt supply chain a win–win–win situation
◼ The Cobalt Supply Chain and Environmental Life Cycle Impacts of Lithium-Ion Battery Energy Storage Systems ◼ An applied analysis of the recyclability of electric vehicle battery packs
◼ Environmental sustainability and supply resilience of cobalt ◼ A Novel Prediction Process of the Remaining Useful Life of Electric Vehicle Battery Using Real-World Data
◼ Perspectives on cobalt supply through 2030 in the face of changing demand ◼ Capturing the battery value-chain opportunity
◼ An integrated supply chain analysis for cobalt and rare earth elements under global electrification and constrained ◼ Digital Twin-Driven Framework for EV Batteries in Automobile Manufacturing
resources ◼ Electric vehicle battery state changes and reverse logistics considerations
◼ Sources of uncertainty in the closed-loop supply chain of lithium-ion batteries for electric vehicles ◼ Optimising the geospatial configuration of a future lithium ion battery recycling industry in the transition to electric
◼ Battery technology and recycling alone will not save the electric mobility transition from future cobalt shortages vehicles and a circular economy☆
◼ Global value chains: cobalt in lithium-ion batteries for electric vehicles ◼ Can Cobalt Be Eliminated from Lithium-Ion Batteries?
◼ Global electrification of vehicles and intertwined material supply chains of cobalt, copper and nickel ◼ Sizing and Locating Planning of EV Centralized-Battery-Charging-Station Considering Battery Logistics System
◼ The cobalt supply chain and life cycle assessment of lithium-ion battery energy storage systems ◼ Taming the Hydra: Funding the Lithium Ion Supply Chain in an Era of Unprecedented Volatility
◼ Towards the lithium-ion battery production network: Thinking beyond mineral supply chains ◼ Battery technology and recycling alone will not save the electric mobility transition from future cobalt shortages
◼ Vertically Integrated Supply Chain of Batteries, Electric Vehicles, and Charging Infrastructure: A Review of Three ◼ Resilience assessment of the lithium supply chain in China under impact of new energy vehicles and supply interruption
Milestone Projects from Theory of Constraints Perspective ◼ Optimal policy for the recycling of electric vehicle retired power batteries
◼ The behavioural evolution of the smart electric vehicle battery reverse supply chain under government supervision ◼ Data requirements and availabilities for a digital battery passport – A value chain actor perspective
◼ Managing resource dependencies in electric vehicle supply chains: a multi-tier case study ◼ Trade structure and risk transmission in the international automotive Li-ion batteries trade
◼ Application of sustainable supply chain finance in end-of-life electric vehicle battery management: a literature review ◼ The Supply Chain Diversification and India–South Korea Cooperation in a Contested East Asia in the Post-COVID-19 Era
◼ Structural characteristics and disruption ripple effect in a meso-level electric vehicle Lithium-ion battery supply chain ◼ Toward Sustainable Reuse of Retired Lithium-ion Batteries from Electric Vehicles
network ◼ Steering extended producer responsibility for electric vehicle batteries
◼ Battery global value chain and its technological challenges for electric vehicle mobility ◼ Traceability Management Strategy of the EV Power Battery Based on the Blockchain
◼ Building a competitive advantage for Indonesia in the development of the regional EV battery chain ◼ Electric vehicle lithium-ion battery recycled content standards for the US – targets, costs, and environmental impacts
◼ Recycling mode selection and carbon emission reduction decisions for a multi-channel closed-loop supply chain of ◼ What is the contribution of different business processes to material circularity at company-level? A case study for
electric vehicle power battery under cap-and-trade policy electric vehicle batteries
◼ A sustainable circular supply chain network design model for electric vehicle battery production using internet of things ◼ To shred or not to shred: A comparative techno-economic assessment of lithium ion battery hydrometallurgical
and big data recycling retaining value and improving circularity in LIB supply chains
◼ The CO2 Impact of the 2020s Battery Quality Lithium Hydroxide Supply Chain ◼ Value recovery from spent lithium-ion batteries: A review on technologies, environmental impacts, economics, and
◼ Electric vehicle supply chain management: A bibliometric and systematic review supply chain
◼ Long-term Indonesia's Nickel Supply Chain Strategy for Lithium-Ion Battery as Energy Storage System ◼ Optimal choice of power battery joint recycling strategy for electric vehicle manufacturers under a deposit-refund
◼ Battery Nickel Bottlenecks system
◼ Life-cycle analysis, by global region, of automotive lithium-ion nickel manganese cobalt batteries of varying nickel ◼ Collection mode choice of spent electric vehicle batteries: considering collection competition and third-party economies
content of scale
◼ Battery minerals from Finland: Improving the supply chain for the EU battery industry using a geometallurgical approach ◼ Electric vehicle battery capacity allocation and recycling with downstream competition
◼ Exploring recycling options in battery supply chains–a life cycle sustainability assessment ◼ Tackling EV Battery Chemistry in View of Raw Material Supply Shortfalls
◼ Assessment of social sustainability hotspots in the supply chain of lithium-ion batteries ◼ Which is better? Business models of partial and cross ownership in an NEV supply chain
◼ Conflict minerals and battery materials supply chains: A mapping review of responsible sourcing initiatives ◼ Materials availability and supply chain considerations for vanadium in grid-scale redox flow batteries
◼ Analysis of nickel sulphate datasets used in lithium-ion batteries ◼ Circularity of Lithium-Ion Battery Materials in Electric Vehicles
◼ Design of battery supply chains under consideration of environmental and socio-economic criteria ◼ Assessment of end-of-life electric vehicle batteries in China: Future scenarios and economic benefits
◼ Analysis of international nickel flow based on the industrial chain ◼ The Resilience of the Renewable Energy Electromobility Supply Chain: Review and Trends
◼ Value recovery from spent lithium-ion batteries: A review on technologies, environmental impacts, economics, and ◼ Industrial Policy, Trade, and Clean Energy Supply Chains
supply chain ◼ Concurrent design of product and supply chain architectures for modularity and flexibility: process, methods, and
◼ Dynamic evolution of the zinc-nickel battery industry and evidence from China application
◼ A perspective on the sustainability of cathode materials used in lithium‐ion batteries ◼ A sustainable framework for the second-life battery ecosystem based on blockchain
◼ An improved resource midpoint characterization method for supply risk of resources: integrated assessment of Li-ion ◼ Comparative evaluation and policy analysis for recycling retired EV batteries with different collection modes
batteries ◼ HOW TECHNOLOGY, RECYCLING, AND POLICY CAN MITIGATE SUPPLY RISKS TO THE LONG-TERM TRANSITION TO ZERO-
◼ Deep‐sea nodules versus land ores: A comparative systems analysis of mining and processing wastes for battery‐metal EMISSION VEHICLES
supply chains ◼ Decarbonizing the automotive sector: a primary raw material perspective on targets and timescales
◼ Industrial policy, trade, and clean energy supply chains ◼ The Emerging Electric Vehicle and Battery Industry in Indonesia: Actions around the Nickel Ore Export Ban and a SWOT
◼ The electric vehicle revolution: Critical material supply chains, trade and development Analysis
◼ Industrial policy for electric vehicle supply chains and the US-EU fight over the Inflation Reduction Act ◼ The Emerging Electric Vehicle and Battery Industry in Indonesia: Actions around the Nickel Ore Export Ban and a SWOT
◼ Strategic Battery Autarky: Reducing Foreign Dependence in the Electric Vehicle Supply Chain Analysis
◼ Electric vehicle battery secondary use under government subsidy: A closed-loop supply chain perspective ◼ Reverse Logistics Network Design of Electric Vehicle Batteries considering Recall Risk
◼ Blockchain review for battery supply chain monitoring and battery trading ◼ Battery capacity needed to power electric vehicles in India from 2020 to 2035
◼ Trade structure and risk transmission in the international automotive Li-ion batteries trade ◼ The future of the automotive sector: Emerging battery value chains in Europe
◼ The EV Revolution: Critical Material Supply Chains, Trade, and Development ◼ Radical innovations as supply chain disruptions? A paradox between change and stability
◼ An Overview of the Lithium Supply Chain ◼ End of Electric Vehicle Batteries: Reuse vs. Recycle
◼ Comparison of lithium-ion battery supply chains–a life cycle sustainability assessment ◼ Securing Decarbonized Road Transport – a Comparison of How EV Deployment Has Become a Critical Dimension of
◼ The ev transition: Key market and supply chain enablers Battery Security Strategies for China, the EU, and the US.
◼ The Lithium Supply Crunch Doesn't Have to Stall Electric Cars ◼ A Review on Battery Market Trends, Second-Life Reuse, and Recycling
◼ Hydrometallurgical Routes to Close the Loop of Electric Vehicle (EV) Lithium-Ion Batteries (LIBs) Value Chain: A Review ◼ Life cycle impact assessment of electric vehicle battery charging in European Union countries
◼ Global warming potential of lithium-ion battery cell production: Determining influential primary and secondary raw ◼ Assessing socio-economic risks in the supply chain of materials required for vehicle electrification
material supply routes ◼ Intelligent disassembly of electric-vehicle batteries: a forward-looking overview
◼ Lithium mining: How new production technologies could fuel the global EV revolution ◼ Research on decision optimization of new energy vehicle supply chain considering demand disruptions under dual credit
◼ The cobalt and lithium global supply chains: status, risks and recommendations policy
◼ Sustainable value chain of retired lithium-ion batteries for electric vehicles ◼ Transition to electric vehicles in China: Implications for private motorization rate and battery market
◼ Assessing the potential of quebec lithium industry: Mineral reserves, lithium-ion batteries production and greenhouse ◼ Mirroring in production? Early evidence from the scale-up of Battery Electric Vehicles (BEVs)
gas emissions ◼ Life-Cycle Assessment Considerations for Batteries and Battery Materials
◼ Current and Future Global Lithium Production Till 2025 ◼ Operation Management of Multiregion Battery Swapping–Charging Networks for Electrified Public Transportation
◼ Lithium-Ion Batteries Recycling Trends and Pathways: A Comparison Systems
◼ Alternative battery chemistries and diversifying clean energy supply chains ◼ Spatial modeling of a second-use strategy for electric vehicle batteries to improve disaster resilience and circular
◼ Lithium and cobalt economy
◼ A Study on the Cradle-to-Gate Environmental Impacts of Automotive Lithium-ion Batteries ◼ On the sustainability of lithium ion battery industry – A review and perspective
◼ Determining requirements and challenges for a sustainable and circular electric vehicle battery supply chain: A mixed- ◼ Comparison of Electric Vehicle Lithium-Ion Battery Recycling Allocation Methods
methods approach ◼ Manufacturing value chain for battery electric vehicles in Pakistan: An assessment of capabilities and transition
◼ Status and gap in rechargeable lithium battery supply chain: importance of quantitative failure analysis pathways
◼ Critical Factors to Consider in Purchasing for a Sustainable Inbound Supply Chain: A Perspective on Large Scale Lithium- ◼ The End of Globalized Production? Supply-Chain Resilience, Technological Sovereignty, and Enduring Global
ion Battery Manufacturing Interdependencies in the Post-Pandemic Era
◼ Assessing batteries supply chain networks for low impact vehicles ◼ Environmental feasibility of secondary use of electric vehicle lithium-ion batteries in communication base stations
◼ A comparative assessment of value chain criticality of lithium-ion battery cells ◼ Supply chain risks of critical metals: Sources, propagation, and responses
◼ Conflict minerals and battery materials supply chains: A mapping review of responsible sourcing initiatives ◼ Decentralized Planning of Lithium-Ion Battery Production and Recycling
◼ Battery Critical Materials Supply Chain Challenges and Opportunities: Results of the 2020 Request for Information (RFI) ◼ Improvements in electric vehicle battery technology influence vehicle lightweighting and material substitution decisions
and Workshop ◼ Rethinking Chinese supply resilience of critical metals in lithium-ion batteries
◼ Key Strategic Issues in Supply Chain Domain Pertaining to Battery Industry. ◼ Optimal pricing strategy in the closed-loop supply chain using game theory under government subsidy scenario: A case
◼ Identifying trends in battery technologies with regard to electric mobility: evidence from patenting activities along and study
across the battery value chain ◼ Predictive model for energy consumption of battery electric vehicle with consideration of self-uncertainty route factors
◼ Electric Vehicle Battery Supply Chain and Critical Materials: A Brief Survey of State of the Art ◼ Perspectives on Cobalt Supply through 2030 in the Face of Changing Demand
◼ Estimating the environmental impacts of global lithium-ion battery supply chain: A temporal, geographical, and ◼ McKinsey Electric Vehicle Index: Europe cushions a global plunge in EV sales
technological perspective ◼ Lithium in International Law: Trade, Investment, and the Pursuit of Supply Chain Justice
◼ A SWOT Analysis of the UK EV Battery Supply Chain

Figure 5: Sources of academic papers.


◼ Gold is Revealed after All the Yellow Sands are Blown Away - Investment Strategy for Energy Metals and Materials
•Industry Report Industry in 2024
◼ The EV Battery Supply Chain Explained ◼ 20240222-Huaxin Securities-Huaxin Securities Minor Metals Industry In-depth Report: Lithium Price Bottom-seeking
◼ Global Supply Chains of EV Batteries Journey, Latest Inventory of Global Lithium Resource Supply
◼ Electric vehicle battery chemistry affects supply chain
◼ Battery supply chain challenges - RMIS


Electric vehicle supply chain
The ultimate guide to the EV battery supply chain
•Wikipedia Entries
◼ Electric vehicle battery
◼ Electric Vehicle Battery Supply Chain and Critical Materials
◼ Electric vehicle supply chain & Batteries
◼ The geopolitics of electric car batteries - LSE Blogs
◼ Lithium-ion battery
◼ Global Supply Chains of EV Batteries
◼ Lithium nickel manganese cobalt oxides(NMC)
◼ Supply Chain for EV Batteries: 2020 Trade and Value- ...
◼ Lithium iron phosphate battery(LFP)
◼ Achieving resilience and sustainability for the EV battery .
◼ Sodium-ion battery
◼ Electric Vehicle Batteries: A Guidebook for Responsible Corporate Engagement Throughout the Supply Chain
◼ Lithium nickel cobalt aluminium oxides(NCA)
◼ Trends in batteries Battery demand for EVs continues to rise
◼ The EV Battery Supply Chain Explained
◼ Baichuan Yingfu Lithium Carbonate Market Weekly Report Week 12
◼ Electric Vehicle Battery Supply Chains: The Basics
◼ Shanghai Dongsheng Futures Nickel Annual Report: Oversupply continues, hidden dragon in the abyss (2024-01-23)
◼ The EV Battery Supply Chain Explained
◼ Minmetals Securities Fengchi "Tram" Series 2: Lithium carbonate prices are bottoming out, how far is the spring of
◼ Electric Vehicle Battery Supply Chains: The Basics
lithium battery positive electrode materials?
◼ Life Cycle Assessment studies of rare earths production - Findings from a systematic review
◼ Baichuan Yingfu Cobalt Salt Market Weekly Report Week 12 (2024-03-21)
◼ The EV Battery Supply Chain Explained
◼ Minmetals Securities Lithium Thinking Series 1: Will lithium carbonate prices fall too much?
◼ High concentration from resources to market heightens risk for power lithium-ion battery supply chains globally
◼ Baichuan Yingfu Nickel Market Weekly Report Week 12 (2024 3.15-3.21)
◼ The EV Battery Supply Chain Explained
◼ Kaiyuan Securities Nonferrous Metals Industry In-depth Report: Australian Mine 2023Q4 Tracking, Medium- and Short-
◼ Lithium extractivism and water injustices in the Salar de Atacama, Chile: The colonial shadow of green
term Expansion Projects Continue, Cost Reduction is the Main Goal for the 2024 Fiscal Year
electromobility
◼ Nickel Mine

Figure 6: Sources of Wikipedia entries and industry reports.

Table 7: The number of event types included in each quarter in the news dataset.
Event Type 2022Q1 2022Q2 2022Q3 2022Q4 2023Q1 2023Q2 2023Q3 2023Q4
Acquisition and Investment 0 5 4 0 2 2 2 1
Changes in Supply and Demand 6 5 4 3 3 1 2 3
Enterprise Issue 3 1 1 3 0 1 0 0
Economic Environment 3 3 5 6 4 2 6 1
Humanitarian and Ethical Crisis 1 5 3 1 2 0 2 3
Natural Disaster 6 8 6 5 6 6 6 6
Political Issue 3 14 18 16 14 10 15 10
EV Battery Technology Progress 1 1 1 2 4 3 2 0
Sign a Supply Agreement 0 3 1 1 2 2 1 4
Trade Policy 2 6 5 2 5 1 2 9
War and Conflict 3 6 12 10 5 7 7 7

Table 8: The number of times each country is mentioned in the news dataset in each quarter.
Country 2022Q1 2022Q2 2022Q3 2022Q4 2023Q1 2023Q2 2023Q3 2023Q4
USA 6 9 18 15 17 12 9 10
China 1 10 12 6 12 4 4 9
EU 3 11 6 4 8 4 8 2
Japan 0 1 0 1 1 2 2 1
Russia 4 10 4 14 6 6 4 3
Other 20 42 38 33 31 23 35 40

Table 9: Statistics of the number of words and para- • Use ’and’ if no sub-events can be missing.
graphs in the dataset.
• Use ’or’ if some sub-events can be missing.
Total Paragraphs Total Words
Meta Data 3,022 152,489 • Use ’xor’ if sub-events cannot exist simulta-
Fusion Data 12,070 660,054 neously.
Relations describe the connections between events.
• A description provides a detailed 2-3 sentence For example, if ev1.2 is caused by (happens after)
textual explanation of the event. ev1.1, it is expressed as ’ev1.1>ev1.2’.

• Participants include all sub-events (Esub ) re- Our prompt includes demonstration and chain of
lated to the main event, and a subsubevent thought (CoT) techniques:
(Esubsub ) can be used if an event is part of a • We manually annotated the hierarchical struc-
sub-event within the hierarchy. ture for one text in the schema learning dataset
to use as an example in the prompt.
The suffix P0.5 indicates the importance of a sub-
event to its parent event. The Gate specifies the • We provided a step-by-step CoT, showing how
relationship between the main event and its sub- E and Esub in H were extracted from specific
events: sentences in the schema learning dataset.
• Acquisition and Investment ◼ US battery supply chain investments reach US$92 billion since Biden took office
◼ Exclusive: Canada to invest C$2 billion on mineral strategy for EV battery supply chain ◼ Biden’s EV bet is a gamble on critical minerals
◼ China's EV battery materials industry set for $11bn capacity buildup ◼ Dead EV batteries turn to gold with US incentives
◼ Why carmakers are pouring billions into new electric vehicle battery factories ◼ DOE intends to award up to $37M to advance EV battery recycling, transportation and design
◼ Private equity in talks with UK's BMI for EV battery exposure ◼ UK Inflation Rate Reaches 40-Year High As Food Prices Surge
◼ CATL and Indonesia jointly build a nearly $6 billion power battery industry chain project ◼ Bank Of England To Get More Aggressive With 50 BPS Hike Later In The Week
◼ Durham battery storage company raises $100 million ◼ Huge Tax Cuts Are Being Questioned By Investors As Pounds Sinks
◼ Volkswagen announces $20 billion effort to build its own EV batteries ◼ Biden Threatens Windfall Tax As He Accuses Oil Companies Of War Profiteering
◼ Panasonic to open $4B EV battery plant in Kansas ◼ Inflation Of The United Kingdom Jumps To 41-Year High Of 11.1%
◼ CATL announces construction of second European factory in Hungary ◼ China Sets 5% As Their Economic Growth Target For 2023
◼ Tesla co-founder’s startup gets $2 billion to boost EV battery production ◼ Ukraine Secures First IMF Loan To A Country At War
◼ CATL and HGP establish partnership to jointly promote 5GWh battery energy storage application ◼ Largest Oil Refinery In Africa Launched By Aliko Dangote
◼ GM and POSCO Future M Investing $1bn in North American EV Battery Supply Chain ◼ China’s Economy Experienced A Growth Of 6.3% In The Second Quarter
◼ Companies invest in EV battery factories in Europe ◼ China To Kickstart Economy, After Plans To Improve Internal Migration
◼ Ecoprobm, SK, Ford investing in québec; building cathode plant to solidify EV supply chain in NA ◼ A 40% Windfall Tax Was Approved By Italian Government As A Result Of Soaring Profits
◼ Redwood Materials raises $1B to expand circular battery supply chain in US ◼ France Plans To End The Use Of Fossil Fuels By 2030
◼ Nissan leads $2.5 billion investment to build two more EVs in UK ◼ South Africa Gets $1 Billion Loan From World Bank To Tackle Power Crisis

• Trade Policy •Changes in Supply and Demand


◼ As the US struggles to “green” supply chains, new EU battery regulation offers lessons ◼ How a battery shortage could threaten US national security
◼ The New Climate Bill Demands All-American EV Batteries ◼ EV battery report: Taiwan's rising role in the global supply chain
◼ New US Climate Bill Seeks to Onshore Electric Vehicle Supply Chain ◼ EV battery report: China remains dominant with growing production capacity and presence
◼ The Inflation Reduction Act places a big bet on alternative mineral supply chains worldwide
◼ The New Climate Bill Demands All-American EV Batteries ◼ How a handful of metals could determine the future of the electric car industry
◼ U.S. Push to Secure EV Battery Supply Chains and the Role of China ◼ EV battery report: Malaysia may be dark horse in Southeast Asia's EV sphere
◼ U.S. strikes at China with EV battery deal ◼ Raw materials in short supply for EV makers struggling to meet customer demand
◼ EU Could End Reliance On Chinese Battery Supply Chain By 2030 Says T&E ◼ EV Has a Problem: 90% of the Battery Supply Chain ‘Does Not Exist’
◼ Ford-CATL Partnership Illustrates the Challenge of Decoupling EV Supply Chains ◼ Do you like minting money?’: Musk urges entrepreneurs to enter lithium space as Tesla’s supply
◼ Global EV battery supply chain puzzles over China graphite curbs woes persist
◼ US-Canada Critical Mineral, EV Battery, and Semiconductor Cross-Border Supply Chain Issue ◼ Tin’s Critical Role in the Battery Supply Chain
◼ New US rules on Chinese batteries could push up price of electric cars ◼ Surging EV sales hitting high lithium prices, supply chain constraints: experts
◼ China restricts exports of graphite as it escalates a global tech war ◼ What is Vietnam’s Mining Capacity for EV Batteries?
◼ China says Biden plan to shut it out of US battery supply chain violates WTO rules ◼ Supply Chain Disruptions in the Energy Industry: Challenges with the Supply of Lithium-ion
◼ US to limit Chinese firms, battery parts from winning EV tax credits Batteries
◼ Senator asks Treasury to bar Chinese battery firms, minerals from US EV tax credits ◼ Making EVs without China’s supply chain is hard, but not impossible – 3 supply chain experts
◼ UK Issues New Round of Targeted Sanctions Against Russia outline a strategy
◼ Further Sanctions Against Russia Being Discussed By EU ◼ Ford's answer to EV supply chain hell: Cheaper batteries
◼ Gas Supplies To Poland And Bulgaria To Be Cut Off By Russia ◼ Almost 400 new mines needed to meet future EV battery demand, data finds
◼ Russian Billionaire Shields Assets From European Sanctions ◼ Power spike: How battery makers can respond to surging demand from EVs
◼ Singapore's National Dish Affected By The Malaysian Export Ban ◼ China’s Battery Supply Chain Tops BNEF Ranking for Third Consecutive Time, with Canada a Close
◼ Gulf States Sanction And Boycott India After Unwanted Remarks Second
◼ Russia’s Economy Will Be Hit By Further Sanctions ◼ Canada vaults to second spot ahead of the U.S. in global EV battery-supply chain ranking
◼ China Sanctions Pelosi, Halts Cooperation With The United States Over Pelosi’s Taiwan Trip ◼ Is the EV Battery Supply Chain Ready for the Approaching Demand?
◼ China Vows To Take Countermeasures As The United States Approves $1.1BN Arms Sales To ◼ Jump-starting electric car batteries: Will supply problems stall California’s mandate?
Taiwan ◼ Chinese companies gain momentum in U.S. electric vehicle supply chain
◼ Grain Export Deal Between Ukraine And Russia Brokered By United Nations Suspended By Russia ◼ EV batteries: Can the West catch up with China?
◼ Grain Deal Extended By Russia And Ukraine Amid Disagreement ◼ China Has Perfectly Tangled The Battery Value Chain With Electric Vehicles - A Combo The U.S.
◼ G7 Request For Black Sea Grain Deal To Be Extended And Europe Will Find Hard To Beat
◼ Russia Confirm It Will Not Renew Grain Deal With Ukraine ◼ It's official: The battery crunch is the new chip shortage
◼ Ukraine Welcomes The Arrival Of First Grain Ships Using New Route ◼ An EV Talent Gap Will Weaken the U.S. Battery Supply
◼ China Promise To Deepen Trade Ties With Vietnam ◼ Auto workers worry it takes less labor to build electric cars. Maybe not, some researchers say
◼ Eight North Korean Sanctioned By South Korea Over Arms Trade ◼ Fear of cheap Chinese EVs spurs automaker dash for affordable cars

• EV Enterprise Related • Natural Disaster


◼ VW and Bosch to upscale EV battery output in Europe ◼ Three Tiny Islands Have Borne the Brunt of Tonga’s Tsunami
◼ Governor Ivey Joins Dura Automotive to Celebrate Grand Opening of High-Tech Factory in Muscle ◼ More than 30,000 displaced by floods in Indonesia’s Sumatra
Shoals for EV Battery Enclosures ◼ Second Round Of Cyclone Hits Madagascar In The Space Of Two Weeks
◼ Auto Giants Race to Build U.S. EV Battery Assembly Plants ◼ Heavy rains, landslides kill scores in Brazilian mountain city
◼ CATL's German factory obtains battery cell production license ◼ Australian Flood Worsens As State Emergency Perform Evacuation
◼ Automakers race to build EV battery supply chains in North America ◼ Wildfire Spreads Near Chernobyl Disaster Site
◼ General Motors Fortifies EV Battery Supply-Chain Links ◼ Tornadoes Break Out In Texas As Weather Worsens And 23 Are Injured
◼ GM's North American battery supply chain is key to EV profits ◼ The Philippines Continue To Be Bombarded By Tropical Storm Megi
◼ CATL's German factory successfully achieves battery cell production ◼ South Africa's Government To Begin Rebuilding After Disastrous Floods
◼ The South is building the most vibrant EV and battery hub in the US ◼ Floods From Heavy Rain Destroys Kabul Homes Killing 22 People
◼ Thunderstorms In Quebec And Ontario With 5 Recorded Deaths
◼ Australia May Face A Summer Of Flood And Rains
•Production Technology Progress ◼ Tens Of Thousands Of People Displaced Due To Flood In China
◼ The Transportation Transformation: Battery Research Today and Tomorrow ◼ Japan Faces Its Worst Recorded Heat Wave Since 1875
◼ CATL releases Kirin battery with global highest integration ◼ Earthquake In The Southern Iranian Region Kills Five People
◼ Ford releases new battery capacity plan, raw materials details to scale EVs ◼ Floods From Torrential Rain Threatens Communities In Australia
◼ How the US plans to transform its lithium supply chain ◼ Mayor Of London Declares Emergency Response To The Heatwave
◼ Local, clean and circular supply chains: Panasonic advances EV battery tech ◼ Heatwaves And Wildfires Continue To Occur More Often
◼ How Lithium Batteries Can Power the US Economy ◼ Factories Were Forced To Close Down As China Experiences Worst Heatwave In 6 Decades
◼ Ford taps Michigan for new LFP battery plant; new battery chemistry offers customers value, ◼ Half A Million People Affected By Flood In Nigeria, According To Emergency Reports
durability, fast charging, creates 2,500 more new American jobs ◼ More Than 80 People Reported Dead Trying To Escape Flood In Nigeria
◼ Electric Vehicle Battery Manufacturing Capacity in North America in 2030 is Projected to be ◼ Greenhouse Gases Reach Record High Levels In The Atmosphere
Nearly 20 Times Greater than in 2021 ◼ Carbon Emission To Hit Record High In 2022
◼ Ascend Elements Opens North America's Largest Electric Vehicle Battery Recycling Facility in ◼ Summer Heat Waves Estimated To Be Responsible For Up To 20,000 Deaths
Georgia ◼ Scientists Reveal Rising Fetal Distress Due To Climate Change
◼ Study unveils policy insights for reshoring EV battery production ◼ 60,000 People Reported With Covid-Related Death In China In Less Than 40 Days
◼ Batteries: EVs to use silicon, solid state for next-generation batteries ◼ Eastern Asia Witnessing Extreme Cold Temperature
◼ BMW powers Spartanburg with ‘local for local’ battery supply chain ◼ Turkiye And Syria Devastated By Massive Earthquake
◼ New EV Battery Materials Will Beget New Dilemmas ◼ Multiple And Rapidly Spreading Wildfire Kills 23 In Chile
◼ Panasonic needs four more EV battery plants, executive says ◼ Death Toll Rises And Rescue Efforts Continue In Turkiye After Earthquake
◼ Study Finds Ozone Recovery May Be Slowed By Australian Wildfires
•Economic Environment ◼ Global Warming Cited As The Cause For Elongated Drought In The Horn Of Africa
◼ Indonesia’s Battery Industrial Strategy ◼ Wildfire Destroys Region In Russia With Extensive Damage To Infrastructure
◼ DOE makes $3.1B available for battery manufacturing incentives ◼ Key Measures Predict That The Earth Is Falling Sick
◼ Developing a resilient Canadian battery supply chain ◼ Pacific El-Nino To Increase The Heat Of The Planet In 2024
◼ Battery Policies and Incentives Database Contributes to U.S. Efforts To Build a Secure Electric ◼ Early Data Report Showing 2023 To Be Hottest Year On Record
Vehicle Battery Supply Chain ◼ Heatwaves And Wildfire Smoke Sandwich The US In Fresh Climate Concerns
◼ US increases production to catch China in global battery race ◼ The World’s Hottest Day Recorded Since Recording Began In 1979
◼ The CHIPS Act Is Essential. So Is a Resilient EV Battery Supply Chain ◼ Panic In India As The Yamuna River Rises To Unprecedented Levels
◼ EV tax credits could stall out on lack of US battery supply ◼ Hail Storms Hit Italy, And A Fourth Heat Wave Predicted In Europe
◼ Electric Vehicle Battery Production May Lead To Coal Country’s Return ◼ Flood Triggered In Hong Kong After A Heavy Typhoon
◼ DOE taps 20 companies to receive $2.8B for battery manufacturing, minerals processing build-out ◼ Australians Exposed To Smoke Blanket Following Hazard Reduction Burns
◼ The future of vehicles is electric': Biden announces $2.8B for battery supply chain ◼ New Reports Show That 10% Of Swiss Glaciers Depleted In 2 Years
◼ Canada Has an EV Edge, If It Acts Now ◼ New Climate Discovery Shows September As The Hottest Month On Record
◼ S2.8B U5 EV supply-chain push appears to favor red states ◼ Hailstorms Become More Severe Around Sydney, Australia
◼ Why Canada has the potential to become an EV battery supply chain powerhouse ◼ A Long Year Of Several Wildfires May Change The Climate Of Canada
◼ Dreadful Heatwave Spreads Across Brazil
◼ Heavy Flood And Rain In Tanzania Kill 47 In Hanang District
◼ Fumes And Fires After Volcanic Eruption In Iceland

Figure 7: Categories and examples of news articles in the dataset.


• Sign a Supply Agreement • Political Issues
◼ LG Energy signs $9bn EV supply chain deal in Indonesia ◼ In 2024, Republican EV attacks may fall short as swing states reap investment
◼ GM signs agreement to source cobalt from Glencore ◼ Burkina Faso: Military coup prompts fears of further instability
◼ One-time tourist hotspot to supply key electric car battery ingredient for Stellantis ◼ North Korea missile tests: What does Kim Jong-un want?
◼ CATL and FlexGen reach 10GWh battery energy storage system supply agreement ◼ North Korean Missile Tests Reach New Milestone
◼ Stellantis signs non-binding supply deal for raw materials needed for EV batteries ◼ Nuclear Weapons Testing Resumed In North Korea, According To Surrounding Countries
◼ The U.S.-Zambia-DRC Agreement on EV Batteries Production: What Comes Next? ◼ Amid Russian-Ukraine Crisis, Finland Pushes to Join NATO
◼ Honda signs supply deal with Ascend Elements for recycled battery materials ◼ Complete Reversal Of Trump’s Withdrawal As Biden Approves Redeployment Of US Troops To
◼ Renault signs deal for EV battery supply with Verkor Somalia
◼ Ford inks long-term lithium supply contracts ◼ Turkey Moves To Block Finland And Sweden NATO Bid
◼ Canada, Japan agree to work more closely on battery supply chains ◼ Biden Pledges Support For Taiwan In A Statement That Has Raised Eyebrows
◼ Samsung SDI to supply EV batteries to Hyundai Motor starting 2026 ◼ Russian Diplomat Resigns In Geneva To Protest Against Russia-Ukraine Tensions
◼ Exclusive: US, Indonesia to discuss potential for deal on EV minerals ◼ South Korea Says North Korea Launched 8 Short-Range Ballistic Missiles
◼ Honda, Mitsubishi Corp sign pact to optimise use of EV batteries ◼ The US Has To Improve Bilateral Relationship, Says China’s Defense Chief
◼ CATL and Stellantis Group sign strategic memorandum of understanding to supply lithium iron ◼ South Korea To Counter The North Korean Threat By Boosting Defense Capacity
phosphate batteries to Stellantis Group in European market ◼ Tension In Taiwan Strait From China’s Military Activities
◼ Third Aircraft Carrier Launched By China To Boost Military Might
•War and Conflict ◼ European Union Executives Back Ukraine’s Membership Bid
◼ Russia’s War in Ukraine Reveals a Risk for the EV Future: Price Shocks in Precious Metals ◼ Putin Issues Warning To Finland And Sweden Against NATO Agenda
◼ Russia-Ukraine conflict exposes risks in EV supply chains ◼ The United States Of America Intends To Increase Its Military Presence Throughout Europe
◼ America Prepares for a Russian Invasion of Ukraine ◼ Islamic State Raids Medium Security Prison, Freeing Insurgents
◼ Russia Launches Military Attack On Ukraine ◼ Act As Partner, Not Opponent, China’s Wang Yi Tells Australia
◼ Putin Progress In Donbas Slow But Visible Says Boris Johnson ◼ State Of Emergency Declared In Sri Lanka As The President Flees To The Maldives
◼ Battle Rages On In Eastern Europe As Explosions Rock Ukraine’s Capital ◼ Biden Assures The Middle East That United State Will Remain An Active Partner
◼ UK Defense Minister Visits Ukraine Amidst Crisis ◼ Ukraine President Zelenskiy Fires Spy Chief And Top State Prosecutor
◼ Tension In Drone Crashes Into Russian Oil Refinery In A Possible Attack ◼ Fresh Crisis Loom In Sri Lanka As Parliament Elects Ranil Wickremesinghe As President
◼ Ukraine Engage In Exchange Of Prisoners of War With Separatist Region ◼ EU Has Launched Four Legal Cases Against the UK Over The Northern Ireland Protocol
◼ Two More Britons Charged As Foreign Mercenaries By Separatist Region ◼ Top Delegation From The US Visits Kyiv, Promises Their Continued Support
◼ Enemy Drones’ Attack On Gas Rig Shot Down By Israel ◼ Taiwan: Pelosi Departs Taipei Due To Sound Of Chinese Fury
◼ Ukrainian Flag To Be Hoisted On Snake Island After Russian Retreat ◼ South Korea's Aid Offer Rejected By North Korea, Calls President Yoon Simple
◼ Rocket Attack On Apartment Building In Ukraine Leaves Six People Dead ◼ Final Draft Of Nuclear Disarmament Treaty At The United Nations Gets Blocked By Russia
◼ Russian-Controlled Region Hit By Ukrainian Rockets In Preparation For A Counter-Attack ◼ World Largest Electronic Market Shut Down In China As Shenzhen Imposes Lockdown
◼ Ukraine Flags Russian Strike Risk, As They Set To Begin Grain Exports ◼ Russian Envoy Confirms That Putin And Xi Will Meet In Person Next Week
◼ Russia Confirms That Blast Has Killed 40 Ukrainian Prisoners ◼ India Confirms It Discovers Fraudulent Shell Companies Linked With China
◼ Ukraine Reports Extensive Damage To Nuclear Plants By Russian Rocket ◼ With Nuclear Talks At Halt, Israel Gives Stern Warning Over The Capability Of Iran On Uranium
◼ A Recent Blast In Moscow Killed Daughter Of Putin Ally, Darya Dugina ◼ An Arrest Warrant Has Been Issued In South Korea For The Developer Of The Cryptocurrency Luna
◼ Zelensky Vows Ukraine Will Take Back Crimea When It Chooses ◼ The United States Has Been Accused By China Of Sending Dangerous Signals To Taiwan
◼ Prime Minister Of Ukraine Expresses Gratitude To Germany And Calls For More Weapons ◼ In Anticipation Of United States Vice President’s Visit, North Korea Fire Ballistic Missile
◼ Zelensky Claims Significant Gains As Ukrainian Forces Retake A Key City ◼ A Ballistic Missile Launched By North Korean Is Believed To Have Flown Over Japan
◼ Russia Proclaims The Annexation Of Ukrainian Territory As Military Setback Looms ◼ Coup Leader Ibrahim Traore Named As Transitional President Of Burkina Faso
◼ Lyman, A Key City In Eastern Ukraine, was Retaken, As Russian Troops Retreat ◼ President Xi Jinping Announced That China Would Never Renounce The Right To Use Force Over
◼ Ukrainian Tanks Break Through Russian Lines In Kherson, In The Southern Part Of Ukraine Taiwan
◼ Putin’s War Effort Suffers Huge Blow Following Massive Blasts Of Crimean Bridge ◼ Chief Of Cybersecurity In Germany Sacked Over Reports Of Ties With Russia
◼ Moscow's Campaign In Ukraine Suffers A New Blow, As Gunmen Killed 11 People In Russia Army ◼ A Rare Talk Between United States And Russia Defense Ministers, As They Discuss Ukraine War
Base ◼ Xi Jinping Begins His Third Term, Marking Him As The Most Powerful Leader Of China In Decades
◼ At Least Four People Were Killed When Russia Launched Kamikaze Drone Attack ◼ China Accused Of Establishing Illegal Police Station In The Netherlands
◼ The Conflict With Ukraine Intensifies In The East, Poland, NATO Say Missile Likely Not From Russia ◼ A Law To Mobilize Convicted Russians Has Been Signed By Putin
◼ Two Russian Airbases Far From Ukraine Frontline Rocked By Explosions ◼ North Korea Fires Four Ballistic Missiles As Seoul And U.S. Ends Drill
◼ Russia's Missile Strike Leaves Ukraine's Second City, Kharkiv, Without Power ◼ Yevgeny Prigozhin, An Ally Of Putin, Admits Interfering In United State Elections
◼ Three People Dead After Drone Attack On Russian Bomber Base ◼ North Korea Denies Being Involved In Arms Deal With Russia
◼ Russia Claims Victory After A Long Battle For The Salt Mine Town In Ukraine ◼ President Of Taiwan, Tsai Ing-Wan, Emphasizes The Sovereignty Of Taiwan
◼ Ukraine To Receive Leopard 2 Tanks From Germany ◼ United Kingdom’s Prime Minister, Rishi Sunak, Pledges His Support During Visit To Kyiv
◼ An Israeli Raid In Jericho Leaves Multiple Palestinian Militants Dead ◼ Russia Demands Recognition Of Annexed Region Before Negotiations
◼ United States Drone Crashes After Encounter With Russian Jet ◼ Prisoner Swap: Brittney Griner For “The Merchant Of Death”
◼ Ukraine Receives Leopard 2 Tanks From Germany ◼ Ukraine President Volodymyr Zelensky Addresses The Us Congress Upon Visit
◼ Putin Travelled To Ukraine To Visit The Occupied Kherson Region ◼ North Korea Accuses U.S. Of Escalation As They Deliver Tanks To Ukraine
◼ 25 People Dead, After Series Of Russian Air Strikes Hit Ukrainian Cities ◼ Ukraine Disappointed As U.K. And U.S. Refuse To Send Them Their Fighter Jets
◼ Russia Launches Biggest Drone Attack In War With Ukraine ◼ European Union Calls For Urgent Joint Arm Purchase To Help Ukraine
◼ Hospital In Ukraine Destroyed By Missile Launched By Russia ◼ Ballistic Missile Fired By North Korea Off East Coast
◼ Russian Strike Kills Two-Year-Old Girl In Ukraine ◼ Biden Visits Ukraine For The First Time Since Russia’s Invasion
◼ Advance On Moscow Halted By Wagner Chief, Yevgeny Prigozhin ◼ China At G20 Meeting Calls For Join Action In Debt Settlement
◼ Russian Air Strike Hits Idlib Market, Kills Nine People ◼ U.K. And EU Come To An Agreement Over Northern Ireland
◼ Ukraine To Receive Cluster Munitions From The United States ◼ Xi Jinping Secures Third Term As The President Of China
◼ Russian Strike Hits Center Of Ukraine, Kills Seven People ◼ U.S., U.K., And Australia Reach Agreement On Nuclear Submarine Project
◼ Russian Aircraft Destroyed By Ukrainian Drone ◼ President Of Honduras Confirms Her Country Has Switched Ties From Taiwan To China
◼ Wagner Boss, Yevgeny Prigozhin, Listed As Part Of The Crew Of Plane Crash With No Survivors ◼ International Criminal Court Issues Arrest Warrant For Russian President
◼ Attack On Market In A Ukrainian City Has Killed At Least 17 People ◼ Russia’s President, Vladimir Putin, Pays Surprise Visit To Mariupol
◼ Several Missiles Has Been Launched By Ukraine On Crimea ◼ President Of China, Xi Jinping Visits Russia For The First Time Since Russia’s Invasion
◼ Ukraine To Receive Long-Range Missiles From The United States ◼ Honduras Announces Ending Its Diplomatic Tie With Taiwan
◼ Ukraine To Receive Seized Iranian Ammunitions From United States ◼ Finland's Bid To Join NATO Approved By Turkey
◼ Israel In Bid To Repel Militants Declares War Against Hamas ◼ Japan Reveals Plans To Develop Long-Range Missile Amid Tension With China
◼ Airstrike Hits Hospital In Gaza, Killing Over Hundred People ◼ Agreement On Vital Nuclear Weapons Deal Reached By United States And South Korea
◼ Missile Strike On Kharkiv Kills Six Postal Workers ◼ Syria Reinstated Into Arab League After Relations With Assad Normalize
◼ 15 People Confirmed Dead In An Israeli Attack On An Ambulance In Gaza City ◼ Former Pakistan’s Prime Minister, Imran Khan, Arrested In Islamabad
◼ Hospital In Gaza Has Been Surrounded By Israeli Tanks ◼ Joint Action Against China To Be Pledge By United States And EU
◼ Israel-Hamas Truce Ends, As Israel Launches Attack On Gaza ◼ Zelensky Plans To Attend The G7 Summit In Japan
◼ Immunity Granted To Putin And Bric Leaders By South Africa
◼ Ceasefire Of 24 Hours In Sudan Announced By United States And Saudi Arabia
• Humanitarian and Ethical Crisis ◼ Chinese Control Of Pirelli, Blocked By Italian Government
◼ China’s electric vehicle battery supply chain shows signs of forced labor, report says ◼ Former Security Officials In Ukraine Have Been Charged With Treason
◼ EV battery imports face scrutiny under US law on Chinese forced labor ◼ NATO Chief Confirms That Sweden’s NATO Bid Backed By Turkey
◼ Sudan protesters: Ready to die for freedom ◼ North Korea Launches Ballistic Missile After Threatening The United States
◼ Amazon Rainforest Reaches Dire New Record For Deforestation ◼ President Of South Korea Promises Ukraine $150m Aid
◼ Evacuees In Ukrainian Azovstal Iron And Steel Works Share Horror Stories ◼ Zelenskiy Dismisses Ukraine’s Ambassador To London
◼ North Korea Announces its first Covid Outbreak Since The Start of The Pandemic: Triggers ◼ Niger’s President Removed, As Soldiers Announce Coup On National Television
National Emergency ◼ Leader Of Niger Coup, Abdourahmane Tchiani, Declares Himself Leader Of The Country
◼ One Month Since Burkina Faso’s Zinc Mine Trap: Miners’ Wives Pray For Miracles ◼ Former Pakistan Prime Minister Imran Khan Sentenced To Three Years In Prison
◼ Floods And Monsoon Rain Lead To Humanitarian Crisis In Pakistan ◼ Assassination Plot Against The President Of Ukraine Neutralized
◼ Ukraine Government Confirm The Discovery Of 440 Bodies In Unmarked Graves ◼ Politicians And Journalists In The United Kingdom Sanctioned By Russian Government
◼ Gunman Attacks A Russian School In The City Of Izhevsk ◼ French Ambassador In Niger Given 48 Hours To Leave The Country
◼ Coal Mine Plans To Crush UK Climate Goals ◼ Coup Plotters Take Over Gabon And Put The President Under House Arrest
◼ Chaos Erupt During The Arrest Of El Chapo’s Son, Leaves At Least 29 Dead ◼ Constitutional Court In Gabon Swears In Military Junta Leader As Interim President
◼ Lethal Cough Syrup Kills 200 Children In Indonesia ◼ Emmanuel Macron Says French Ambassador In Niger Is Being Held Captive
◼ Suicide Blast Kills At Least 54 People In Political Gathering In Pakistan ◼ Canada And India Both Expels Envoys Over The Killing Of Sikh Leader
◼ Aid Enters Into Gaza Through The Rafah Crossing Point ◼ Canada Asked By India To Withdraw Its Diplomatic Staff From India
◼ Gaza Healthcare Collapses As Palestine Plunges Into Humanitarian Disaster ◼ European Union Says Ukraine Is Ready To Start Accession Talks
◼ Foreign Nationals And Injured Palestinians Allowed Through Rafah Border Crossing ◼ Ceasefire Deal Of Four Days Have Been Agreed By Israel And Hamas
◼ First Day Of Ceasefire Begins, With Hamas Releasing 24 Hostages
◼ Qatar Confirms the Extension of Ceasefire Between Israel And Hamas By Two Days
◼ President Putin Orders Increase In The Size Of Russian Military
◼ President Putin Confirms He Would Run For Fifth Term As The President Of Russia
◼ General Assembly Of The United Nations Has Voted For An Immediate Ceasefire In Gaza
◼ Russian Opposition Leader, Navalny Found In Remote Penal Colony

Figure 8: Distribution of sources in the news dataset.


According to the provided paragraphs:

### {}_Paragraphs_provided ###

extract a detailed hierarchical structure related to the EV battery supply chain.


The hierarchical structure should include the following levels:
- **Event**: Anything that happens related to the EV battery supply chain.
- **Event ID**: A unique identifier for each event.
- **Description**: A detailed 2-3 sentence explanation of the event.
- **Participants**: All sub-events related to this event and their importance, the importance needs
to be set as 0 ~ 1, the higher the more important.
- **Gate**: The relationship between an event and its sub-events:
- Use **’and’** if no sub-events can be missing.
- Use **’or’** if some sub-events can be missing.
- Use **’xor’** if sub-events cannot exist simultaneously.
- **Relations**: The event-event relations (e.g., ev1.1>ev1.2, which means ev1.2 happens after ev1.1)
.
- If any level is empty, set its value to ’xxxx’.

Strictly use the exact following format for each event:


‘‘‘
Event N
event: [Event Name]
event_id: evN
description: [Detailed Description]
participants: [Subevent 1] evN.1_P[Importance], [Subevent 2] evN.2_P[Importance], ...
Gate: [Gate]
Relations: [Event Relations]

Subevent N.1
subevent: [Subevent Name]
event_id: evN.1
description: [Detailed Description]
participants: [Subsubevent 1] evN.1.1_P[Importance], ...
Gate: [Gate]
Relations: [Event Relations]

Subsubevent N.1.1
subsubevent: [Subsubevent Name]
event_id: evN.1.1
description: [Detailed Description]
participants: [Subsubsubevent 1] evN.1.1.1_P[Importance], ...
Gate: [Gate]
Relations: [Event Relations]
‘‘‘

Figure 9: Example of hierarchical structure extraction. (Part 1)

The prompt given to the LLMs is detailed and spe- the EV battery supply chain domain.
cific, ensuring that the models understand the ex-
act format and type of information we are extract- D Schema Generation & Merging
ing. By integrating demonstration and CoT tech-
niques, our prompt provides clear guidance to the With human-in-the-loop schema induction, our
LLMs, improving the accuracy and relevance of schema learning dataset generated 125 individ-
the extracted structures. Below is an example of ual schemas ⟨S1 , S2 , . . . , S125 ⟩. To create a single
the prompt used in Fig. 9. comprehensive schema, it is essential to merge all
individual schemas into a final schema (Sfinal ). The
To validate our approach, we tested the prompt with process of merging schema format files involves
various texts from the schema learning dataset. The systematically integrating multiple schemas into
hierarchical structures extracted were compared a cohesive schema. The key components include
with manually annotated structures to ensure accu- context, id, events, and relations. These compo-
racy and consistency. This process ensured that the nents determine the information in each event and
LLMs reliably produced high-quality hierarchical its correlation with other events, hence the merging
structures that aligned with expert knowledge in process must address all of them.
Use the provided example for guidance:
### Example:
**Input Paragraph**:
‘‘‘
Three main methods are used in lithium-ion recycling: pyrometallurgical, hydrometallurgical,
bioleaching, and direct recycling. The battery is melted in a hot furnace to recover some of the
cathode metal in pyrometallurgy. Pyrometallurgy employs extreme heat to transform metal oxides
into cobalt, copper, iron, and nickel alloys. Although it has a straightforward process and a
reasonably mature technology, the main drawbacks are its high cost and high environmental
pollution. Hydrometallurgy is a metal recovery method involving aqueous solutions to perform
leaching processes to precipitate a particular metal. In hydrometallurgy, specialized solution
reagents are primarily used to leach the targeted metals out from the cathode substance.
Although it is a highly effective and power-efficient method, its drawbacks include a lengthy
production time and a complicated process. Combinations of both pyrometallurgy and
hydrometallurgy are also used due to their advantages in sorting starting materials for cells.
The bioleaching technique uses bacteria to retrieve precious metals, but it is challenging
because the bacteria need a substantial amount of time to grow and are easily susceptible to
contamination.
‘‘‘

**Extracted Hierarchical Structure**:

‘‘‘
Event 1
event: lithium-ion recycling
event_id: ev1
description: Methods for recycling lithium-ion batteries including pyrometallurgical,
hydrometallurgical, bioleaching, and direct recycling.
participants: pyrometallurgical ev1.1_P1, hydrometallurgical ev1.2_P1, bioleaching ev1.3_P1
Gate: or
Relations: ev1.1>ev1.3, ev1.2>ev1.3

Subevent 1.1
subevent: pyrometallurgical
event_id: ev1.1
description: Employs extreme heat to transform metal oxides into cobalt, copper, iron, and nickel
alloys.
participants: metal oxides ev1.1.1_P1, cobalt ev1.1.2_P0.5, copper ev1.1.3_P0.5, iron ev1.1.4_P0.5,
nickel alloys ev1.1.5_P0.5
Gate: and
Relations: ev1.1.1>ev1.1.2, ev1.1.1>ev1.1.3, ev1.1.1>ev1.1.4, ev1.1.1>ev1.1.5

Subevent 1.2
subevent: hydrometallurgy
event_id: ev1.2
description: Uses aqueous solutions to leach targeted metals out from the cathode substance.
participants: xxxx
Gate: xxxx
Relations: xxxx

Subevent 1.3
subevent: bioleaching
event_id: ev1.3
description: Uses bacteria to retrieve precious metals.
participants: xxxx
Gate: xxxx
Relations: xxxx
‘‘‘

Think about this extracted structure step by step:


Starting with the first sentence in the paragraph ’Three main methods are used in lithium-ion
recycling: pyrometallurgical, hydrometallurgical, bioleaching, and direct recycling.’ From this
sentence, we learn that ’pyrometallurgical’, ’hydrometallurgical’, ’bioleaching, and direct
recycling’ are three methods of ’lithium-ion recycling’, so select ’lithium-ion recycling’ as
the event, and the three methods as subevents and participants of ’lithium-ion recycling’.

Figure 9: Example of hierarchical structure extraction. (Part 2)


Algorithm 2 Schemas Merging Pseudocode lationSubject and relationObject and updating the
1: Input: List of schemas relations accordingly. It is important to ensure that
2: Output: Merged schema both subject_name and object_name are present
3: 1. Merge contexts from all schemas:
4: for all schema in schemas do in the name_to_id dictionary, which stores event
5: for all context in schema["@context"] do names and their related event IDs. The updated
6: if context not in merged_contexts then relation is added to the merged_relations list if it
7: Add context to merged_contexts
8: end if is not already present, ensuring all connections are
9: end for accurately maintained.
10: end for
11: 2. Merge events from all schemas by event name: Finally, the comprehensive merged schema (Sfinal )
12: for all schema in schemas do
13: for all event in schema["events"] do is created by including all merged contexts, events,
14: event_name = event["name"] and relations. The detailed pseudocode for merging
15: if event_name not in merged_events then schemas is shown in Algorithm 2. This algorithm
16: merged_events[event_name] = event
17: else ensures that all relevant information is retained and
18: merged_events[event_name] = accurately integrated, resulting in a comprehen-
merge_event_details(merged_events[event_name],
event)
sive schema that encapsulates the full breadth of
19: end if the data from the schema learning dataset. The
20: end for final schema (Sfinal ) enables accurate and efficient
21: end for
22: 3. Merge relations / update event IDs by event names: knowledge extraction and organization, enhancing
23: for all schema in schemas do the utility of the dataset for downstream tasks such
24: for all relation in schema["relations"] do as event prediction and analysis.
25: subject_name = GET event name by event ID
relation["relationSubject"]
26: object_name = GET event name by event ID rela- E Schema Management System
tion["relationObject"]
27: if subject_name in name_to_id and object_name The schema management interface (Fig. 10) facili-
in name_to_id then
28: relation["relationSubject"] =
tates the visualization, editing, and management of
name_to_id[subject_name] schemas. It includes the following modules:
29: relation["relationObject"] =
name_to_id[object_name] E.1 Schema Viewer
30: if relation not in merged_relations then
31: Add relation to merged_relations The schema viewer is crucial for visualizing
32: end if
33: end if schemas, providing an intuitive representation of
34: end for events. It organizes events into a left-to-right tree
35: end for structure, highlighting parent-child relationships.
36: 4. Final Schema:
37: The final merged schema includes all merged contexts, Within this structure, before-after relationships
events, and relations, and is saved for evaluation. among child nodes are indicated through arrows
and vertical ordering. Users can expand event
nodes to reveal details such as descriptions, im-
To begin the merging process, we first aggregate portance levels, and participant roles.
the context data from all schemas. Each context is
added to a merged_contexts_list, ensuring that du- Key features of the schema viewer include:
plicate contexts are avoided. This step is crucial to • Interactive Exploration: Users can click
maintain a unified context for the merged schema. on nodes to expand or collapse details about
Next, we proceed to merge events from all schemas. events and sub-events.
Using the event name as the identifier, we check if • Contextual Information: Hovering over a
the event already exists in the merged_events_list. node displays additional context and metadata
If the event exists, its details are merged with the associated with the event.
existing event; otherwise, the event is added di-
• Dynamic Layout: The tree structure dynam-
rectly to the list. This ensures that all events are
ically adjusts to accommodate the addition
comprehensively integrated without duplication.
or removal of nodes, maintaining a clear and
Following the merging of events, we then merge organized visual representation.
relations and update event IDs. This involves re- • Collapsible Subtrees: Users can collapse and
trieving the event names from the event IDs for re- expand subtrees to manage large schemas.
Figure 10: User interface for our schema management system.

• Search Functionality: A search bar allows the schema editing process.


users to quickly locate specific events or enti- • Schema Versioning: The editor maintains dif-
ties within the schema. ferent versions of schemas, allowing users to
• Real-Time Data Binding: The viewer up- track changes over time and revert to previous
dates in real-time as changes are made, ensur- versions if necessary.
ing the displayed schema is always current. • Bulk Operations: Users can perform bulk
• Highlighting and Filtering: Users can high- operations such as adding multiple events or
light specific paths or filter events based on updating several nodes at once.
criteria such as importance or type.
• Conflict Resolution: The editor provides
E.2 Schema Editor tools to resolve conflicts when multiple users
make changes simultaneously.
The schema editor allows users to interactively
modify schemas. Users can add, edit, and delete E.3 Frontend Architecture
events, sub-events, and relationships within the
schema. Key functionalities include: The frontend of system is implemented as a single-
page web application using React and TypeScript.
• Drag-and-Drop Interface: Users can drag
This setup connects to an API server that provides
and drop nodes to reassign parent-child rela-
application logic and access to a centralized schema
tionships or reorder events.
database. The use of a browser-based application
• Form-Based Editing: Clicking on a node offers several advantages, including no need for
opens a form where users can edit event de- user installations, centralized data management,
tails, such as descriptions, importance levels, and extensive functionality through JavaScript li-
and participant roles. braries. Key components include:
• Validation Checks: The editor performs real-
• React9 : A JavaScript library for building user
time validation to ensure that all changes ad-
interfaces, providing the foundation for the
here to the schema format and constraints.
application’s dynamic and responsive design.
• Undo/Redo Features: Users can easily undo
9
or redo changes to maintain the integrity of https://react.dev/
• TypeScript10 : A statically typed superset • Gunicorn14 : A Python WSGI HTTP server
of JavaScript, enhancing code reliability and for running web applications, ensuring robust
maintainability. and scalable performance.
• GoJS11 : A JavaScript library for creating in- • nginx15 : A high-performance web server and
teractive diagrams, enabling robust schema reverse proxy, providing load balancing and
visualization. enhancing security.
• API Integration: The frontend communi- • SQLite16 : A lightweight, disk-based database,
cates with the backend through API calls, chosen for its simplicity and reliability.
fetching and submitting schema data. • RESTful API17 : The backend exposes a
• Responsive Design: The application is op- RESTful API for the frontend to interact with
timized for various screen sizes and devices, schema data, supporting CRUD operations.
ensuring usability across different platforms. • Security Features: Implementations such as
• State Management: The application uses HTTPS, authentication, and authorization to
state management libraries such as Redux ensure data privacy and integrity.
to manage and synchronize the state of the • Scalability: The architecture is designed to
schema data across different components. scale horizontally, with load balancers and
• Performance Optimization: Techniques database replication as needed.
such as code splitting and lazy loading are em-
ployed to ensure fast load times and smooth E.5 AI-Driven Suggestions
interactions. The interface incorporates AI-driven suggestions
The client-side application requests Schema Defini- to assist users in schema creation and modification.
tion Files (SDF) from the API server and displays Large Language Models (LLMs) analyze existing
them to users. Edits to the SDF are maintained schemas and user inputs to provide recommenda-
locally until the user saves the changes, synchro- tions for schema elements, relationships, and struc-
nizing the server-side copy with the client’s modifi- tures. These suggestions are presented in real-time,
cations. A simple locking mechanism is employed enhancing user productivity and ensuring the cre-
to prevent simultaneous edits by multiple users on ation of accurate and comprehensive schemas.
the same schema, ensuring data integrity. Key features of AI-driven suggestions include:
E.4 Backend Architecture • Contextual Recommendations: The system
provides context-aware suggestions based on
The backend of the interface is developed in the current schema and user actions.
Python, leveraging the Falcon web server frame-
• Smart Auto-Completion: As users type or
work, served by Gunicorn and nginx, and supported
modify schema elements, the interface offers
by a SQLite database. The backend is designed to
auto-completion options to expedite the edit-
be lightweight, minimalist, and easy to compre-
ing process.
hend. Most functionalities are concentrated in the
frontend to maintain responsiveness and interactiv- • Error Detection: The AI models detect po-
ity, allowing the backend to focus primarily on data tential errors or inconsistencies in the schema
management. Python’s versatility and popularity and suggest corrections.
make it a suitable choice for the dynamic require- • Learning from User Feedback: The AI mod-
ments of the system. Static typing in Python is els improve over time by learning from user
enforced using Mypy12 to facilitate development feedback and interactions, refining their sug-
and reduce trivial bugs. Key components include: gestions and increasing accuracy.
• Falcon13 : A minimalist web framework for • Interactive Tutorials: The interface includes
building high-performance APIs, facilitating tutorials and guidance to help users under-
efficient communication between the frontend stand and leverage AI-driven suggestions ef-
and backend. fectively.
10 14
https://www.typescriptlang.org/ https://gunicorn.org/
11 15
https://gojs.net/latest/index.html https://nginx.org/en/
12 16
https://mypy-lang.org/ https://www.sqlite.org/
13 17
https://falcon.readthedocs.io/ https://restfulapi.net/
F Details of Event Extraction F.2 Event Argument Extraction
F.1 Event Span Identification Event argument extraction involves identifying the
roles and participants associated with events. This
Event span identification involves locating and
task is framed as extractive question answering,
marking the spans of events within input text. We
where the model extracts argument spans from the
use two models for this task:
text based on role-specific questions. We fine-tune
Base Model: This model is a fine-tuned version RoBERTa-large (Liu et al., 2019) on our internally
of the RoBERTa-large language model (Liu et al., annotated dataset with a sequence tagging loss
2019), trained on an internally annotated dataset. function. For supply chain disruptions, arguments
The task is formulated as sequence tagging, where might include the specific factories, transportation
the model identifies the start and end positions of modes, or materials directly affected by the event.
event spans. For instance, in the context of sup-
ply chain disruptions, the model identifies spans The extraction process is as follows:
corresponding to events like factory shutdowns, 1. Role-Specific Questions: The model is
transport delays, or material shortages. This aligns trained to answer questions like "Which fac-
with the cross-sentence event detection described tory was shut down?" or "What material was
in the main text: delayed?" This method ensures that the argu-
ments are specific and relevant.
EventDetectmulti-sentence (T) → EC (16)
2. Contextual Embeddings: This step is en-
where T represents the input text and EC the de- riched by contextual embeddings generated
tected events. The model uses contextual informa- by BERT:
tion from neighboring sentences to accurately de-
tect event boundaries, ensuring that even complex BERTcontext (EC ) → CE (17)
events spanning multiple sentences are correctly
identified. generating contextual embeddings CE . These
embeddings provide rich semantic informa-
Guided Model: Inspired by Wang et al. (2021),
tion, enabling the model to better understand
this model uses a query-based approach to focus on
the context and improve the accuracy and rel-
schema-related events. The process involves two
evance of the extracted arguments.
stages as follows:
1. Discriminator Stage: Queries representing F.3 Time Expression Linking &
event types are paired with sentences to pre- Normalization
dict if the query corresponds to an event type Time expression linking connects time expressions
mentioned in the sentence. For example, to their corresponding events. Similar to argu-
queries include "factory shutdown due to la- ment extraction, this task uses extractive question
bor strike" or "delay in shipping materials." answering to find start and end times for events.
This stage helps in filtering sentences that are We fine-tune RoBERTa-base using the TempEval3
likely to contain relevant events. dataset (UzZaman et al., 2012).
2. Span Extraction Stage: Sentences identi- The process includes:
fied in the discriminator stage are further pro-
cessed to extract event spans using sequence 1. Extraction: The model identifies time expres-
tagging. This ensures that the extracted spans sions within the text and links them to the
are relevant to the supply chain context. By corresponding events, ensuring that the time-
using sequence tagging, the model accurately line of events is accurately captured.
marks the start and end points of events within
2. Normalization: Identified time expressions
the identified sentences.
are then normalized into standard formats us-
This approach supports the cross-sentence event de- ing SUTime (Chang and Manning, 2012) and
tection described in the main text, enriching event HeidelTime (Strötgen and Gertz, 2013). For
spans with relevant context and ensuring high pre- example, expressions like "next Monday" are
cision in event identification. converted into specific dates.
For supply chain disruptions, this ensures that time- This is critical for tracking entities like factories,
lines for events like "shipment delayed from March suppliers, and shipments across multiple reports of
15 to March 20" are accurately captured. This inte- supply chain disruptions. This supports the corefer-
grates into the event parameter extraction process, ence resolution and event linking described in the
ensuring coherence and consistency. main text:

F.4 Event Temporal Ordering CorefLink(EC ) → EL (19)

Event temporal ordering determines the chrono- yielding linked events EL .


logical sequence of events. We frame this task as F.6 Graph Convolutional Networks (GCNs)
extractive question answering to address label im- for Event Relationship Modeling
balance issues, fine-tuning RoBERTa-large (Liu
et al., 2019) with a sequence tagging loss. We leverage Graph Convolutional Networks
(GCNs) to model complex event relationships and
Steps include: assess each event’s impact. This involves construct-
ing a graph where events are nodes and their inter-
1. Pairwise Temporal Relations: The model
actions are edges.
identifies pairwise temporal relations between
events, such as "Event A happened before Steps include:
Event B."
1. Node Importance Calculation: Each node’s
2. Consistency Checking: Pairwise temporal re- importance is calculated using centrality mea-
lations are processed using Integer Linear Pro- sures, such as degree centrality, betweenness
gramming (ILP) (Schrijver, 1998) to ensure centrality, and eigenvector centrality. These
consistency and resolve any conflicts. This measures help in understanding the influence
method helps in constructing a coherent time- of each event within the network.
line of events. 2. Edge Impact Calculation: Edges represent
This is crucial for understanding the sequence of the magnitude of impact, quantified by mea-
disruptions in supply chains, such as how a factory sures such as event severity and frequency of
shutdown leads to delayed shipments. This aligns occurrence.
with the logical constraints and argument corefer- The impact score is then calculated as:
ence to maintain event relationships modeled by
GCNs: ImpactScore(ei ) = Centrality(ei )+Magnitude(ei )
(20)
LogicCoref(PC ) → PF (18)
where Centrality(ei ) reflects the event’s impor-
tance within the network, and Magnitude(ei ) quan-
F.5 Coreference Resolution tifies the event’s impact intensity.
We perform both within-document and cross-
F.7 Logical Constraints and Argument
document coreference resolution using models fine-
Coreference
tuned on datasets like OntoNotes 5.0 (Pradhan
et al., 2013). To ensure the robustness of our event extraction
pipeline, we apply logical constraints and argument
The resolution process involves: coreference resolution.
1. Entity Clustering: Entity and event coref- This involves multiple steps to refine the extracted
erence clusters are identified and linked to event parameters and ensure logical consistency:
ensure consistency across documents. This
Logical Constraints Application:
helps in tracking the same entities and events
mentioned in different parts of the text. 1. Defining Logical Rules: We define a set of
logical rules to maintain consistency within
2. Cross-Document Linking: Linking entities
the extracted events. These rules include:
and events across multiple documents ensures
that all references to a specific factory, sup- • Temporal constraints: An event must oc-
plier, or shipment are recognized as the same cur before another if there is a chrono-
entity. logical dependency.
• Causal relationships: If Event A causes that all logical rules and coreference chains
Event B, then Event A must be identified are satisfied. This step is crucial for main-
as a precursor to Event B. taining the integrity of the event extraction
pipeline.
2. Implementation: The defined logical rules
are implemented using a logic-based reason- 3. Feedback Loop: A continuous feedback loop
ing system that checks for any violations and is established where the output is reviewed
rectifies them. For instance, if an event is and refined based on new data and expert feed-
detected as occurring before its cause, the sys- back. This iterative process helps in improv-
tem flags this inconsistency and corrects the ing the model’s performance over time.
sequence.
By applying these detailed logical constraints and
Argument Coreference Resolution: advanced coreference resolution techniques, we
ensure that the event extraction pipeline produces
1. Coreference Detection: We identify coref-
high-quality, reliable, and contextually accurate
erences within and across documents. This
event data, which is essential for robust supply
involves detecting instances where different
chain disruption analysis.
expressions refer to the same entity or event.
• Within-Document Coreference: Ensures G Details of Event Matching &
that all mentions of an entity within a Instantiation
single document are linked. Event matching and instantiation involve aligning
• Cross-Document Coreference: Links a schema from the schema library with events ex-
mentions of the same entity or event tracted by the schema extraction component, specif-
across multiple documents to ensure ically for predicting supply chain disruptions. This
global consistency. process begins by instantiating one of the Eschema
from the integrated library or selecting the ex-
2. Refinement Process: tracted event Eext that best matches the schema
• Cluster Formation: Entities and events event Eschema . Subsequently, the task entails match-
identified as coreferent are grouped into ing events in the Eschema with their correspond-
clusters. ing events in Eext extracted from the news dataset.
Events in both Eext and Eschema are organized in
• Coreference Chains: We create chains a highly structured manner, with parent events
of coreferent mentions, which are used divided into child events. Events also contain
to refine event parameters and ensure temporal information, indicating that some events
that all related mentions are consistently must precede others. Logical relationships are also
linked. defined: AND-gates connect all necessary child
• Manual Verification: After automatic events for a parent event, OR-gates connect one or
coreference resolution, manual verifica- more needed child events, and XOR-gates indicate
tion is performed by domain experts to that only one child event can be present.
ensure accuracy and address any ambi- For example, a document about a raw material
guities. shortage in the EV battery supply chain might align
Combining Logical Constraints & Coreference: with a "Supply Chain Disruption" schema in the
schema library. Following the instantiation, a "no-
1. Integration: The logical constraints and tify suppliers" event in the schema might match
coreference resolution processes are inte- with a graph G event describing a notification sent
grated to produce a coherent and logically to cobalt suppliers. The "suppliers" participant of
consistent set of event parameters: the schema event might match with the "cobalt
suppliers" participant of the extracted event.
LogicCoref(PC ) → PF (21)
G.1 Matching Process & Techniques
2. Validation: The final set of event parameters Our approach to event matching and instantiation
PF undergoes a validation process to ensure involves several key steps and techniques to en-
sure accurate alignment between schema events in the schema to the "cobalt suppliers" entity in the
and extracted events. This is particularly critical extracted event:
in the context of predicting supply chain disrup-
tions, where precise event matching can provide Instantiate(Ematched , Sschema ) → Einst (24)
actionable insights.
where Einst is the instantiated event enriched with
G.1.1 Similarity Calculation attributes from the schema.
To determine the similarity between schema events
G.1.3 Consistency Checks
and extracted events, we calculate a similarity score
based on semantic and structural similarities. Se- After matching events, we perform consistency
mantic similarity (SemSim) is computed using sen- checks to ensure that the instantiated schema ad-
tence transformers to encode the semantic content heres to logical and temporal constraints. This
of events. Structural similarity (StrSim) takes into includes verifying that:
account the hierarchical and temporal relationships • All necessary child events are present (AND-
between events. gates).
Semantic Similarity: We use a sentence trans- • At least one required child event is present
former model to encode events into semantic vec- (OR-gates).
tors. The cosine similarity between these vectors
provides a measure of how semantically similar • Only one of the mutually exclusive child
two events are: events is present (XOR-gates).

vext · vschema These checks ensure that the instantiated schema is


SemSim(Eext , Eschema ) = (22) logically coherent and temporally consistent:
∥vext ∥∥vschema ∥
where vext and vschema are BERT embeddings of ConsistencyCheck(Einst , Sschema ) (25)
extracted and schema events.
Structural Similarity: We consider the context G.2 Continuous Improvement
of events within their respective hierarchies. For To enhance the accuracy and robustness of our
example, an event’s predecessors and successors, matching and instantiation process, we incorporate
its parent event, and its child events all contribute to continuous improvement through manual review
its structural context. Events with similar structures and feedback from domain experts. This involves:
in both schema and extracted graphs are more likely
• Validating the instantiated events with domain
to match:
experts to ensure they accurately reflect real-
|Pext ∩ Pschema | world scenarios.
StrSim(Eext , Eschema ) = (23)
|Pext ∪ Pschema |
• Refining our models based on feedback, ad-
where Pext and Pschema are the parameter sets for justing similarity metrics, and improving our
the extracted and schema events. semantic and structural encoding techniques.

G.1.2 Event Matching • Iteratively updating our schema library and


Once the similarity scores are calculated, we match extraction models to incorporate new insights
each extracted event Eext with the schema event and improve performance.
Eschema that has the highest similarity score. This By leveraging domain expertise and feedback, we
involves instantiating the schema event with infor- continually refine our event matching and instan-
mation from the extracted event, ensuring that all tiation process, ensuring it remains effective and
relevant details and relationships are preserved. relevant for predicting and analyzing supply chain
Example: Consider a schema event "notify suppli- disruptions.
ers" in the context of a raw material shortage. An
H Details of Disruption Prediction
extracted event describing an email notification to
cobalt suppliers would match this schema event if Given an instantiated event graph Ginst = (N, E),
the similarity score is high. The instantiation pro- where N represents event nodes (e.g., specific sup-
cess involves mapping the "suppliers" participant ply chain activities) and E denotes event-event
Algorithm 3 Event Matching and Instantiation • Node Scoring: Using the learned representa-
1: Input: Extracted events Eext , schema library events tions, the GCN scores and selects unmatched
Eschema events in the instantiated graph.
2: Output: Instantiated events Einst
3: Calculate Similarity ▷ Compute similarities • Prediction Output: The first-stage predic-
4: for each Eext in Eext do
5: for each Eschema in Eschema do tion output consists of the most likely missing
6: Sim(Eext , Eschema ) ← α·SemSim(Eext , Eschema )+ events.
β · StrSim(Eext , Eschema )
7: end for H.2 Constrained Prediction
8: end for
9: Match Events ▷ Match extracted events to schema events This stage applies logical constraints and hierarchi-
10: for each Eext in Eext do cal relations to refine the initial predictions from
11: Ematched ← arg max Sim(Eext , Eschema )
Eschema the schema-guided prediction stage. Key steps in-
12: Einst ← Instantiate(Ematched , Sschema ) clude:
13: Perform ConsistencyCheck(Einst , Sschema )
14: end for • Logical Constraints: We refine initial predic-
15: Continuous Improvement ▷ Manual review and
feedback tions (ŷ) to produce final predictions (ŷ ′ ) that
16: for each Einst do adhere to known rules:
17: UpdatedModels ← ValidateRefine(Einst )
18: end for ŷ ′ = arg min Constrain(ŷ)
19: Return: Instantiated events Einst ŷ ′ ∈Y (27)
subject to C(ŷ ′ ) = true
temporal links (e.g., dependencies or sequences
of activities), the goal is to classify whether un- where C represents constraint sets. For ex-
matched schema events (nodes) could potentially ample, a constraint might ensure that a major
occur within this graph. supplier’s disruption increases risk for depen-
dent manufacturers.
Formally, let I be the set of matched schema events
within the graph. The task involves classifying • Hierarchical Relationships:
each node in the remaining schema event nodes, – Child-to-Parent Propagation: If a child
represented by N \ I, as a missing event (positive event node is predicted or matched, its
or negative) given the instantiated graph. parent node is also predicted.
To address the limitations of existing methods, we – AND-Siblings Propagation: If a pre-
developed a novel approach that leverages the struc- dicted node has AND-sibling nodes, all
tural information within the schema graph and in- its siblings are also predicted.
corporates logic gates and hierarchies. Our ap-
proach consists of three stages: (1) schema-guided – Iterative Refinement: The constrained
prediction, (2) constrained prediction, and (3) argu- prediction approach is applied iteratively
ment coreference. until no further nodes can be predicted.

H.1 Schema-Guided Prediction H.3 Argument Coreference

In this stage, we utilize a trained graph neural net- In this phase, we utilize coreference entity links
work specifically designed for schema graphs to and instantiated entities to generate predictions for
score and select unmatched events in the instanti- the arguments associated with the predicted events.
ated graph. Key steps include: Key steps include:

• Graph Neural Network: A GCN is trained • Coreference Links: Coreference entity links
on schema graphs to learn representations of specified in the schema are used to ensure
nodes and edges. The propagation rule is consistency among entity mentions:
given by: Rij = arg, max Coref(Ei , Ej )
(l+1) (l) (l) Ei ,Ej ∈E (28)
H = σ(AH W ) (26)
subject to Coref(Ei , Ej ) = true
where H(l) is the hidden state at layer l, A is
the adjacency matrix, W(l) is the weight ma- where (Ei , Ej ) denotes each event pair and
trix, and σ is a non-linear activation function. Rij represents their relation.
• Instantiated Entities: Instantiated entities strates a logical hierarchical structure, while the
from the previous stages are leveraged to gen- schema produced from news reports presents a
erate arguments for predicted events. well-defined temporal sequence. Experts manu-
ally review the schemas to verify these attributes,
• Final Output: This stage produces the final
providing qualitative feedback on the logical co-
prediction output, including both events and
herence and comprehensiveness of the extracted
their arguments.
structures. Each schema is rated on a scale from 1
to 5, where 1 indicates poor quality and 5 indicates
I Experiment Details
excellent quality.
I.1 Experiment Setup
Objective Disruption Detection. We compare the
In Experiment 5.1, we evaluate the efficacy of three instantiated schemas learned by our system with
distinct Large Language Models (LLMs) in ex- manually annotated ground truth to assess the de-
tracting hierarchical structures from our schema gree of overlap. This comparison uses an evalu-
learning dataset. Leveraging domain expert knowl- ation metric similar to Smatch (Cai and Knight,
edge, we annotate individual schemas for each 2013), which involves breaking down both our
article in our academic corpus using our propri- schema and the ground truth into quadruples of the
etary system viewer and editor. We then employ form relation(event1, event2, importance). For in-
the methodology outlined in Appx. D to synthesize stance, the event of Raw Material Mining includes
these schemas into an integrated library. This com- the subevent of Lithium Mining with an associ-
bination of individual schemas and the integrated ated importance value, represented by the quadru-
library serves as the ground truth for our hierar- ple subevent(raw material mining, lithium mining,
chical information extraction phase. Our schema importance). Other relations include participants,
learning performance evaluation consists of two gates, sequential events, etc.
key components. First, we compare the hierarchical
information extracted by the three LLMs against To evaluate the results, we 1) map the events in
our established ground truth. Second, we assess the learned schema Sl to those in the ground-truth
the consistency, accuracy, and completeness of the schema Sgt , 2) establish a one-to-one mapping of
hierarchical structures derived from the textual con- quadruples between the learned schema Sl and the
tent of each article in the schema learning dataset, ground-truth schema Sgt , 3) calculate Precision,
with domain experts actively participating in this Recall, and F-score as follows:
evaluation process. number of matched quadruples in Sl
Precision =
In Experiment 5.2, we apply a similar annotation total quadruples in Sl
(29)
methodology to our news dataset as used for the
schema learning dataset. However, annotating the
news dataset presents unique challenges, as news number of matched quadruples in Sl
reports typically do not explicitly elucidate the con- Recall =
total quadruples in Sgt
nections between events. Instead, they often em- (30)
ploy speculative language to describe event interre-
lations. To ensure annotation accuracy, we heavily
Precision · Recall
rely on domain knowledge derived from scenario F-score = 2 · (31)
Precision + Recall
documents throughout the annotation process. Sub-
sequently, we utilize the ground truth extracted J Disruption Prediction Case Studies
from these reports to evaluate our system’s perfor-
mance in predicting news report outcomes. J.1 Case 1: Impact of the Inflation Reduction
Act in August 2022
I.2 Evaluation Metrics In August 2022, the United States passed the Infla-
Subjective Schema Learning. For subjective tion Reduction Act, which included significant in-
schema evaluation, we ensure that the event centives for domestic EV battery production. This
schemas generated from each paper and news re- led to a rapid increase in investments but also high-
port are consistent, accurate, and complete. The lighted potential material shortages, causing disrup-
schema derived from academic papers demon- tions in the EV battery supply chain.
System Prediction: Our system predicted the pos- analyzing news reports on strike activities and up-
sibility of short-term material shortages by analyz- dates on government regulations in the DRC. It
ing the market response data to the Inflation Re- also assessed historical data on cobalt supply and
duction Act, monitoring global distribution reports demand to identify vulnerabilities and integrated
of EV battery materials, and assessing the impact expert feedback on the impact of labor strikes and
of increased domestic production incentives on the regulatory changes on cobalt production.
supply and demand balance. Outcome: The system provided early warnings
Outcome: The system identified the risks posed by about the potential disruptions, enabling compa-
the sudden increase in demand for battery materials, nies to adjust their supply chain strategies. This
providing early warnings to stakeholders. This included diversifying sources of cobalt and increas-
allowed them to take proactive measures such as ing inventories to buffer against supply shortages.
securing long-term supply contracts and exploring
alternative materials to mitigate potential shortages. K SHIELD’s User Interface

J.2 Case 2: Lithium Supply Chain Disruption The SHIELD user interface is designed to be in-
in Early 2023 tuitive and user-friendly, facilitating the efficient
upload and analysis of news reports.
In early 2023, significant disruptions in the lithium
supply chain were caused by escalating geopolitical News Report Upload. On the right side of the in-
tensions between Australia and China. As Australia terface, users can upload their collected news report
is one of the world’s largest suppliers of lithium, texts. It includes a text box for input and a submis-
political factors heavily influenced its export poli- sion button to upload the report (see Fig. 11a). Key
cies, severely impacting the global supply chain for features include:
EV batteries, which rely heavily on lithium. • Upload Box: Allows users to paste or type
System Prediction: Our system accurately pre- their news report texts.
dicted the potential supply disruption by analyzing • Submit Button: Initiates the analysis process
various news reports on geopolitical developments once the report is uploaded.
and export data. The system monitored news re-
• Uploaded Reports List: Displays previously
lated to geopolitical tensions between Australia and
uploaded news reports, enabling users to re-
China, analyzed export data indicating changes in
view and compare past submissions easily.
Australia’s lithium export policies, and integrated
insights from scenario documents highlighting the Disruption Analysis Results. After submitting a
dependence of the EV battery supply chain on Aus- news report, users can view the real-time results of
tralian lithium exports. the disruption analysis on the left side of the inter-
Outcome: The system flagged the risk of Aus- face (see Fig. 11b). The comprehensive overview
tralia’s export restrictions to China, providing early of the analysis include:
warnings of potential disruptions in the EV bat- • Generated Schema: Displays the hierarchi-
tery supply chain. This allowed stakeholders to cal of events identified in the news report.
proactively seek alternative sources and mitigate
• Events List: Lists all detected events and
the impact on production.
their details, allowing to see which events
J.3 Case 3: Nickel and Cobalt Supply Issues were identified and how they are connected.
in March 2023 • Evaluation Score: Shows the real-time eval-
uation score, assessed against the schema li-
In March 2023, a major disruption in the global
brary for accuracy and completeness.
supply chain occurred due to large-scale worker
strikes and regulatory changes in the Democratic • Schema Editing: Allows to edit the gener-
Republic of Congo (DRC), a primary supplier of ated schema. Users can make changes to the
cobalt. Cobalt is crucial for EV batteries, and the structure, relationships, and details of events.
disruption had a significant negative impact on the • Regenerate Evaluation: Users can choose to
global supply chain. regenerate the evaluation score based on the
System Prediction: Our system successfully fore- edited schema, ensuring that the modifications
casted the potential supply chain interruptions by are reflected in the updated score.
L Author Contributions Frontend Interface:

Schema Learning Dataset: • Aike Shi and Yifei Dong: Provided the design.

• Yuzhi Hu: Created the scenario document. • Wei Liu: Optimized the design.

• Yifei Dong: Collected paper lists and Wiki- • Aike Shi: Implemented the interface.
data lists. Paper:
• Aike Shi: Collected Wikipedia lists and • Yifei Dong: Contributed to schema learning
weekly report lists. parts.
• Yifei Dong, Wei Liu, and Aike Shi: Extracted • Yuzhi Hu: Contributed to the news dataset
paragraphs from collected articles. and schema dataset parts and created and opti-
mized Figs. 5, 6, 7, and 8.
Supply Chain News Dataset:
• Wei Liu, Yifei Dong, and Aike Shi: Con-
• Yuzhi Hu: Conducted data crawling and clas- tributed to Figs. 1, 2, 3, 4, 9, 10, and 11.
sification.
– Yifei Dong and Wei Liu designed all fig-
• Yuzhi Hu, Yifei Dong, Wei Liu, and Aike Shi: ures.
Labeled ground truth for the news dataset.
– Wei Liu created and optimized all fig-
Schema Learning: ures.
• Yifei Dong and Aike Shi: Designed and mod- – Aike Shi designed the user interface ele-
ified prompts for generating structured infor- ments for Figs. 4, 10, and 11.
mation, designed the format for structured in-
• Jason O’Connor: Contributed to paper revi-
formation, wrote scripts for converting struc-
sions and suggestions.
tured information to SDF, and for generating
structured information using Llama3. • Zhi-Qi Cheng: Organized and rewrote the en-
tire paper, and supervised the modification of
• Yifei Dong, Aike Shi, Wei Liu, and Yuzhi all figures.
Hu: Generated structured information using
GPT-4o with zero-shot learning. Presentation and Project Webpage:

Human Curation: • Aike Shi and Zhi-Qi Cheng: Presented the


paper.
• Yifei Dong, Wei Liu, Aike Shi, and Yuzhi Hu:
Curated LLM-generated SDFs. • Yifei Dong: Built the project website.
• Aike Shi: Created the video demo.
• Yifei Dong and Aike Shi: Wrote scripts for
schema merging. • Wei Liu and Yuzhi Hu: Provided materials for
the project website.
System Construction:
Feedback and Guidance:
• Zhi-Qi Cheng: Provided guidance for system
implementation, designed the system proto- • Jason O’Connor: Provided feedback and
type, and performed system implementation. project guidance from a supply chain expert
perspective.
• Yifei Dong and Aike Shi: Performed system
debugging and testing. • Zhi-Qi Cheng, Kate S. Whitefoot, and Alexan-
der G. Hauptmann: Provided guidance and
Evaluation: supervision for the entire project.
• Wei Liu and Yifei Dong: Designed evaluation
metrics.
• Aike Shi and Yifei Dong: Wrote evaluation
scripts.
(a) News report upload section of the user interface.

(b) Visualization and editing of the final prediction results.

Figure 11: User interface for the disruption prediction analysis in SHIELD.

You might also like