Making AI Less Thirsty': Uncovering and Addressing The Secret Water Footprint of AI Models
Making AI Less Thirsty': Uncovering and Addressing The Secret Water Footprint of AI Models
DOI:10.1145/ 3724499
a major roadblock to sustainability
Uncovering and addressing the secret water and create social conflicts, as
freshwater resources suitable for
footprint of AI models. human use are extremely limited
and unevenly distributed.
BY PENGFEI LI, JIANYI YANG, MOHAMMAD A. ISLAM, As acknowledged in Google's
AND SHAOLEI REN sustainability report9 and the recent
Making
U.S. datacenter energy report,25
the expansion of AI products and
services is a key driver of the rapid
increase in datacenter water con-
sumption. Even excluding the water
usage in leased third-party colo-
AI Less
cation facilities, one technology
company's self-owned datacenters
alone directly withdrew 29 billion
liters and consumed (that is, evapo-
rated) more than 23 billion liters of
freshwater for onsite cooling in 2023,
‘Thirsty’
nearly 80% of which was potable
water.9,a This amount of annual water
consumption even rivals that of a
major household-name beverage
company.21 Importantly, the compa-
ny's datacenter water consumption
increased by ∼ 20% from 2021 to 2022
and by ∼ 17% from 2022 to 2023,9 and
another technology company's data-
center water consumption saw ∼ 34%
and ∼22% increases over the same
periods, respectively.18 Furthermore,
according to the recent U.S. datacen-
ter energy report, the total annual
onsite water consumption by U.S.
has enabled remarkable
A R T I F ICI A L I N T EL L IGENCE (A I)
datacenters in 2028 could double or
breakthroughs in numerous areas of critical even quadruple the 2023 level, reach-
importance, including tackling global challenges such ing approximately 150–280 billion
liters and further stressing the water
as climate change. On the other hand, many AI models, infrastructures.25
IMAGE BY AND RIJ BORYS ASSOCIAT ES WITH ASSISTANCE OF SH UT TERSTOCK.A I
especially large generative ones like GPT-4, are trained AI represents the fastest-expand-
ing workloads in datacenters.9,25 For
and deployed on energy-hungry servers in warehouse- example, a recent study suggests that
scale datacenters, accelerating the datacenter energy global AI could consume 85–134TWh
consumption at an unprecedented rate.25 As a result, of electricity in 2027,6 whereas a
more aggressive projection by the
AI's carbon footprint has been undergoing scrutiny, recent U.S. datacenter energy report
driving the recent progress in AI carbon efficiency.24,31 predicts that AI servers' electricity
consumption in the U.S. alone will
However, AI's water footprint—many millions of liters surpass 150–300TWh in 2028.25 Even
of freshwater consumed for cooling the servers and for
electricity generation—has largely remained under the a The detailed difference between water with-
radar and keeps escalating. If not properly addressed, drawal and water consumption is presented
in the section Water Withdrawal versus Water
AI's water footprint can potentially become Consumption.
JU LY 2 0 2 5 | VO L. 6 8 | N O. 7 | C OM M U N IC AT ION S OF T HE ACM 55
sustainability
considering the lower estimate, the structure. The urgency can also be
combined scope-1 and scope-2 water reflected in part by the recent com-
withdrawal of global AI is projected mitment to “Water Positive by 2030”
to reach 4.2–6.6 billion cubic meters from industry leaders, including
in 2027, which is more than the total Google9 and Microsoft,18 and by the
annual water withdrawal of four to inclusion of water footprint as a key
six Denmarks or half the U.K.b Simul- metric into the world's first interna-
taneously, a total of 0.38–0.60 billion tional standard on sustainable AI to
cubic meters of water will be evapo- be published by the ISO/IEC.19
rated and considered “consumption” In this article, we advocate for
The combined due to the global AI demand in 2027. a holistic approach to sustainable
scope-1 and scope-2 Moreover, these global estimates will
be exceeded by the total water with-
AI that extends beyond the carbon
footprint to also address the water
water withdrawal drawal and consumption attributed footprint. Specifically, we present a
agricultural, industrial, or municipal Figure 1. Example of a datacenter’s operational water usage: onsite scope-1 water usage
uses (normally excluding water used for datacenter cooling (via cooling towers in the example), and offsite scope-2 water us-
for hydroelectricity generation).22 As age for electricity generation. The icons for AI models are only for illustration purposes.
water is a finite shared resource, wa-
ter withdrawal indicates the level of
Heat
competition as well as dependence Scope-1 Water Exchanger
on water resources among different
sectors. Cooling
˲ Water consumption is defined Tower
as “water withdrawal minus water
Scope-2 Water Warm
discharge”; it refers to the amount Water
of water “evaporated, transpired,
Cooling
incorporated into products or crops, Tower Chilled
Water
or otherwise removed from the im- Power
mediate water environment.”13 Water Plant
consumption reflects the impact on
downstream water availability and is
crucial for assessing watershed-level Water ChatGPT AlphaGO
scarcity.22 Source
JU LY 2 0 2 5 | VO L. 6 8 | N O. 7 | C OM M U N IC AT ION S OF T HE ACM 57
sustainability
tially exacerbating the overall stress is blue water extracted from riv-
on water resources due to higher ers, lakes, or groundwater, which is
scope-2 water consumption. directly accessible for human use but
Scope-2 water usage. In many often more limited in availability.
countries, thermoelectric power
is among the top sectors in terms Estimating AI's Water Footprint
of water withdrawal and water We present a general methodology
consumption.23 Thus, similarly to for estimating AI's water consump-
scope-2 carbon emissions, data- tion footprint. To obtain the water
centers are accountable for off-site withdrawal footprint, we simply
Just as addressing scope-2 water usage associated with replace the WUE with water with-
scope-2 carbon electricity consumption, which
forms part of the “true water cost
drawal efficiency.
Operational water footprint. We
emissions is of datacenters,” as highlighted by collectively refer to onsite scope-1
Table 1. Estimate of GPT-3’s operational water consumption footprint. “*” denotes datacenters under construction as of July 2023,
whose PUE and WUE are projected by Microsoft.
and the datacenter hosting the AI popular online service, is a large number of Microsoft's datacenters
model has a power usage effective- language model (LLM) based on are located in the U.S., where the av-
ness (PUE) of θt that accounts for the subsequent versions of GPT-3. We erage EWIF provided by Reig et al.23
non-IT energy overhead. Then, the present a case study to estimate the is 3.14L/kWh and significantly lower
total operational water footprint of operational water consumption for than 4.35L/kWh noted by the recent
the AI model can be written as W ater- the full GPT-3 model with 175 billion report.25 The specific location for
Operational = ∑ t=1 T
et · [ρs1,t
+ θ t · ρ s2,t]. parameters.4 We exclude embodied training GPT-3 is not public. Thus,
Embodied water footprint. Simi- water footprint due to the lack of we consider Microsoft's different
lar to accounting for the embodied public data for scope-3 water us- datacenter locations while excluding
carbon footprint,12 the total scope-3 age. We choose GPT-3, as Microsoft Singapore and Taiwan, as EWIF data
water footprint is amortized over publishes its location-wise WUE and for these regions is not available in
the lifespan of a server. Specifically, PUE.16,17 The results are summarized Reig et al.23
if W
represents the total water used in Table 1. Inference. As a representative us-
to manufacture the AI servers and Training. GPT-3 was trained and age scenario for an LLM, we consider
the servers are expected to operate deployed by OpenAI in Microsoft's a conversation task, which typically
for a period of T 0, then the embodied datacenters, with an estimated train- includes a CPU-intensive prompt
water footprint over a period of T is ing energy of 1287MWh.20 In line phase that processes user input (also
calculated as W aterEmbodied = T_ 0
T
·W
with the practice of estimating the known as prompt) and a memory-
By adding the operational and carbon footprint, we use the most intensive token phase that produces
embodied water footprints, we recent annualized average onsite outputs.30 More specifically, we
obtain the total water footprint as PUE and WUE for each location, as consider a medium-sized request,
WaterTotal = ∑ t=1 T
et · [ρs1,t + θ t · ρ s2,t] reported by Microsoft.16,17 For power each with approximately ≤800 words
_
+ T . In practice, to obtain a rough
T ·W
plant water efficiency, different refer- of input and 150–300 words of out-
0
estimate, we can use the average val- ences may provide different esti- put.30 The official estimate indicates
ues for the annualized WUE and the mates of EWIF. Thus, for consistency that GPT-3 consumes an order of
estimated AI server energy consump- across regions, we use the EWIF 0.4kWh of electricity to generate
tion. provided by Reig et al.23 to estimate 100 pages of content, equivalent to
Case Study: Estimating GPT-3's scope-2 water consumption, as it roughly 0.004kWh per page.4 While
operational water-consumption employs the same methodology for no details are provided, the estimate
footprint. The core of ChatGPT, a calculating EWIF. Moreover, a large likely considers only the GPU energy
JU LY 2 0 2 5 | VO L. 6 8 | N O. 7 | C OM M U N IC AT ION S OF T HE ACM 59
sustainability
Figure 2. (a) The U.S. eGRID-level scope-2 water consumption intensity factor vs. carbon emission rate.23,33 The dashed line represents a
linear regression model, showing that the eGRID-level scope-2 carbon emission and water consumption efficiencies are not aligned. (b)
A five-day snapshot of scope-2 carbon emission rate and water consumption intensity in Virginia, starting from Apr. 4, 2022. The values
are calculated based on the fuel mixes, carbon emission rate and water consumption intensity for each fuel type.23,32,33 The scope-2
carbon and water efficiencies only have a weak Pearson correlation coefficient of 0.06 in Virginia. (c) A five-day snapshot of energy fuel
mixes serving Virginia, starting from Apr. 4, 2022.32
Carbon (kg/kWh)
0.35
Water (L/kWh)
Water (L/kWh)
12
Percentage
2.4 40
NYUP NWPP
8 0.30
AZNM 2.1 20
4 0.25
MROE 1.8
HIOA 0
0 1.5 0.20
0.1 0.3 0.5 0.7 0.9 MON TUE WED THU FRI MON TUE WED THU FRI
Carbon (kg/kWh)
(a) Carbon/water efficiency (b) Hourly carbon/water efficiency (c) Hourly energy fuel mixes
used during token generation. To tive and significantly lower than the sustainable AI. As an initial step to
account for both the prompt phase recently reported 4.35 L/kWh.25 raise awareness among end users
and the non-GPU energy consump- While no official information is about the water resource impacts
tion of servers, we assume a per-re- available on resource consumption, of their AI usage, we recommend
quest server energy consumption of some subsequent models such as tracking and reporting AI's water
0.004kWh for our conversation task. GPT-4 could consume substantially consumption in AI model cards and/
The PUE, WUE, and EWIF are the more energy and water than GPT-3 or through cloud dashboards.
same as those used for estimating for processing the same request, Moreover, a comprehensive
the training water consumption. especially under the reasoning understanding and reporting of AI's
Our estimate of inference water mode.3,26 With continued efforts to scope-2 water consumption asso-
consumption for GPT-3 is on the reduce AI's computational demand ciated with electricity generation
conservative side, and the actual and improve overall water efficiency, remain limited. While datacenters
water consumption could be several the water consumption per request have increasingly adopted climate-
times higher. Specifically, when may decrease in the future. However, conscious cooling-system designs
considering service-level objectives the total water consumption is likely to minimize onsite water consump-
(SLOs) for LLM response times in to continue rising due to the grow- tion,9,14,29 these efforts mostly focus
enterprise-grade Nvidia DGX H100 ing demand for AI services and the on scope-1 water usage while largely
systems for conversation tasks, the increasing scale of AI applications.25 overlooking scope-2 impacts. Just as
server-level inference energy con- addressing scope-2 carbon emissions
sumption for a much smaller model Our Recommendations is important for mitigating climate
(for example, Llama-3-70B) is already We provide our recommendations change, it is equally crucial to address
approximately 0.010kWh per medi- to address AI's water footprint from scope-2 water consumption to reduce
um-sized request when using a state- scheduling and policy perspectives, AI's “true water cost,” as noted by the
of-the-art LLM inference solution making future AI more environmen- recent U.S. datacenter energy report.25
and accounting for non-GPU server tally sustainable. To better reflect the true impacts
overhead.30 For the Falcon-180B More transparency and compre- of datacenters on water resources,
model, which is comparable in size hensive reporting. Despite its grow- some tech companies such as Meta
to GPT-3-175B, server-level energy ing importance, AI's water footprint have begun to include scope-2 water
consumption reaches approximately has received relatively less attention. consumption in their sustainability
0.016kWh per medium-sized re- For example, while AI model cards reports.15 We recommend reporting
quest.30 Furthermore, we emphasize routinely include carbon emissions scope-2 water consumption as a stan-
that Microsoft's datacenters already and serve as an important report- dard practice. This approach makes
have some of the lowest onsite WUE ing framework for understanding offsite water consumption visible to
in the industry. If the same model AI's environmental impacts, they AI model developers as well as end us-
is deployed in a third-party coloca- currently omit information on AI's ers and can unlock new opportunities
tion datacenter, the scope-1 direct water consumption. The lack of for demand-side flexibility, alleviating
water consumption may be several transparency may obstruct efforts to the overall strain on water resources.
times higher. Additionally, our EWIF drive innovations that enhance wa- Finally, despite the enormous
for the U.S. (3.14L/kWh) is conserva- ter sustainability and support truly scope-3 supply-chain water foot-
print,2 there is limited data available Conclusion data centers: A case study in hot-arid climates.
Resources, Conservation and Recycling 181 (2022),
for embodied water usage by chip In this article, we uncover AI's water 106194;https://tinyurl.com/289qqpu3
manufacturing. We recommend usage as a critical concern for socially 12. Luccioni, A.S., Viguier, S., and Ligozat, A-L. Estimating
the carbon footprint of BLOOM, a 176B parameter
further research on scope-3 water responsible and environmentally language model. J. Mach. Learn. Res. 24, 1, Article 253
consumption to achieve a compre- sustainable AI. We present a prin- (Mar. 2024), 15.
13. Macknick, J., Newmark, R., Heath, G., and Hallett,
hensive understanding of AI's overall cipled methodology to estimate AI's K.C. A review of operational water consumption
water footprint and to foster corpo- water footprint. Then, using GPT-3 as and withdrawal factors for electricity generating
technologies. NREL Tech. Report: NREL/TP-
rate water stewardship. an example, we show that a large AI 6A20-50900 (2011).
'When' and 'where' matter. Judi- model can consume millions of liters 14. Meta. Sustainability: Water (2023); https://tinyurl.
com/2c65p7u3.
ciously deciding when and where of water for training. We also discuss 15. Meta. Sustainability Report (2024); https://tinyurl.
com/2yagb6vk.
to train a large AI model can sig- that the scope-1 and scope-2 water 16. Microsoft. Microsoft in Your Community; https://
nificantly affect the water footprint. efficiencies vary spatially and tem- tinyurl.com/257z85pr.
17. Microsoft. Microsoft's Sustainability Targets (2023);
Water efficiency exhibits a spatial- porally—judiciously deciding when https://tinyurl.com/2cgo3aff.
temporal diversity—onsite water and where to run a large AI model can 18. Microsoft. Environmental Sustainability Report (2024);
https://tinyurl.com/2eum2dhg.
efficiency changes due to variations significantly cut the water footprint. 19. Naden, C. AI for sustainability. Intern.
of outside weather conditions, and In addition, we recommend increased Electrotechnical Commission e-tech (2023); https://
tinyurl.com/2bj6m6dz.
offsite water efficiency changes due transparency and comprehensive 20. Patterson, D. et al. Carbon Emissions and Large
to variations of the grid's energy fuel reporting of AI's water footprint, and Neural Network Training (2021); arXiv:2104.10350
[cs.LG]
mixes to meet time-varying demands highlight the necessity of holistically 21. PepsiCo. ESG - Water (2023); https://tinyurl.
(Figure 2). Therefore, we can dy- addressing water footprint along com/2d5qfzuk.
22. Reig, P. What's the difference between water use
namically schedule AI training and with carbon footprint to enable truly and water consumption? World Resources Institute
inference in a water-wise manner to sustainable AI. Commentary (2013); https://tinyurl.com/2n9uqrtz.
23. Reig, P., Luo, T., Christensen, E., and Sinistore, J.
cut the water footprint. For example, AI's water footprint can no longer Guidance for calculating water use embedded in
we may schedule AI training at stay under the radar and must be purchased electricity. World Resources Institute
(2020).
midnight and/or in a datacenter with addressed as a priority as part of the 24. Schwartz, R., Dodge, J., Smith, N.A., and Etzioni, O.
Green AI. Commun. ACM 63, 12 (Nov. 2020), 54–63;
better water efficiency. Likewise, collective efforts to combat global 10.1145/3381831
if informed of the real-time water water challenges. 25. Shehabi, A., et al. 2024 United States Data Center
Energy Usage Report. Lawrence Berkeley National
efficiency, some water-conscious Laboratory LBNL-2001637 (Dec. 2024).
users may prefer to use AI inference Acknowledgment 26. Shumba, N. et al. A water efficiency dataset for
African data centers. In Proceedings of NeurIPS
during water-efficient hours and/or The work of Pengfei Li and Shaolei Workshop on Tackling Climate Change with Machine
in water-efficient datacenters, which Ren was supported in part by NSF Learning (2024).
27. Siddik, Md. A.B., Shehabi, A., and Marston, L. The
can reduce AI's water footprint by CCF-2324916. The work of Moham- environmental footprint of data centers in the United
enabling demand-side flexibility. mad A. Islam was supported in part States. Environmental Research Letters 16, 6 (2021),
064017.
'Follow the Sun' or 'Unfollow the by NSF ECCS-2152357 and CCF- 28. Singapore Public Utilities Board. Sectoral Water
Sun.' To cut the carbon footprint, 2324915. Efficiency Benchmark; https://tinyurl.com/2935uhn8
29. Solomon, S. Sustainable by design: Next-generation
it is preferable to “follow the sun” datacenters Consume zero water for cooling (2024);
when solar energy is more abun- https://tinyurl.com/2xmxxxnz
References 30. Stojkovic, J. et al. DynamoLLM: Designing LLM
dant. Nonetheless, to cut the water 1. Ahmed, K., Islam, M.A., Ren, S., and Quan, G. inference clusters for performance and energy
Exploiting temporal diversity of water efficiency to efficiency. In Proceedings of the IEEE Intern. Symp.
footprint, it may be more appealing make data center less “thirsty”. In Proceedings of the on High-Performance Computer Architecture (2025).
to “unfollow the sun” to avoid high- 11th Intern. Conf of Autonomic Computing (2014). 31. Strubell, E., Ganesh, A., and McCallum, A. Energy
2. Apple. Environmental Responsibility Report. 2024; and policy considerations for deep learning in NLP.
temperature hours of a day when https://tinyurl.com/jwg93lu. In Proceedings of the 57th Ann. Meeting of the Assoc.
WUE is high. This conflict can also 3. Banse, A., Rince, S., and Defour, V. EcoLogits for Computational Linguistics (2019), 3645–3650;
Calculator (2024); https://tinyurl.com/2au2xaml. 10.18653/v1/P19-1355
be shown in Figure 2a and 2b, where 4. Brown, T. et al. Language models are few-shot 32. U.S. Energy Information Administration. Open Data;
learners. In Advances in Neural Information
we see misalignment between the Processing Systems 33, LarochelleH. Curran
https://tinyurl.com/2c5qq6wk.
33. U.S. EPA. eGRID Data Explorer; https://tinyurl.com/
scope-2 water consumption intensity Associates, Inc. (2020), 1877-1901; https://tinyurl. yn72page.
com/yqlf8fzk
factor and carbon emission rate: 5. Cohen, E. and Ramaswami, A. The water withdrawal
Minimizing one footprint might footprint of energy supply to cities. J. of Industrial
Pengfei Li is a Ph.D. candidate at the University of
Ecology 18, 1 (2014), 26-39; https://tinyurl.
increase the other footprint. This com/24ffjzox California, Riverside, CA, USA.
observation further corroborates the 6. de Vries, A. The growing energy footprint of artificial Mohammad A. Islam is an assistant professor at the
intelligence. Joule (Oct. 2023); https://tinyurl. University of Texas, Arlington, TX, USA.
previous finding that the environ- com/2cmwv5m3
Jianyi Yang is an assistant professor at the University of
mental impacts of carbon and water 7. Frost, K. and Hua, I. Quantifying spatiotemporal
Houston, TX, USA.
impacts of the interaction of water scarcity and water
footprints are not substitutable.10,25 use by the global semiconductor manufacturing Shaolei Ren is an associate professor at the University of
Therefore, to judiciously achieve a industry. Water Resources and Industry 22 (2019); California, Riverside, CA, USA.
https://tinyurl.com/247zadpz
balance between “follow the sun” 8. Gao, P.X., Curtis, A.R., Wong, B., and Keshav, S. It’s
for carbon efficiency and “unfollow not easy being green. SIGCOMM Comput. Commun.
Rev. (2012).
the sun” for water efficiency, we need 9. Google. Environmental Report. 2024; https://
to reconcile the potential water- sustainability.google/reports/.
10. Islam, M.A. et al. Exploiting spatio-temporal diversity
This work is licensed under a
carbon conflicts by using holistic for water saving in geo-distributed data centers.
Creative Commons Attribution-NoDerivs
IEEE Transactions on Cloud Computing 6, 3 (2018),
approaches that are carbon-efficient 734-746;https://tinyurl.com/2c43smrv
International 4.0 License.
and water-wise. 11. Karimi, Leila et al. Water-energy tradeoffs in © 2025 Copyright held by the owner/author(s).
JU LY 2 0 2 5 | VO L. 6 8 | N O. 7 | C OM M U N IC AT ION S OF T HE ACM 61