Proactive Moderation of Online Discussions
1    INTRODUCTION
Online discussion platforms enable new forms of interaction and knowledge sharing and have be-
come key in supporting online collaboration. However, incivility challenges their benefits, harming
the mental and emotional health of individuals who are exposed to antisocial behavior or targeted
by personal attacks [1, 53], reducing engagement [14], or distracting from the underlying goals of
the discussion [2]. As a result, most online platforms use moderation to keep discussions within
their community guidelines.
   Moderation of public discussion platforms traditionally consists of a process in which human
moderators respond to instances of antisocial behavior they discover—either by manually searching
themselves or through reports from community members. This approach is labor-intensive, making
it difficult to scale to the amount of content generated by modern-day platforms, and can be
practically and emotionally challenging for the human moderators [23, 25]. These challenges have
led to increasing interest in the use of algorithmic tools to (partially) automate the process of
content moderation. Much of the recent work in this direction has focused on applying machine
learning to automatically detect antisocial comments [51, 65], a technique which has gone on to
see use in industry through tools like the Perspective API.1
   While such tools have the potential to alleviate the labor-intensiveness of content moderation,
they do not address one crucial limitation: this entire process is fundamentally reactive, consisting
of responding to antisocial content that has already been posted. Even the best reactive moderation
can only come after damage has already been done—possibly exposing any number of users to
incivility and distracting or preventing productive discussions. Despite the technical attention
that has been given to developing automatic classifiers for antisocial behavior, there has been
surprisingly little work done to understand how well these tools actually fit into a human moderation
workflow [35]: Is locating uncivil comments for the purpose of reactive moderation actually the
sole task where human moderators need algorithmic support? Or are there other aspects of the
moderation workflow where an algorithmic tool can be of help?
   In this work, we start addressing these questions by studying the potential for algorithmic
assistance in a different moderation paradigm—proactive moderation—through a case study of a
collaborative setting: Wikipedia Talk Pages. We focus on a specific proactive moderation strategy
that prior work has identified as needing algorithmic support: identifying and monitoring conver-
sations that are at-risk of devolving into uncivil behavior, with the goal of intervening early to
avoid derailment or to mitigate its effects in a timely manner [39, 59]. We interview moderators to
understand their proactive moderation practices and identify their needs for algorithmic support.
To explore the feasibility of addressing these needs with the technology available today, we build a
prototype tool for assisting proactive moderation and observe how moderators engage with it in
the context of their existing workflow. In our analysis we pay particular attention to the ethical
issues that come to the fore, and discuss their implications for system design and future work.
   Concretely, our case study aims to address the following primary research questions:
   (1) (How) do moderators act proactively to prevent the derailment of discussions into uncivil
        behavior?
   (2) (How) can an algorithmic tool assist moderators in their proactive moderation workflow?
Building on the answers to these questions, we identify concrete next steps towards further
developing such assisting technology so that it can support moderators in their existing proactive
moderation workflow.
Fig. 1. Three types of moderation paradigms exemplified in the context of a conversation between two
Wikipedia editors that eventually derails into a personal attack (orange).
balance will result in different conceptions of one’s role and purpose as a moderator; in interviews,
volunteer moderators have described their work with metaphors that range from the formal (“police”,
“governor”, “manager”) to the informal (“team member”, “facilitator”, “adult in the room”) [57]. This
diversity in attitudes towards moderation naturally leads to a diversity in employed methodology.
While many platforms that use volunteer moderation offer their moderators formal tools similar
to those wielded by their platform-employed professional counterparts, such as the authority
to remove comments [22, 34] or suspend users [12, 47], volunteer moderators often express a
preference for softer, social approaches to keeping the community in line [59]. Examples of such
approaches include publicly modeling good behavior [58], educating users about the rules [6], and
mediating disputes [3].
   Given this wide breadth of moderation practices, researchers have sought to group and categorize
them as a first step towards studying and understanding moderation [26, 57]. In particular, an
emerging line of work has proposed to distinguish different practices based on when they happen
relative to the objectionable action being moderated: some practices are meant to respond to such
behavior after it happens, while others are meant to discourage or prevent such behavior in the first
place [40]. The former group has been referred to as reactive [47, 59] or ex-post [26] moderation, and
the latter as proactive or ex-ante. Additionally, other work has identified a third set of practices in
this space, known as pre-screening moderation [40, 57], which occupy a middle ground by inserting
an extra step of moderator approval in between the creation and public posting of user-created
content. In this work, we adopt the proactive/reactive/pre-screening categorization of moderation
practices in the context of online discussion moderation. In the following sections (2.2-2.4) we
discuss each in the context of online discussion platforms and compare their relative merits. Then,
we survey the existing state of algorithmic approaches to moderation (2.5) and identify a key gap
in the literature that we seek to fill: the lack of algorithmic support for proactive moderation.
   In addition to potentially harming platforms and their users, reactive moderation strategies can
further harm the moderators tasked with regularly viewing and taking action on disturbing content,
exposing them to harms that range from the extreme stress of reacting to hateful comments to threats
to their personal safety through retribution from users in response to a moderation action [4, 16, 64].
We believe these harms compel us to investigate alternative approaches to moderation that may be
able to avoid some of this human suffering.
community rules in high-impact situations, such as when welcoming newcomers [59]. As a further
step from this, recent work has looked at how moderators can model good behavior in their
own interactions, as a way of implicitly signaling to the community what proper behavior looks
like [33, 58].
   One dynamic strategy in particular has attracted increasing interest: moderators actively monitor
ongoing conversations in order to proactively prevent them from derailing into uncivil behavior
or, at least, to be in a position that allows them to mitigate the effects of derailment in a timely
manner [39, 47]. Unlike the more static strategies discussed above, this dynamic and highly-targeted
moderation strategy requires substantial time and effort on the part of moderators and thus
scales poorly. As such, prior work has advocated for offering algorithmic support for this type of
strategy, in the form of “predictive suggestions for threads or discussions that might soon devolve”
that could help “moderators to focus their attention where it is needed” [59]. The present work
aims to develop a system that can provide such algorithmic support and to test its feasibility.
   As with pre-screening moderation, proactively preventing conversations from derailing into
incivility raises ethical concerns around censorship and free speech. We engage with these is-
sues, assess the risk of potential misuses of a tool for assisting proactive moderation—such as
shutting down conversations or blaming participants proactively based only on a prediction of
future incivility—and discuss implications for its design. While these concerns are warranted and
deserving of a meaningful inquiry, we argue that the well-established harms to users, platforms,
and moderators in the popular reactive moderation framework motivate us to explore alternate ap-
proaches to moderation. Because the status quo is not a neutral ethical choice, we must investigate
such alternative approaches, even though doing so requires careful ethical analysis. We return to these
questions about the ethics of proactive moderation in Section 5.3.2.
gap: prior work has argued that an algorithmic tool for proactive moderation, with comparable
scope to tools currently available for reactive and pre-screening moderation, would require tech-
nology that can predictively identify discussions that are about to devolve into antisocial behavior
[59]. Such technology has become available only very recently, with a series of systems now capable
of making such ahead-of-time forecasts [13, 46, 66]. We therefore identify an opportunity to explore
the potential of this predictive technology to provide algorithmic assistance for proactive moderation,
and our current work pursues this promising new direction.
a strong motivation to improve their moderation practices, while the large degree of discretion
granted to Wikipedia moderators gives them the freedom to consider and attempt alternative
strategies.
   It is important to note upfront that the empirical setting for our case study does impose some
limitations on our work. Given the unique structure and culture of Wikipedia, our goal is not to
report findings that generalize to any type of platform, but rather to begin understanding proactive
moderation practices and the potential for algorithmic support in the specific setting of goal-driven
online discussions. In the process, we provide a blueprint that other researchers can follow to begin
understanding proactive moderation in other types of online communities, both for its own sake
and for comparison with this setting.
4     METHODS
In order to begin to understand moderators’ proactive moderation workflow, as well as to
investigate the potential of integrating algorithmic assistance into this workflow, we took
two methodological approaches: (1) we conducted interviews with moderators of public online
discussions, and (2) we developed a prototype tool for assisting proactive moderation. The interviews
explored moderators’ experiences with proactive moderation in general, and also involved putting
our prototype tool in their hands to observe how they may use a proactive moderation tool in
practice and to inform its design. We pursued and iterated on these two approaches simultaneously,
with each analysis informing the other.
4.1     Interviews
Following a rich line of prior literature that uses interviews to pull back the curtain on moderation
practices [8, 16, 27, 59, 64], we conducted semi-structured interviews with nine administrators on
Wikipedia who engage in Talk Page moderation. Each interview was conducted over Zoom and
lasted approximately one hour; we subsequently produced full de-identified transcripts and coded
the data using thematic analysis.
   The interviews had two primary goals. The first half of the interview focused on participants’
current practices: the role of administrators in moderation on Wikipedia, the goals of moderation,
the ways participants moderate proactively, and how they reason about the future of conversations
to inform their proactive interventions. The second half of the interview focused on understanding
the potential of an algorithmic tool for assisting proactive moderation. By having participants
use our prototype tool on real discussions happening live on Wikipedia Talk Pages, we had the
opportunity to observe how it fits into their proactive moderation workflow, to what extent it
addresses their needs, and to get feedback on design and ideal usage of such a tool directly from its
target audience. The generic script of the interviews is included in Appendix A.
   We conducted these interviews with Institutional Review Board (IRB) approval and recruited
participants through snowball sampling: by asking each participant to recommend any other
individuals they know who do discussion moderation on Wikipedia. While our interviews provided
invaluable direct access to moderators and their domain-specific knowledge, this recruitment
procedure imposes a potential limitation on our work: it may bias our findings toward the one
branch of the Wikipedia moderator social graph that our sampling procedure reached.
place on two different public discussion platforms (Wikipedia Talk Pages and Reddit, although for
the purpose of this work, we focus on the former) and is demonstrated in the Video Figure.6,7 We
now delve into the technical details of this tool.
4.2.1 Backend: The CRAFT Architecture. Our prototype tool is powered by a recent Natural Lan-
guage Processing paradigm—conversational forecasting—which can be used to predict the future
occurrence of an event in a conversation based on its current state [46, 66]. Prior work has applied
this paradigm to the task of forecasting future antisocial behavior in online discussions, and found
that algorithmic forecasting models are approaching the level of human intuition on this task, albeit
with lower accuracy [66]. While algorithmic models are not yet at the level of human performance,
they are at a level that is comparable to models already used in existing Wikipedia moderation
tools: the current state-of-the-art model for forecasting future antisocial behavior, CRAFT, has a
reported F1 score of 69.8% [13], which is close to that of some models used in the popular ORES
moderation tool.8 We therefore believe that such models, while imperfect, are mature enough to
be potentially useful to moderators. In light of this, we use the publicly available CRAFT model
trained for forecasting future antisocial behavior on Wikipedia9 to power our prototype tool.
   Formally, given a discussion D, modeled as a sequence of n comments in reply-order, D =
{c_1, c_2, ..., c_n}, CRAFT is trained to output the probability that a specified conversational event—in
this case, antisocial behavior—will occur in the (not-yet-existent) next comment c_{n+1} in D. In other
words, CRAFT(D) = p(isAntisocial(c_{n+1})). CRAFT is an online model, in that it updates its forecast
for each new comment in the discussion. That is, once an actual comment c_{n+1} happens, CRAFT
can read the content of c_{n+1} to produce an updated forecast p(isAntisocial(c_{n+2})), and so on.
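   To make this interface concrete, the following minimal Python sketch illustrates the online forecasting semantics described above. The craft_forecast function and the example comments are hypothetical stand-ins for illustration, not the released model’s actual API.

```python
from typing import List


def craft_forecast(comments: List[str]) -> float:
    """Hypothetical stand-in for the released CRAFT model: given the
    reply-ordered comments c_1..c_n of a discussion D, return
    p(isAntisocial(c_{n+1})), the forecasted probability that the *next*
    comment will contain antisocial behavior."""
    return 0.5  # placeholder value; the real model runs a neural forecaster


# A discussion is modeled as a reply-ordered list of comments.
discussion = [
    "Why did you undo my edit?",
    "The sources you added do not meet the reliability guidelines.",
]

risk = craft_forecast(discussion)  # forecast for the not-yet-written third comment

# Once an actual third comment arrives, the forecast is recomputed over the
# extended prefix, now estimating p(isAntisocial(c_4)).
discussion.append("You clearly did not even read those sources.")
risk = craft_forecast(discussion)
```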
   Since Wikipedia does not provide any special functionality to track live Talk Page conversations—
in fact, Talk Pages are treated equivalently to normal article pages, with no special support for
conversations—we implemented a system that parses and monitors changes in the discussions
happening on a selected set of Wikipedia Talk pages10 in real time. To this end, we use the diff
functionality from the Wikipedia API and use a set of heuristics to determine whether a change
on the Talk Page represents a new comment in a discussion on that page, and which discussion
it belongs to. At regular intervals, the backend pulls the latest updates to every talk page being
tracked, parses the updates to extract the conversations happening on the page, and runs CRAFT on
those conversations to get an updated forecast of the risk of future incivility for each conversation.
The tool also keeps track of how the CRAFT forecast for a discussion has changed over time. That
is, for a (possibly ongoing) discussion D = {c_1, c_2, c_3, ...}, the tool creates and maintains a history
of CRAFT forecasts {CRAFT({c_1}), CRAFT({c_1, c_2}), CRAFT({c_1, c_2, c_3}), ...}.
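   As a rough illustration of this backend logic, the sketch below polls a set of tracked Talk Pages and maintains the per-conversation forecast history described above. The fetch_talk_page_conversations helper is a hypothetical placeholder for our diff-based parsing heuristics, craft_forecast is the stand-in from the previous sketch, and the page list and polling interval are illustrative values rather than those used in our deployment.

```python
import time
from collections import defaultdict
from typing import Dict, List


def craft_forecast(comments: List[str]) -> float:
    return 0.5  # stand-in for the CRAFT model (see the previous sketch)


def fetch_talk_page_conversations(page_title: str) -> Dict[str, List[str]]:
    """Hypothetical helper standing in for our diff-based parsing heuristics:
    returns the conversations currently on a Talk Page, keyed by an identifier
    (e.g., the section heading), each as a reply-ordered list of comments."""
    return {}  # placeholder; a real implementation would query the Wikipedia API


TRACKED_PAGES = ["Talk:Example_article"]  # illustrative watch list
POLL_INTERVAL_SECONDS = 300               # illustrative polling rate

# Per-conversation forecast history: [CRAFT({c_1}), CRAFT({c_1, c_2}), ...]
forecast_history: Dict[str, List[float]] = defaultdict(list)
comments_seen: Dict[str, int] = defaultdict(int)


def poll_once() -> None:
    for page in TRACKED_PAGES:
        for conv_id, comments in fetch_talk_page_conversations(page).items():
            # Run the forecaster once per newly observed comment, so the stored
            # history contains one forecast per prefix of the discussion.
            for n in range(comments_seen[conv_id] + 1, len(comments) + 1):
                forecast_history[conv_id].append(craft_forecast(comments[:n]))
            comments_seen[conv_id] = len(comments)


if __name__ == "__main__":
    while True:
        poll_once()
        time.sleep(POLL_INTERVAL_SECONDS)
```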
4.2.2 Frontend: The Moderator’s Display. Our prototype frontend consists of two sections: a
Ranking View and a Conversation View. The frontend adopts design metaphors used in exist-
ing Wikipedia moderation tools, and comes with a broad range of features and parameters in order
to engage interview participants in a discussion that can inform future design.
   On Wikipedia, assisted editing programs used by moderators often center around the organiza-
tional concept of the work queue, in which content that was algorithmically determined to warrant
human review is presented to moderators in a centralized, prioritized list [21, 29]. This is a proven
6 We use password protection to avoid any potential misuses of this technology.
7 A link to the Video Figure can be found at https://www.cs.cornell.edu/~cristian/Proactive_Moderation.html.
8 https://ores.wikimedia.org/v3/scores/enwiki/?model_info
9 Provided through the ConvoKit package (https://convokit.cornell.edu) [11].
10 For this project, we chose pages that we reasoned were likely to have conflict and need moderation: Barack_Obama,
Fig. 2. The Ranking View of our prototype tool, showing a list of live conversations on Talk Pages, sorted by
their predicted risk of derailing into antisocial behavior.
design metaphor, having been used in empirically-studied moderator tools on Wikipedia [21] and
elsewhere [8]. As such, our prototype tool similarly centers around a Ranking View (Figure 2) as
its main feature. Based on a list of Talk Pages to include, our prototype tool provides a CRAFT score
ranking of all the conversations taking place on any of these pages, sorted in the order of their risk
of derailing in the future. Each conversation is represented as a row in the ranking and is color coded
to indicate its forecasted risk of derailing (shades of red, with darker red corresponding to higher
risk). This reflects the most up-to-date CRAFT forecast for that conversation, computed based on all
the comments posted so far in the conversation, i.e., CRAFT({c_1, c_2, ..., c_n}). Additionally, to make
it easy to identify rapidly escalating situations, each conversation in the ranking is also decorated
with an arrow whose direction and size reflect the direction and magnitude of the most recent change
in the CRAFT forecast, i.e., CRAFT({c_1, c_2, ..., c_n}) - CRAFT({c_1, c_2, ..., c_{n-1}}); for example, a large red
upward-facing arrow signals that tension is rapidly rising in the corresponding conversation.
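   The following minimal sketch shows how the rows of the Ranking View could be derived from the stored forecast histories; the arrow labels and tie-handling are illustrative choices rather than a description of the actual frontend code.

```python
from typing import Dict, List, NamedTuple


class RankingRow(NamedTuple):
    conversation_id: str
    risk: float   # latest forecast, CRAFT({c_1, ..., c_n})
    delta: float  # CRAFT({c_1, ..., c_n}) - CRAFT({c_1, ..., c_{n-1}})
    arrow: str    # direction of the arrow decoration shown in the row


def build_ranking(forecast_history: Dict[str, List[float]]) -> List[RankingRow]:
    rows = []
    for conv_id, history in forecast_history.items():
        if not history:
            continue
        risk = history[-1]
        delta = history[-1] - history[-2] if len(history) > 1 else 0.0
        # Map the sign of the most recent change to an arrow direction; the
        # real tool also scales the arrow's size with the magnitude of delta.
        arrow = "up" if delta > 0 else ("down" if delta < 0 else "flat")
        rows.append(RankingRow(conv_id, risk, delta, arrow))
    # Conversations are listed from highest to lowest forecasted risk.
    return sorted(rows, key=lambda row: row.risk, reverse=True)
```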
   In addition to displaying summary-level information about a conversation, each row of the
Ranking View is a clickable link that leads to the Conversation View (Figure 3), which displays
the entire history of that conversation. The Conversation View presents the text of each comment
in the conversation along with the time it was posted, its author, and the CRAFT score (color coded
as before) at the time that comment was posted, i.e., taking into account the conversation up to and
including that comment. This provides some level of transparency as to why the algorithm placed
the conversation at a certain position in the Ranking View, allowing the moderator to observe
how the predicted risk evolves as a conversation progresses. This design bears similarities to how
algorithmic decisions are presented to moderators in the experimental Reddit reactive moderation
tool Crossmod [8].
Fig. 3. The Conversation View of our prototype tool, showing a conversation with CRAFT scores alongside
each comment. Each score represents the predicted risk of derailment at the time the corresponding comment
was posted (taking into account the entire preceding context).
5     FINDINGS
5.1    Moderator Goals: Content and Environment
To contextualize our discussion, we start by understanding the broad goals moderators have in our
particular domain of Wikipedia Talk Page discussions. In line with the goal-oriented nature of
these discussions, as discussed in Section 3, participants highlighted how maintaining civil and
productive discussions is not the end goal of their moderation. Rather, keeping discussions civil
and functional is a crucial intermediary goal towards their primary goals: maintaining high quality
content on the platform—in this case, encyclopedia articles—and maintaining a good environment
for editors. As P6 puts it:
       P6: [When I find a conversation headed downhill] I would not really care about the
       threads as having the thing go on, I’d care about the article and the environment of
       Wikipedia. I think those are the two things that I care about.
  Participants identify a variety of informal strategies they use to guide conversations which they
assess to be at risk of derailment. For example, moderators will join a discussion as a level-headed
participant in order to refocus the discussion on its original topic. P5 explains their strategy:
       P5: In some of those cases I just engage as an additional participant rather than in
       discussion moderation just in order to just try and aid in those methods by bringing
       the discussion back on to context.
   A similar strategy is to leave just one comment in a discussion to acknowledge a growing
dispute and try to neutralize it before it gets out of hand and irreparably damages the conversation.
Prior work has described this as a moderator acting as a “mediator”, stepping into a conversation
facing rising tensions in order to resolve conflicts between clashing discussion participants [57].
P8 explains their strategy:
       P8: I’ll just leave a comment being like, ‘Hey guys, I think this might be going off topic,’
       and then I’ll give my version of events. So it will be my opinion on it, in a very neutral
       way where I address each of their concerns. If I do it in a very polite way I think that
       typically a third party—especially an admin—does put the conversation back on topic.
A different version of this strategy is to remind users of platform rules when moderators anticipate
they will be violated. Prior work has described this mode as a moderator acting as a “referee”,
working to “resolve disputes by referencing rules or a body of accepted knowledge” [57]. This can
be seen as a more targeted version of automatic reminders, such as those triggered when interacting
with newcomers [31]. P4 explains this strategy as they apply it:
       P4: When we have [discussions in which there seems to be a significant chance of
       undesirable behavior arising], periodically we’ll put up notices like, ‘Hey, remember
       to keep civil, keep your comments about the content of the discussion, not the other
       editors directly.’
These three examples illustrate the wide range in the depth of moderator involvement required for
different proactive interventions. Joining a discussion as a participant to try to bring it back on
track requires contextual knowledge of the conversation topic at hand and continued involvement
in a discussion. Similarly, leaving one comment to address the concerns of discussion participants
requires contextual knowledge of the conversation and topic at hand, but does not require ongoing
engagement. Finally, reminding users of the platform’s policies only requires a prediction of which
policies may be violated, while the reminder itself can take the same form across discussions on
different topics and does not require continued engagement.
   While some participants discuss how their proactive interventions can often bring discussions
back on topic before they derail beyond hope of repair, other participants describe how
proactive interventions can backfire. Even when moderators forgo their formal sanctioning powers
in favor of a softer approach, some users may react negatively to what they perceive as a threat of
future sanctions. This implied threat may alienate users and limit the effectiveness of any proactive
intervention. P1 explains:
       P1: I did [proactive interventions in discussions] much more when I was younger. It
       doesn’t work very well, I think because the idea is if you’re coming in as sort of like
       an uninvolved administrator, [...] the assumed context is that you’re getting ready to
       sanction them, which is never as useful as a friendly reminder. If I personally know one
       of the parties to the dispute, which happens on occasion, I might send them a direct
       email or a direct message, [...] just to try to hear what’s going on. I found it particularly
       ineffective to post on Wiki to cool down, or something.
This highlights one specific challenge moderators face when acting proactively: demonstrating to
users that they genuinely want to help the conversation progress free of antisocial behavior rather
than arriving early in preparation for future sanctions. This corroborates prior work showing how
discussion moderators may shy away from joining discussions despite a desire to do so, because
of their role as a moderator [27]. Thus, executing a successful proactive intervention requires a
nuanced approach that considers the ways a moderator’s actions will be perceived by users.
5.2.2 Benefits of Acting Proactively. In addition to the established drawbacks of reactive moderation—
and the respective benefits of the proactive paradigm—discussed in prior work and elaborated
in Section 2, our interviews reveal an additional issue: reactive interventions struggle to balance
the two goals of moderation we identified above (Section 5.1): high-quality content creation and a
positive interactional environment.
   Since the reactive paradigm only acts after a clear violation of community norms, moderators
can and do impose alienating formal sanctions in such cases. So, when experienced users who make
otherwise valuable content behave antisocially, actions to sanction them—intended to maintain a
healthy environment on the platform—alienate them and hence threaten the further development
of the platform. On the other hand, protecting these antisocial users just because they create good
content can cause disruption to the platform environment and alienate other users. P7 explains
this conundrum:
         P7: [When experienced editors clash,] that’s where we, as administrators, sometimes
         have a very difficult task. We don’t want to block experienced editors because they are
         very useful, very valuable. [...] By the same token, we don’t want disruption. So, we’ve
         walked this very fine line where we try to hold experienced users who are sometimes
         misbehaving accountable without trying to block. It is a very difficult and fine line to
         walk and I think it would be nice if we had some way to better keep people civil, and
         better [...] get people to work together.
   Thus, in the reactive paradigm, antisocial content can threaten moderators’ goals regardless
of whether or not it is addressed—disrupting the environment if it is not sanctioned, or alienating
high-value users if it is. Moreover, within the reactive paradigm, moderators face a significant
challenge in realizing their dual moderation goals in the face of incivility from established users,
a challenge that threatens their emotional health and consumes much of their time. P2 explains:
         P2: [When] someone has been incredibly uncivil to lots and lots of people, but he’s
         also an incredibly influential editor, it is an excruciating process to kind of get through
         the kind of pieces that I need to to try and rein in his incivility. I just have to be patient,
         [because] it’s ongoing and long.
Therefore, addressing incivility from valuable content creators through the reactive paradigm
threatens moderators themselves, in addition to their goals.
  Where reactive moderation faces this dilemma, the proactive paradigm offers a solution. Because
proactive interventions come before any tangible incivility in a conversation, they are better
suited to take a softer and less alienating form, as discussed in Section 5.2.1. This allows moderators
to support a healthy environment by preventing incivility in discussions while avoiding the
drawbacks of reactive strategies. P2 explains their preference for using the proactive paradigm to
address rising tensions in a conversation:
         P2: I did not become an administrator in order to block people. There are definitely
         people that became administrators because that’s what they want to do, they wanted
         to police behavior. I actually spend a fair amount of time policing behavior in terms
       of my overall workload, but like I said, I try to operate in the social sphere and really
       kind of have conversations rather than using that.
While not all moderators share this preference, proactive moderation offers those who do use it a
more nuanced approach, better suited to balancing their multiple moderation goals rather than
appealing directly to one or the other.
5.2.3 Foreseeing Future Derailment. One crucial prerequisite of proactive moderation is identifying
which conversations are at risk of derailing into incivility. We find that moderators on Wikipedia
use their own intuition about the future trajectory of conversations towards this end, considering a
variety of factors to internally reason about the future of the conversations they see. For example:
       Q: Given a civil conversation, do you think it is possible to predict if it will eventually
       derail into uncivil comments?
       P7: Yes. Not always but yes. I would say, certainly with experience, you get a feel for
       it where if a discussion has started off on the wrong foot, maybe someone got [their
       edits] reverted and then they opened, you know, maybe not an uncivil but kind of a
       terse message like, “Hey, why did you undo my edit?,” that’s not uncivil but...It started
       things off on a bit of a wrong foot. I could guess that some of those things might get
       uncivil.
   Moderators use a variety of factors to make predictions about the future of conversations. Five
participants report using direct observations from the conversation, like the conversation content
or tone, to do forecasting. Using these direct features allows moderators to update their predictions
over time as the conversation develops, whenever they check in on the conversation. On the other
hand, the other four participants report forecasting solely based on metadata, including features of
the conversation and of the interlocutors. Salient conversation properties identified by participants
include the ostensible conversation topic (as indicated by the conversation header) and the number
of participants in the conversation. Salient interlocutor metadata include level of experience on
the platform, identity, and usernames. Drawing on their past experiences, participants consider
such features to estimate the risk that a conversation will derail in the future.
that does arise. However, moderators’ ability to identify at-risk conversations to monitor is limited
by the scale of the platform. P9 explains how even within the subset of topics they are interested
in and engage in, their ability to effectively proactively monitor conversations is limited by their
sheer number, which forces them to use only simplistic strategies, such as random discovery, to
identify at-risk conversations to monitor:
       P9: There are too many [potentially at-risk conversations] to proactively monitor. I
       know there’s about 65 or 60 ongoing ones which are certainly always going to be at
       risk. [...] So I usually either wait until I’m asked, or I happen to see something, or I skip
       around and happen to notice something.
   The problem of scale is exacerbated by the inherent difficulty of determining when a conversation
is in need of a proactive intervention. While every participant we interviewed believes there are
some contexts in which they can foresee derailment, as described in Section 5.2.3, there is a wide
range in how broad this context is and how confident participants are in their forecasts. Four
participants believe that they can confidently forecast antisocial behavior in any Wikipedia context,
but four others believe that they can only do so in very specific contexts with low confidence, and
the last participant believes they can only make such forecasts in conversations on a handful of
specific topics among discussion participants they know personally.
   Given that moderators are often uncertain about their forecasts of a conversation’s future
trajectory, they may hesitate to intervene immediately, and instead desire to keep an eye on the
situation to see how it develops. P3 explains:
       P3: From time to time I do see a discussion I think that I want to monitor, and I’m like
       ‘Yeah, I probably should be keeping an eye on this.’ [. . . ] I might leave a tab open on it
       and come back to it just in case.
As P3 goes on to elaborate, however, this idealized notion of monitoring a conversation as it
develops in real time is impractical in reality:
       P3: There are some technical challenges to [monitoring a discussion] just because
       of the way the Wikipedia software works. There isn’t an easy way to say, ‘Give me
       updates for any changes in this discussion.’ And, in fact, you can’t even say, ‘Give me
       an update every time this page is changed,’ which is a perennial source of annoyance.
On the other hand, the resulting gap in attention could cause the moderator to miss out on key
developments in the conversation, and thereby lose an opportunity to intervene. P6 explains this
dilemma:
       P6: I think I am okay at gauging if things are going to go pear-shaped, but do I always
       stick around to even find out if I am not interested in the topic? I may just move on
       and it blows up behind me. The hand grenade has gone off and I didn’t even hear it
       because I’ve gone down the street.
   We therefore find that proactive moderation practices are difficult to scale up manually, both
because of the size of the platform itself and because monitoring conversations—a necessary step
given the uncertainty of moderators’ forecasts—is time-consuming and impractical. Algorithmic
solutions have the potential to address both challenges: by using a forecasting algorithm rather
than random discovery or other limited methods to more efficiently identify conversations that
are at risk, and by automatically monitoring them for relevant changes as opposed to requiring
manual, repeated checks. This can potentially help moderators engage in proactive monitoring at
a larger scale and dedicate more time to addressing potential issues.
   How our prototype tool can address moderators’ needs. Having identified the potential
ways an algorithmic tool could help scale up the process of proactive discussion moderation, we
now investigate the extent to which our prototype tool meets this potential, as well as aspects
in which it might still fall short and thereby provide directions for future work. Concretely, we
analyze moderators’ feedback from their hands-on interaction with our prototype tool, with a
focus on understanding which features moderators found the most useful and why, as well as what
moderators found lacking in the prototype tool.
   Moderators’ feedback on the prototype tool suggests that information presented in the tool’s
Ranking View is helpful in discovering at-risk conversations, although individual moderators
differed in their evaluation of exactly which pieces of information were most useful. For example,
P4 reported that they would mainly use the CRAFT score to decide which conversations were
worth monitoring:
       P4: [For monitoring] I would just pick the ones with the highest score ’cause it seems
       to be somewhat accurate.
Meanwhile, other participants highlighted the score change representation (i.e., the colored arrows)
as providing an easy way to get a sense of when a monitored conversation needs to be further
inspected. P7 reports:
       P7: I like the score change indicator. That is useful. From a cursory glance, it looks like
       if the score is going up, I would inspect it, if the score was going down, maybe it is not
       worth inspecting.
Altogether, five participants described how both the score and the score change representation would
be useful towards discovering these at-risk conversations.
   However, moderators also identified several aspects of conversations that play into their existing
intuitions about whether to monitor a conversation, but are not captured by the prototype tool. In
particular, three participants mentioned wanting to know the ages of conversations, since their
intuition is that if a conversation has been inactive for an extended period of time, it is unlikely to
pick up again at all, much less turn uncivil. P7 expresses this view:
       P7: That is very useful. That is probably all I would really need, too, except for the age
       of the conversation would also be another useful column because if the conversation
       was old, it wouldn’t be worth my time to investigate it anyway but if I see the last
       comment was added within a day or two, I would then check it out. If it was more
       than, like, 2 or 3 days old, I mean, the conversation is probably dead anyway so it is
       not worth my time.
Additionally, three participants wanted to see a summary of the end of the conversation—calling
attention to the way that conversations on a path to incivility often stray far from the ostensible
conversation topic, and how knowing the actual topic of discussion at hand is crucial for moderators
to plan interventions. Both these suggestions could be taken into consideration in a future iteration
of this tool.12 On the other hand, five participants reported wanting to see data about discussion
participants, such as their usernames, age on the platform, or prior activity—features that
could raise moral concerns and whose inclusion should thus be carefully weighed, as we will discuss
in Section 5.3.2.
   The feedback discussed thus far suggests that moderators would find the Ranking View useful
in identifying conversations that might be at risk. However, as discussed above, an important
additional part of the proactive moderation workflow is continuing to monitor such conversations—
such that the “grenade” does not “blow up” behind the moderator, to follow P6’s metaphor. While
12 It should be noted that the prototype tool already includes only “live” conversations and thus probably excludes most, if not
all, “dead” conversations from the ranking, a fact that is not made apparent to the moderators. This issue notwithstanding,
information about the age of conversations could still be useful to moderators even for recently active conversations, so
such a feature is worth considering for future development.
we believe the comment-by-comment information given by the Conversation View could be helpful
for this,13 that would only be the case if this information aligns with how the moderator would
intuitively judge the conversation.
   To assess this, we selected several conversations from different positions in the ranking and
invited the moderators to first examine them raw (i.e., without added information), allowing them
to make intuitive judgments, and then to re-examine them in the Conversation View. Overall,
moderators reported that the displayed per-comment CRAFT scores matched their own initial
intuitive judgments. For instance, while looking at an example conversation predicted to be heading
for future incivility, P2 describes:
         P2: [The escalating comment] definitely took it to a whole new level—and then having
         the third person come in, right? So, I feel like [the conversation view] is backing up
         what intuitively I had said. [...] I feel like that’s very much in line with my experience
         and makes a lot of sense.
  The most notable exception is that some participants disagreed with the final CRAFT score of a
conversation because they thought the conversation was unlikely to continue, and thus in a trivial
sense unlikely to see any future incivility. P8 explains:
         P8: I didn’t think [the last comment has] that high [chance of seeing an antisocial
         reply]. I mean, in most cases this person [...] will rage quit. That’s typically in my
         experience what happened. That’s interesting. I didn’t think it was going to be that
         high [of a score].
While the prototype tool may be correctly capturing the intuition that the conversation is very
tense and likely to see future antisocial behavior, in practice this may result in a user leaving the
platform rather than leaving an antisocial comment in the conversation. Given that editor retention
is also an issue of concern to Wikipedia moderators [14], this finding suggests that a future iteration
of the tool could focus on more outcomes than just antisocial comments, as these other outcomes
are equally part of the proactive moderation workflow.
   Taking the design implications gleaned from this feedback together with the fact that the
conversational forecasting algorithm underlying our prototype tool generally agrees with
moderators’ intuitions, we conclude that it is at least feasible
to support moderators in identifying and monitoring at-risk conversations. However, this conclusion
does not necessarily imply that moderators would accept and use such a tool, nor does it guarantee
that the tool would be used in an ethical way. P3 explains some hesitations:
         P3: I think an algorithm could be a useful indicator for flagging, ‘Hey, this seems like a
         topic or a conversation that might be a problem down the line.’ But on its own I don’t
         think an algorithm could actually be trusted to make the decision. A nice little browser
         plugin that highlights a section for me that says, ‘This discussion looks like it’s getting
         heated, you might want to take a look at it,’ that’s something I would trust. A browser
         plug in telling me or a pile of machine learning telling me, ‘Block this person, they’re
         making everything uncivil wherever they go,’ not as inclined to trust it.
As P3 exemplifies, moderators are rightfully hesitant to put their full faith in an algorithmic tool,
preferring to use such a tool only under their supervision. Therefore, despite the agreement between
state-of-the-art forecasting methods and moderators’ intuitions, these considerations motivate the
13 In addition to just providing an augmented interface to follow the unfolding conversation, in a future iteration of the tool
we can envision additional affordances, such as allowing the moderator to request notifications based on specific CRAFT
thresholds.
need for follow-up work to conduct a large-scale user study that analyzes the performance of the tool
and examines usage patterns in a more systematic way.
5.3.2 Other Needs and Ethical Challenges. Thus far we have shown how our prototype tool addresses
some proactive moderation needs while leaving others unmet. In the latter category, some
gaps could be straightforwardly addressed by future iterations of the tool (e.g., considering the age
of the conversation). Addressing others, however, could have significant ethical implications. We
now consider such cases in more detail.
   In particular, one commonly reported complication in moderating antisocial behavior is how
to properly assign blame in multi-user interactions. P8 points out that oftentimes, instances of
antisocial behavior might arise in response to prior provocation:
       P8: [When considering taking moderation action] I have to read the whole [discussion]
       in order to understand where they’re coming from. I’m looking to see how the other
       person [is] responding to them, because it’s not really fair for me to stick to talk[ing]
       to one person if they’re being egged on. So, I tried to see where it got derailed and how
       to bring it back to a proper discussion.
As P8 highlights, moderators do not want to treat a user being provoked and the user provoking
them in the same way, and thus try to understand the root cause of a derailment in order to
make a useful intervention and bring the discussion back on topic. This need is heightened by the
phenomenon of Civil POV Pushers—users who keep their language within community guidelines
but whose intention is only to push their point of view, often enraging other users. While the
term ‘Civil POV Pusher’ is specific to Wikipedia, similar issues of bad-faith but superficially civil
behavior could arise on any platform [38]. This makes it important for a moderator to know the
history of a conversation before taking action. P3 explains:
       P3: You need to look back: is somebody provoking them into this? Because that’s
       always a possibility. [...] We actually have a term kind of related to that. It’s called civil
       POV pushing, for point of view, which is basically: you are here because you are on a
       mission. You want your opinion to be the correct one, but you are going to follow all
       the rules. You’re not going to swear at people, but you are just going to keep bugging
       people or keep pressing your agenda until everybody else just gets tired of arguing
       with you. [...] In that case it might not be justified that someone cussed them out, but
       it’s more that they were provoked in a way. And so it probably not be as appropriate
       to take such a harsh sanction against the person who flew off the handle.
As P3 highlights, moderators use their discretion when sanctioning incivility if they believe the
uncivil user was provoked by a bad-faith “Civil POV pusher.” This indicates the need to identify
civil POV pushing in discussions, especially when it leads to that discussion derailing.
   In light of the complications introduced by the phenomenon of civil POV pushing, several
moderators we spoke to expressed a desire for a further level of foresight in a proactive moderation
tool: not just identifying which conversations are at risk of derailing, but also why those conver-
sations might derail—even down to the level of exactly which individual comment might trigger
the derailment. Besides tying back to a general desire among moderators to address the root cause
of antisocial behavior, such information could also have the practical benefit of helping identify
civil POV pushers—a task that is currently acknowledged by Wikipedia as being one of the most
challenging and patience-testing ones facing moderators.14
   However, we caution that algorithmic tools, at least in their current state, are not well suited to
supporting moderators in this need. The current forecasting models—including CRAFT,
14 https://en.wikipedia.org/wiki/Wikipedia:Civil_POV_pushing
the one we use in our prototype tool—are based on statistical machine learning methods, which
notably do not explicitly model causality. In addition to being a key technical limitation of such
models, this property can also introduce ethical hazards, as rather than making predictions based
on the actual causal mechanisms, models might make predictions based on correlations arising
from bias in the training data, a problem that has been previously observed in models for antisocial
content detection [54, 62]. This limitation implies that, at least given current technologies, automated
tools cannot be used to determine the root cause of conversational derailment, and by extension
cannot be used to distinguish the provoker from the provoked. Therefore, despite the depth of the
challenge moderators face, algorithmic tools based on the statistical machine learning framework
may not be an appropriate choice to assist moderators with this task given current limitations.
   The biases embedded in any such moderation tool raise further questions about its use. For
example, participants identified how they use information about the participants in a discussion
to inform their perceptions of which conversations are at-risk of future incivility (Section 5.3.1),
and further that this use is a potential point for algorithmic support. However, automating the use
of the identities of discussion participants for identifying at-risk conversations is ethically
questionable. P3 explains:
         P3: I think one thing that actually could be potentially useful for this is, though it
         also gets into some questionable territory is: who is in the discussion. Either just a
         breakdown of the top five contributors to the discussion. Or even, if we want to go into
         more Big Brother territory, [a summary of] how this person’s comments usually score.
The biases likely present within the underlying CRAFT conversational forecasting model, and any
statistical machine learning model, suggest that profiling users based on how such an algorithm typ-
ically scores their comments is problematic, and may facilitate discrimination against certain users.
Moreover, because statistical machine learning algorithms’ errors tend to be most concentrated
around marginalized groups [5, 24], profiling users based on their CRAFT scores over time holds
the potential to reinforce social inequities and further harm those already facing marginalization.
   Therefore, we caution against the deployment of any proactive moderation tools without im-
plementing design features to discourage these fraught uses, in addition to informing moderators
about the abilities and limitations of the tool. As mentioned in Section 5.3.1, these dangers warrant
closer investigation through a user study, to understand how best to inform moderators
about the tool’s limitations and ethical hazards.
6   DISCUSSION
Motivated by the gap between the goals of moderation and the reality of reactive moderation
strategies, this work seeks to deepen our understanding of the proactive moderation paradigm.
Through a case study of moderation on Wikipedia Talk Pages, we uncover the workflow through
which moderators proactively monitor and intervene in discussions they deem to be at risk of
derailing into uncivil behavior. We identify a specific component of this workflow that lends itself
to algorithmic assistance and test the feasibility of such assistance by observing how moderators in
this community engage with a tool running live on their platform.
   Based on our interviews with moderators of Wikipedia Talk Pages, we reveal a delicate balance
between two moderation goals in this collaborative setting: maintaining a civil and welcoming
environment, while trying not to alienate otherwise valuable content creators. Reactive moderation
tends to put these goals at odds: imposing harsh sanctions against uncivil behavior from otherwise
valuable contributors can alienate them, but leaving such behavior alone creates a less civil and
less welcoming environment. Proactive moderation offers an alternative pathway, by preventing
sanctionable actions from occurring in the first place. Moreover, whereas reactive interventions
tend to be strict formal sanctions such as a block, proactive interventions better lend themselves to
more nuanced, informal actions. In interviews, moderators discuss how they employ proactive
moderation strategies to prevent incivility without needing to remove any content or alienate
users.
   The interviews further shed light on aspects of the proactive workflow that moderators currently
find challenging, and which may therefore benefit from algorithmic assistance. Moderators reported
that there are too many ongoing conversations for them to reasonably inspect and that they would
benefit from tools that can help them identify those that are at-risk, echoing suggestions from
prior work [39, 56]. Additionally, moderators indicated that once at-risk conversations are identified,
monitoring them is cumbersome and time-consuming, and they expressed a need for tools that can
support this process.
   We design a prototype tool to examine how algorithmic assistance could help moderators over-
come these difficulties. In their feedback, moderators indicated that they found the prototype tool’s
Ranking View, and in particular its CRAFT score and score change information, to be helpful in dis-
covering potentially at-risk conversations. Furthermore, moderators’ feedback on the Conversation
View shows that CRAFT scores within a conversation tend to match moderators’ intuitions regard-
ing derailment, thus providing a helpful visual aid for monitoring at-risk conversations. Overall,
these findings suggest that using an algorithmic tool to quickly identify candidate conversations
for monitoring could improve the scalability of proactive moderation by reducing the need for
labor-intensive manual searching and tracking of Talk Page discussions.
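   To make this ranking logic concrete, the following is a minimal sketch (in Python) of how active
conversations could be re-scored and ordered by current risk and by recent change in risk. It is an
illustration only, not the prototype's actual implementation; forecast_score is a hypothetical
stand-in for a conversational forecasting model such as CRAFT.

from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Conversation:
    conv_id: str
    comments: List[str]                                        # comment texts so far, in order
    score_history: List[float] = field(default_factory=list)   # past forecast scores


def update_and_rank(
    conversations: List[Conversation],
    forecast_score: Callable[[List[str]], float],   # hypothetical CRAFT-like scorer returning a value in [0, 1]
    top_k: int = 20,
) -> List[Conversation]:
    # Re-score every active conversation, then rank by current risk,
    # breaking ties by the most recent change in score (rising risk first).
    for conv in conversations:
        conv.score_history.append(forecast_score(conv.comments))

    def sort_key(conv: Conversation):
        current = conv.score_history[-1]
        previous = conv.score_history[-2] if len(conv.score_history) > 1 else current
        return (current, current - previous)

    return sorted(conversations, key=sort_key, reverse=True)[:top_k]

A moderator-facing dashboard could, for instance, call update_and_rank whenever new comments arrive
and surface only the top-ranked discussions for manual monitoring, mirroring the role the Ranking View
played in our prototype.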
   These findings motivate further steps towards developing and testing technology for assisting
proactive moderation:
   Quantitative analysis. While we have observed our prototype tool in the hands of moderators
during the course of each hour-long interview, a full quantitative analysis of the usage and utility
of the tool would require observing its use over a longer time period, running on a broad
selection of discussions. Such larger-scale studies need to be set up with care, potentially using a
sandbox approach [8], to avoid disrupting the broader Wikipedia environment.
   Potential misuses and ethical considerations. Any new technology that engages so directly
with human communication requires thorough ethical scrutiny. We have argued (Section 2) that
existing reactive and pre-screening moderation frameworks come with a number of ethical down-
sides, and that this justifies at least looking into alternative possibilities such as our proposal
of algorithmically-assisted proactive moderation. Even so, we must remain conscious that our
proposed methodology still falls under the broad umbrella of automated approaches to moderation,
a subject area that in general raises hard moral questions [24]. Though we have already identified
and discussed some ethical considerations that arose from the interviews (Section 5.3.2), that discussion is
by no means comprehensive. At a high level, further moral questions may arise from two sources.
The first is technological: while the flaws of toxicity-classification algorithms that underpin reac-
tive moderation tools are now well-studied and documented [15, 17], conversational forecasting
algorithms like CRAFT are a much newer technology and have yet to receive similar levels of
scrutiny. The second is social: though human moderators already engage in proactive moderation
on a small scale, if tools like ours succeed in their goal of scaling up such practices, there may
be unforeseen consequences, as past work has demonstrated in other situations where human
moderators augmented their existing practices with automated tools [29].
   The ethical considerations listed above provide further inspiration for the design of a follow-
up study. Such work should, in addition to evaluating the technical merits of algorithmically
assisted proactive moderation, explore how to effectively inform moderators about the limitations
and dangers of such algorithmic tools. Another important step is error analysis: the proactive
moderation paradigm is more prone to error than the reactive moderation paradigm, as predicting
the future is inherently harder than detecting something that already happened. These errors can
be more consequential in the proactive paradigm—a proactive intervention based on a false positive
prediction could in the worst case shut down a perfectly civil conversation before it even has a
chance to progress, with the potential for a chilling effect on speech if these errors are common.
As such, future work needs to evaluate and characterize the types of errors made by forecasting
algorithms, and observe how moderators might be influenced by these errors. A particular focus
should be placed on transparency and explainability to ensure that moderators can better understand
the suggestions of the algorithms and identify erroneous ones.
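   As a concrete illustration of such an error analysis, the sketch below (in Python) computes a few
basic error statistics over a hypothetical held-out set of conversations, each paired with a forecast
score and its eventual outcome; the function name, data, and threshold are illustrative assumptions,
not part of our study or tool.

from typing import Dict, List, Tuple


def error_profile(scored: List[Tuple[float, bool]], threshold: float = 0.5) -> Dict[str, float]:
    # scored: pairs of (forecast score, whether the conversation actually derailed)
    tp = sum(1 for s, derailed in scored if s >= threshold and derailed)
    fp = sum(1 for s, derailed in scored if s >= threshold and not derailed)
    fn = sum(1 for s, derailed in scored if s < threshold and derailed)
    tn = sum(1 for s, derailed in scored if s < threshold and not derailed)
    return {
        # Of the conversations the tool would flag, how many truly derail?
        "precision": tp / (tp + fp) if tp + fp else float("nan"),
        # How often would a civil conversation be flagged (the costly case discussed above)?
        "false_positive_rate": fp / (fp + tn) if fp + tn else float("nan"),
        # How many eventual derailments would the tool miss?
        "miss_rate": fn / (fn + tp) if fn + tp else float("nan"),
    }

Sweeping the threshold in such an analysis also connects to the certainty levels probed in our interviews
(Appendix A), since the precision achieved at a given threshold corresponds to how often a flagged
conversation actually ends up derailing.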
   Other domains and additional uses. While our focus has been on the collaborative domain
of Wikipedia Talk Pages, future work should investigate the proactive moderation paradigm in
other settings, such as Reddit and Facebook Groups. Even though our methodology can be readily
translated to such domains, we do not expect the needs and dispositions of moderators on other
platforms to entirely echo those we observed, and thus the design of the tool might need to be adapted. In
particular, prior studies have revealed that Reddit moderators might be more hands-off and thus
less likely to engage in proactive strategies [59].
   Finally, our findings regarding how moderators can benefit from predictions about the future of
a conversation invite the question of how else this information could benefit the broader
community. For example, future work could investigate the potential of empowering regular users
(that is, the participants within online discussions) with this information through a user-facing tool.
By demonstrating the benefits of proactive moderation, and showing the feasibility of algorithmic
assistance in this process, our present work lays a strong foundation for such future work, which
constitutes a promising new direction in research on online content moderation.
ACKNOWLEDGEMENTS
We would like to thank Lillian Lee, Cecelia Madsen, the 2021-2022 cohort of fellows at the Center for
Advanced Study in the Behavioral Sciences at Stanford, and all the reviewers for the enlightening
discussions and helpful suggestions. We additionally recognize everyone who helped with the
implementation of the prototype tool, particularly Lucas Van Bramer and Oscar So for their contri-
butions to the codebase, Todd Cullen for his help in setting up the backend server configuration,
and Caleb Chiam, Liye Fu, Khonzoda Umarova, and Justine Zhang for their extensive testing and
generous feedback. Finally, we are grateful to all the Wikipedia Administrators who participated in
our interviews, as well as to Daniel Glus who guided our efforts by offering key starting insights into
the Wikipedia moderation workflow, and who kickstarted the recruitment process by connecting
us to the broader community. This research was supported in part by an NSF CAREER award
IIS-1750615; Jonathan P. Chang was supported in part by a fellowship with the Cornell Center for
Social Sciences and Cristian Danescu-Niculescu-Mizil was supported in part by fellowships with
the Cornell Center for Social Sciences and with the Center for Advanced Study in the Behavioral
Sciences at Stanford.
REFERENCES
 [1] Yavuz Akbulut, Yusuf Levent Sahin, and Bahadir Eristi. 2010. Cyberbullying Victimization among Turkish Online
     Social Utility Members. Educational Technology & Society 13, 4 (Oct. 2010).
 [2] Ofer Arazy, Lisa Yeo, and Oded Nov. 2013. Stay on the Wikipedia Task: When Task-related Disagreements Slip Into
     Personal and Procedural Conflicts. Journal of the American Society for Information Science and Technology 64, 8 (Aug.
     2013).
 [3] Matt Billings and Leon A. Watts. 2010. Understanding Dispute Resolution Online: Using Text to Reflect Personal and
     Substantive Issues in Conflict. In Proceedings of CHI.
 [4] Lindsay Blackwell, Tianying Chen, Sarita Schoenebeck, and Cliff Lampe. 2018. When Online Harassment Is Perceived
     as Justified. In Proceedings of ICWSM.
 [5] Joy Buolamwini and Timnit Gebru. 2018. Gender Shades: Intersectional Accuracy Disparities in Commercial Gender
     Classification. In Proceedings of FAccT.
 [6] Jie Cai and Donghee Yvette Wohn. 2019. What Are Effective Strategies of Handling Harassment on Twitch? Users’
     Perspectives. In Proceedings of CSCW.
 [7] Stevie Chancellor, Jessica Annette Pater, Trustin Clear, Eric Gilbert, and Munmun De Choudhury. 2016. #thyghgapp:
     Instagram Content Moderation and Lexical Variation in Pro-Eating Disorder Communities. In Proceedings of CSCW.
 [8] Eshwar Chandrasekharan, Chaitrali Gandhi, Matthew Wortley Mustelier, and Eric Gilbert. 2019. Crossmod: A Cross-
     Community Learning-based System to Assist Reddit Moderators. In Proceedings of CSCW.
 [9] Eshwar Chandrasekharan, Umashanthi Pavalanathan, Anirudh Srinivasan, Adam Glynn, Jacob Eisenstein, and Eric
     Gilbert. 2017. You Can’t Stay Here: The Efficacy of Reddit’s 2015 Ban Examined Through Hate Speech. In Proceedings
     of CSCW.
[10] Eshwar Chandrasekharan, Mattia Samory, Shagun Jhaver, Hunter Charvat, Amy Bruckman, Cliff Lampe, Jacob
     Eisenstein, and Eric Gilbert. 2018. The Internet’s Hidden Rules: An Empirical Study of Reddit Norm Violations at
     Micro, Meso, and Macro Scales. In Proceedings of CSCW.
[11] Jonathan P. Chang, Caleb Chiam, Liye Fu, Andrew Wang, Justine Zhang, and Cristian Danescu-Niculescu-Mizil. 2020.
     ConvoKit: A Toolkit for the Analysis of Conversations. In Proceedings of SIGDIAL.
[12] Jonathan P. Chang and Cristian Danescu-Niculescu-Mizil. 2019. Trajectories of Blocked Community Members:
     Redemption, Recidivism and Departure. In Proceedings of WWW.
[13] Jonathan P. Chang and Cristian Danescu-Niculescu-Mizil. 2019. Trouble on the Horizon: Forecasting the Derailment
     of Online Conversations as They Develop. In Proceedings of EMNLP.
[14] Benjamin Collier and Julia Bear. 2012. Conflict, Criticism, or Confidence: An Empirical Examination of the Gender
     Gap in Wikipedia Contributions. In Proceedings of CSCW.
[15] Thomas Davidson, Dana Warmsley, Michael Macy, and Ingmar Weber. 2017. Automated Hate Speech Detection and
     the Problem of Offensive Language. In Proceedings of ICWSM.
[16] Bryan Dosono and Bryan Semaan. 2019. Moderation Practices as Emotional Labor in Sustaining Online Communities:
     The Case of AAPI Identity Work on Reddit. In Proceedings of CHI.
[17] Natasha Duarte and Emma Llansó. 2018. Mixed Messages? The Limits of Automated Social Media Content Analysis.
     In Proceedings of FAccT.
[18] Dmitry Epstein and Gilly Leshed. 2020. The Magic Sauce: Practices of Facilitation in Online Policy Deliberation.
     Journal of Deliberative Democracy 12, 1 (May 2020).
[19] Thomas Erickson and Wendy A. Kellogg. 2000. Social Translucence: An Approach to Designing Systems That Support
     Social Processes. ACM Transactions on Computer-Human Interaction 7, 1 (March 2000).
[20] Björn Gambäck and Utpal Kumar Sikdar. 2017. Using Convolutional Neural Networks to Classify Hate-Speech. In
     Proceedings of ALW.
[21] R. Stuart Geiger and David Ribes. 2010. The Work of Sustaining Order in Wikipedia: The Banning of a Vandal. In
     Proceedings of CSCW.
[22] Sarah A. Gilbert. 2020. "I Run the World’s Largest Historical Outreach Project and It’s on a Cesspool of a Website."
     Moderating a Public Scholarship Site on Reddit: A Case Study of r/AskHistorians. In Proceedings of CSCW.
[23] Tarleton Gillespie. 2018. Custodians of the Internet: Platforms, Content Moderation, and the Hidden Decisions That Shape
     Social Media. Yale University Press, New Haven.
[24] Tarleton Gillespie. 2020. Content Moderation, AI, and the Question of Scale. Big Data & Society (July 2020).
[25] Tarleton Gillespie, Patricia Aufderheide, Elinor Carmi, Ysabel Gerrard, Robert Gorwa, Ariadna Matamoros-Fernández,
     Sarah T. Roberts, Aram Sinnreich, and Sarah Myers West. 2020. Expanding the Debate about Content Moderation:
     Scholarly Research Agendas for the Coming Policy Debates. Internet Policy Review 9, 4 (Oct. 2020).
[26] James Grimmelmann. 2015. The Virtues of Moderation. Yale Journal of Law and Technology 17, 1 (Sept. 2015).
[27] David Gurzick, Kevin F. White, Wayne G. Lutters, and Lee Boot. 2009. A View from Mount Olympus: The Impact of
     Activity Tracking Tools on the Character and Practice of Moderation. In Proceedings of GROUP.
[28] Aaron Halfaker and R. Stuart Geiger. 2020. ORES: Lowering Barriers with Participatory Machine Learning in Wikipedia.
     In Proceedings of CSCW.
[29] Aaron Halfaker, R. Stuart Geiger, Jonathan T. Morgan, and John Riedl. 2013. The Rise and Decline of an Open
     Collaboration System. American Behavioral Scientist 57, 5 (May 2013).
[30] Aaron Halfaker, Aniket Kittur, and John Riedl. 2011. Don’t Bite the Newbies: How Reverts Affect the Quantity and
     Quality of Wikipedia Work. In Proceedings of WikiSym.
[31] Aaron Halfaker, Bryan Song, D. Alex Stuart, Aniket Kittur, and John Riedl. 2011. NICE: Social Translucence Through
     UI Intervention. In Proceedings of WikiSym.
[32] Christophe Henner and Maria Sefidari. 2016. Wikimedia Foundation Board on Healthy Wikimedia Community Culture,
     Inclusivity, and Safe Spaces – Wikimedia Blog. https://blog.wikimedia.org/2016/12/08/board-culture-inclusivity-safe-spaces/
[33] Krithika Jagannath, Katie Salen, and Petr Slovàk. 2020. "(We) Can Talk It Out...": Designing for Promoting Conflict-
     Resolution Skills in Youth on a Moderated Minecraft Server. In Proceedings of CSCW.
[34] Shagun Jhaver, Darren Scott Appling, Eric Gilbert, and Amy Bruckman. 2019. “Did You Suspect the Post Would Be
     Removed?”: Understanding User Reactions to Content Removals on Reddit. In Proceedings of CSCW.
[35] Shagun Jhaver, Iris Birman, Eric Gilbert, and Amy Bruckman. 2019. Human-Machine Collaboration for Content
     Regulation: The Case of Reddit Automoderator. ACM Transactions on Computer-Human Interaction 26, 5 (July 2019).
[36] Shagun Jhaver, Christian Boylston, Diyi Yang, and Amy Bruckman. 2021. Evaluating the Effectiveness of Deplatforming
     as a Moderation Strategy on Twitter. In Proceedings of CSCW.
[37] Shagun Jhaver, Sucheta Ghoshal, Amy Bruckman, and Eric Gilbert. 2018. Online Harassment and Content Moderation:
     The Case of Blocklists. ACM Transactions on Computer-Human Interaction 25, 2 (March 2018).
[38] Amy Johnson. 2017. The Multiple Harms of Sea Lions. In Perspectives on Harmful Speech Online. Berkman Klein Center
     for Internet & Society.
[39] David Jurgens, Libby Hemphill, and Eshwar Chandrasekharan. 2019. A Just and Comprehensive Strategy for Using
     NLP to Address Online Abuse. In Proceedings of ACL.
[40] Sara Kiesler, Robert Kraut, Paul Resnick, and Aniket Kittur. 2012. Regulating Behavior in Online Communities. In
     Building Successful Online Communities: Evidence-Based Social Design, Paul Resnick and Robert Kraut (Eds.). MIT Press.
[41] Aniket Kittur and Robert E. Kraut. 2008. Harnessing the Wisdom of Crowds in Wikipedia: Quality Through Coordina-
     tion. In Proceedings of CSCW.
[42] Aniket Kittur, Bongwon Suh, Bryan A. Pendleton, and Ed H. Chi. 2007. He Says, She Says: Conflict and Coordination
     in Wikipedia. In Proceedings of CHI.
[43] Travis Kriplean, Jonathan Morgan, Deen Freelon, Alan Borning, and Lance Bennett. 2012. Supporting Reflective Public
     Thought with Considerit. In Proceedings of CSCW.
[44] Travis Kriplean, Michael Toomim, Jonathan Morgan, Alan Borning, and Andrew Ko. 2012. Is This What You Meant?:
     Promoting Listening on the Web with Reflect. In Proceedings of CHI.
[45] Cliff Lampe and Paul Resnick. 2004. Slash(Dot) and Burn: Distributed Moderation in a Large Online Conversation
     Space. In Proceedings of CHI.
[46] Ping Liu, Joshua Guberman, Libby Hemphill, and Aron Culotta. 2018. Forecasting the Presence and Intensity of
     Hostility on Instagram Using Linguistic and Social Features. In Proceedings of ICWSM.
[47] Claudia Wai Yu Lo. 2018. When All You Have Is a Banhammer: The Social and Communicative Work of
     Volunteer Moderators. Thesis. Massachusetts Institute of Technology.
[48] J. Nathan Matias. 2019. The Civic Labor of Volunteer Moderators Online. Social Media + Society 5, 2 (April 2019).
[49] Aiden R McGillicuddy, Jean-Gregoire Bernard, and Jocelyn Ann Cranefield. 2016. Controlling Bad Behavior in Online
     Communities: An Examination of Moderation Work. In Proceedings of ICIS.
[50] Jonathan T. Morgan and Aaron Halfaker. 2018. Evaluating the Impact of the Wikipedia Teahouse on Newcomer
     Socialization and Retention. In Proceedings of OpenSym.
[51] Chikashi Nobata, Joel Tetreault, Achint Thomas, Yashar Mehdad, and Yi Chang. 2016. Abusive Language Detection in
     Online User Content. In Proceedings of WWW.
[52] Deokgun Park, Simranjit Sachar, Nicholas Diakopoulos, and Niklas Elmqvist. 2016. Supporting Comment Moderators
     in Identifying High Quality Online News Comments. In Proceedings of CHI.
[53] Juliana Raskauskas and Ann D. Stoltz. 2007. Involvement in Traditional and Electronic Bullying Among Adolescents.
     Developmental Psychology 43, 3 (May 2007).
[54] Maarten Sap, Dallas Card, Saadia Gabriel, Yejin Choi, and Noah A. Smith. 2019. The Risk of Racial Bias in Hate Speech
     Detection. In Proceedings of ACL.
[55] Joseph Seering. 2020. Reconsidering Self-Moderation: The Role of Research in Supporting Community-Based Models
     for Online Content Moderation. In Proceedings of CSCW.
[56] Joseph Seering, Tianmi Fang, Luca Damasco, Mianhong ’Cherie’ Chen, Likang Sun, and Geoff Kaufman. 2019. Designing
     User Interface Elements to Improve the Quality and Civility of Discourse in Online Commenting Behaviors. In
     Proceedings of CHI.
[57] Joseph Seering, Geoff Kaufman, and Stevie Chancellor. 2020. Metaphors in Moderation. New Media & Society (Oct.
     2020).
[58] Joseph Seering, Robert Kraut, and Laura Dabbish. 2017. Shaping Pro and Anti-Social Behavior on Twitch Through
     Moderation and Example-Setting. In Proceedings of CSCW.
[59] Joseph Seering, Tony Wang, Jina Yoon, and Geoff Kaufman. 2019. Moderator Engagement and Community Development
     in the Age of Algorithms. New Media & Society 21, 7 (July 2019).
[60] Kumar Srinivasan, Cristian Danescu-Niculescu-Mizil, Lillian Lee, and Chenhao Tan. 2019. Content Removal as a
     Moderation Strategy: Compliance and Other Outcomes in the ChangeMyView Community. In Proceedings of CSCW.
[61] Samuel Hardman Taylor, Dominic DiFranzo, Yoon Hyung Choi, Shruti Sannon, and Natalya N. Bazarova. 2019.
     Accountability and Empathy by Design: Encouraging Bystander Intervention to Cyberbullying on Social Media. In
     Proceedings of CSCW.
[62] Michael Wiegand, Josef Ruppenhofer, and Thomas Kleinbauer. 2019. Detection of Abusive Language: The Problem of
     Biased Datasets. In Proceedings of NAACL.
[63] Wikimedia Support and Safety Team. 2015. Harassment Survey. https://upload.wikimedia.org/wikipedia/commons/5/
     52/Harassment_Survey_2015_-_Results_Report.pdf
[64] Donghee Yvette Wohn. 2019. Volunteer Moderators in Twitch Micro Communities: How They Get Involved, the Roles
     They Play, and the Emotional Labor They Experience. In Proceedings of CHI.
[65] Ellery Wulczyn, Nithum Thain, and Lucas Dixon. 2017. Ex Machina: Personal Attacks Seen at Scale. In Proceedings of
     WWW.
[66] Justine Zhang, Jonathan P. Chang, Cristian Danescu-Niculescu-Mizil, Lucas Dixon, Nithum Thain, Yiqing Hua, and
     Dario Taraborelli. 2018. Conversations Gone Awry: Detecting Early Signs of Conversational Failure. In Proceedings of
     ACL.
[67] Lei (Nico) Zheng, Christopher M. Albano, Neev M. Vora, Feng Mai, and Jeffrey V. Nickerson. 2019. The Roles Bots Play
     in Wikipedia. In Proceedings of CSCW.
A    INTERVIEW QUESTIONS
This appendix shows the general outline we followed for all moderator interviews. Note that this
outline served only as a general guide; because the interviews were semi-structured, we let the conversation
flow naturally, so the exact order and wording of questions varied in practice.
         – Q: How much time do you spend moderating each day? Each week?
           ∗ First ask about time spent as an administrator, then ask about moderation.
         – Q: How satisfied are you with your current moderation practices? Do you see room for
           improvement?
         – Q: When doing your job as a moderator, would you rather:
            ∗ (a) only [remove/take moderation action on] very flagrant rule violations, and potentially
              miss some rule-breaking comments, or
            ∗ (b) [remove/take moderation action on] all comments that could be rule violations, poten-
              tially [removing/taking action on] some comments that don't deserve it.
        ∗ Would you want to monitor new comments if you had high certainty in the prediction
          (i.e., 80% of the conversations that are predicted to derail will eventually actually end up
          derailing)?
    • Taking proactive steps for derailing conversations
      – Q: Assuming you would know for sure that a (currently civil) conversation will turn uncivil
        and violate the rules of the community, what proactive steps do you, as a moderator, see
        yourself taking in order to prevent uncivil behavior (if any)?
      – Q: Now—as before—consider a more realistic scenario, where you cannot know for sure
        what the future of a conversation will be. Now, say we have a conversation that is predicted
        to derail; we will go through various levels of confidence in this prediction, and I want you
        to tell me which of the proactive steps you just mentioned you would still take for each
        level of prediction confidence.
        ∗ What proactive steps would you take if you had low certainty in the prediction (i.e.,
          20% of the conversations that are predicted to derail will eventually actually end up
          derailing)?
        ∗ What proactive steps would you take if you had 50-50 certainty in the prediction (i.e.,
          50% of the conversations that are predicted to derail will eventually actually end up
          derailing)?
        ∗ What proactive steps would you take if you had high certainty in the prediction (i.e.,
          80% of the conversations that are predicted to derail will eventually actually end up
          derailing)?
      – Q: Have you ever taken any of these proactive steps in the past?