Hostname: page-component-6bf8c574d5-9f2xs Total loading time: 0 Render date: 2025-03-11T17:30:12.982Z Has data issue: false hasContentIssue false

“Let Me Just Interrupt You”: Estimating Gender Effects in Supreme Court Oral Arguments

Published online by Cambridge University Press:  10 March 2025

Erica Cai
Affiliation:
Ph.D. Student, University of Massachusetts Amherst, Amherst, Massachusetts, USA
Ankita Gupta
Affiliation:
Ph.D. Student, University of Massachusetts Amherst, Amherst, Massachusetts, USA
Katherine A. Keith*
Affiliation:
Assistant Professor, Williams College, Williamstown, Massachusetts, USA
Brendan O’Connor
Affiliation:
Associate Professor, University of Massachusetts Amherst, Amherst, Massachusetts, USA
Douglas Rice*
Affiliation:
Associate Professor, University of Massachusetts Amherst, Amherst, Massachusetts, USA
*
Corresponding authors: Katherine A. Keith and Douglas Rice; Emails: [email protected]; [email protected]
Corresponding authors: Katherine A. Keith and Douglas Rice; Emails: [email protected]; [email protected]
Rights & Permissions [Opens in a new window]

Abstract

Oral argument is the most public and visible part of the U.S. Supreme Court’s decision-making process. Yet what if some advocates are treated differently before the Court solely because of aspects of their identity? In this work, we leverage a causal inference framework to quantify the effect of an advocate’s gender on interruptions of advocates at both the Court-level and the justice-level. Examining nearly four decades of U.S. Supreme Court oral argument transcript data, we identify a clear and consistent gender effect that dwarfs other influences on justice interruption behavior, with female advocates interrupted more frequently than male advocates.

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of the Law and Courts Organized Section of the American Political Science Association

Introduction

Oral argument is the most visible part of the U.S. Supreme Court’s decision-making process. Advocates on conflicting sides of the nation’s most important controversies press their arguments, while justices interject questions and comments. After, the Court meets in a private conference where the justices offer their initial votes and begin the process of drafting opinions explaining those votes. A rich literature documents how these arguments inform the decisions of the Court, with oral argument influencing the information justices have (e.g., Johnson Reference Johnson2001, Reference Johnson2004), the issues discussed in decisions (e.g., Black, Johnson, and Wedeking Reference Black, Johnson and Wedeking2012), and the justice votes (e.g., Johnson, Wahlbeck, and Spriggs Reference Johnson, Wahlbeck and Spriggs2006).

Yet prior research raises an important question: What if some advocates are treated differently by the justices solely because of aspects of their identity unrelated to the case? Consider the Supreme Court’s oral argument in United States v. Texas in November of 2021. The case dealt with a Texas state law that effectively banned abortion after six weeks. Arguing on behalf of the United States, Solicitor General Elizabeth Prelogar – a woman – was interrupted regularly by the justices, with one particularly poignant example occurring in this exchange with Associate Justice Samuel Alito:

GENERAL PRELOGAR: While I certainly acknowledge, Justice Alito, that an injunction that would bind state court judges is extremely rare, it’s not unheard of, and I think, in the unprecedented facts of this case, it’s appropriate relief. And –

JUSTICE ALITO: Well, judges have been enjoined –

GENERAL PRELOGAR: – and the reason for that is –

JUSTICE ALITO: – let me just interrupt you – judges have been enjoined from performing unlawful acts.

Interruptions of this sort are not unique, with recent work demonstrating that female advocates and justices are, in the aggregate, interrupted more frequently and given less time to speak than their male colleagues (see, for example, Jacobi and Schweers Reference Jacobi and Schweers2017). This aggregate pattern of gendered interruptions relates to an emerging consensus on the influence of gender on judicial behavior more generally that covers the gamut from oral argument (e.g., Jacobi and Schweers Reference Jacobi and Schweers2017; Patton and Smith Reference Patton and Smith2017) to deliberation (e.g., Boyd, Epstein, and Martin Reference Boyd, Epstein and Martin2010) and through to voting (e.g., Collins and Moyer Reference Collins and Moyer2008; Boyd, Epstein, and Martin Reference Boyd, Epstein and Martin2010; Szmer, Sarver, and Kaheny Reference Szmer, Sarver and Kaheny2010; Szmer et al. Reference Szmer, Kaheny, Sarver and DeCamillis2013; Gleason and Smart Reference Gleason and Smart2022). Clearly, women being interrupted more frequently has stark consequences for their participation in political and legal spaces. Interruptions are a way for someone to assert dominance over a conversation (Zimmermann and West Reference Zimmermann and West1996), and that the justices might more frequently assert their dominance over female advocates than male advocates detracts from the ability of women to fully participate at the Court.

But establishing that women are being interrupted more frequently because they are women is a difficult causal problem. In the case of Supreme Court oral argument, for instance, women may systematically represent different ideological perspectives where the probability of being interrupted is greater. Likewise, as Solicitor General Prelogar’s affirmative action quote demonstrates, given systemic disadvantages, the average woman who pursues and gains the role of advocate before the Supreme Court is likely to be systematically different than the average man who does the same, and their relationship to the Court’s underlying ideological divisions may reflect this. In both cases, differences in interruption rates are not directly attributable to gender, though they are indirectly. By way of example, consider gender differences in judicial appointments, the pinnacle achievement for those in the legal profession, by the party of recent presidents. During his four years in office, approximately 24% of federal judges appointed by then-President Donald Trump were women, compared to 42% of appointments during the tenure of then-President Barack Obama (Gramlich Reference Gramlich2021); as of the end of July 2022, fully 77% of the appointments by then-President Joe Biden had been women. These appointment dynamics mean the ideology of the politician in power influences judicial career prospects differently by gender and may analogously influence the distribution of observed ideologies of justices and advocates, generating difficult causal entanglements that render standard modeling approaches dangerous for causal interpretations.

We untangle the difficult causal question of whether female advocates are interrupted more often at oral argument because they are women. To do so, this paper makes three important contributions to the existing literature on gender and oral argument. First, we provide a justice-level analysis, estimating individual justice interruption rates to isolate behaviors and disentangle a single justice’s interruption propensity from others. Second, it introduces a research design tailored for the complex nature of conversations like Supreme Court oral arguments, employing a new Markov assumption for conversational chunks and normalizing interruption rates by the number of tokens spoken by advocates, thus allowing for direct gender-based comparisons. Third, the study compares multiple causal pathways of gender bias with other potential causal pathways such as ideological alignment and style, using precise definitions and non-parametric estimation to ensure clarity and interpretability of assumptions and strategies.

Our careful causal research design is employed with data from over 3,000 oral argument sessions featuring over 770,000 advocate statements (or “utterances”) by more than 4,000 unique advocates. The analysis demonstrates the bias that female advocates face before the Court. Throughout, we find consistent evidence most justices interrupt female advocates more often than male advocates. Critically, we find that the effect of gender on justice interruption behavior far outweighs that of other hypothetical influences on interruption behavior. Even ideological factors pale in comparison to the gender effect, with the gender effect anywhere from three to five times greater than the effect of ideological disagreement, or arguments where justices are likely – based on their ideological predispositions and the argument being made by the advocate – to disagree with the advocate. The effect is identifiable among most justices on the Court, but particularly prominent among the moderate and conservative justices. A series of corroborative analyses further establish the primacy of gender as the primary factor driving interruption behavior among most justices. Moreover, in establishing this effect, our causal framework permits us to state all our assumptions explicitly rather than let them be left up to interpretation (Lundberg, Johnson, and Stewart Reference Lundberg, Murray and Stewart2021), and provides a more careful and robust approach for this critical area of legal and political inquiry.

Gender Bias and Oral Argument

Women face significant barriers to participation in deliberative settings and the legal profession. Women speak less (Karpowitz and Mendelberg Reference Karpowitz and Mendelberg2014; Mendelberg and Karpowitz Reference Mendelberg and Karpowitz2016) and must navigate different linguistic expectations compared to men (e.g., Hancock and Rubin Reference Hancock and Rubin2015; Gleason and Smart Reference Gleason and Smart2022). In Congress, women get fewer leadership opportunities (Kanthak and Krause Reference Kanthak and Krause2012; Barnes Reference Barnes2016) and are interrupted more often (Miller and Sutherland Reference Miller and Sutherland2022). Despite increasing law school admissions, the vast majority of prestigious legal positions remain male-dominated (St. Eve and Luguri Reference St. Eve and Luguri2021). At the Supreme Court, the disparity is stark, with only 18% of attorneys arguing before the Court in the 2020-2021 term being women (St. Eve and Luguri Reference St. Eve and Luguri2021).

During the 2022 oral arguments in two cases addressing affirmative action in college admissions, Solicitor General Prelogar highlighted the extreme gender disparities among advocates arguing before the Court, emphasizing the broader implications for women’s career prospects in law. She remarked that out of 27 advocates the Court would hear from that term, only two were women, despite women comprising over half of law school graduates.

Beyond this disparity in representation, what of disparities in treatment by gender once advocates are participating at the Court? We focus on one area, oral argument, where even the female justices on the Court – equals by design of their male colleagues – are interrupted more frequently than the male justices (e.g., Jacobi and Schweers Reference Jacobi and Schweers2017). The advocate, with limited time to present, can be freely interrupted during that time by justices who may ask questions, respond to particular arguments, or simply speak. Importantly, these arguments are opportunities for the justices to gain information (e.g., Johnson Reference Johnson2004), elucidate issues, and possibly change another justice’s vote (e.g., Johnson Reference Johnson2004; Black, Johnson, and Wedeking Reference Black, Johnson and Wedeking2012; Ringsmuth, Bryan, and Johnson Reference Ringsmuth, Bryan and Johnson2013). The arguments are also an increasingly public presentation of the Court, with live audio broadcast as well as recorded audio and transcripts released shortly afterwards. In all, oral argument is a critical, public-facing component of the work of a justice on the Court.

What, therefore, influences justice behavior at oral argument? As elsewhere (e.g., Zorn and Bowie Reference Zorn and Bowie2010), scholars have demonstrated the influence of a justices’ ideological preferences on their behavior during oral arguments. Indeed, recent work suggests a growing influence of ideology on behavior at oral argument (Jacobi and Sag Reference Jacobi and Sag2019). Liberal justices are more likely to interrupt conservative arguments, and conservative justices are more likely to interrupt liberal arguments (Johnson, Wahlbeck, and Spriggs Reference Johnson, Wahlbeck and Spriggs2006; Jacobi and Schweers Reference Jacobi and Schweers2017). Notably, justices are particularly likely to pose questions to legal representatives aligned with ideologies opposite their own; for example, the number of questions asked to each side has emerged as a consistent predictor of the likely winner of a case in the past, as those sides asked more questions are viewed as the likely loser (Roberts Reference Roberts2005). Taken together, the evidence demonstrates that a justice is more likely to interrupt, more likely to speak to, and more likely to ask questions of an attorney arguing against the justice’s perceived ideological inclinations.

Perhaps unsurprisingly then, the justice’s behavior at oral argument is predictive of how the justice votes in the case (see, for example, Jacobi and Rozema Reference Jacobi and Rozema2018; Dietrich, Enos, and Sen Reference Dietrich, Enos and Sen2019a). Both the number of questions (Roberts Reference Roberts2005) and the tone of the questions (Black et al. Reference Black, Treul, Johnson and Goldman2011; Dietrich, Enos, and Sen Reference Dietrich, Enos and Sen2019a) are predictive of how justices ultimately vote. For advocates, the time is therefore invaluable in structuring the discussion of the case, setting the table for a possible successful outcome, and making their pitch to the broader public (Sullivan and Canty Reference Sullivan and Canty2015; Jacobi and Schweers Reference Jacobi and Schweers2017).

Interruptions at Oral Argument

The growing importance of oral argument has been accompanied by attention to the conversational dynamics at play, and particularly the extent to which the conversations are marked by interruptions. The earlier exchange between Justice Alito and Solicitor General Prelogar is one example. Interruptions can be broken down into two different types: justices interrupting the advocates and justices interrupting other justices. In the latter case, the interruptions are often deemed inappropriate; the justices are equals, and thus none should interrupt another. Responding to concerns that the female justices were not getting the opportunity to speak, the Court recently instituted new rules around oral argument, with each justice permitted to take a turn to ask questions uninterrupted after an attorney’s allotted time has expired.Footnote 1

In the former case, interruptions of advocates are in the design of oral argument. Indeed, the Court’s Guide for Counsel states, “If you are speaking and a Justice interrupts you, cease talking immediately and listen” (9). Interruptions stop the development of an argument so that a justice may move the argument in a different direction. Importantly, justice interruptions occur more frequently today than in the past (Sullivan and Canty Reference Sullivan and Canty2015). Yet while the Court sought to actively deal with interruptions of justices, for interruptions of attorneys, the Court’s response has been more limited, with the Court only instituting a new norm around trying to allow attorneys to speak uninterrupted for the first two minutes.Footnote 2

The new norm of brief uninterrupted time recognizes the challenge that interruptions introduce for advocates attempting to develop their argument. Accordingly, scholars have likewise turned their attention to conversational dynamics at oral argument. In Table 1, we summarize prior research on this, and specifically work examining gender-related bias and U.S. Supreme Court oral arguments. As the table makes clear, scholars have identified many different manifestations of bias, generally falling into three categories: speaking time, interruptions, and case outcomes. Overwhelmingly, scholars have measured each at the level of the case (or attorney), and have then estimated some form of parametric model. Importantly, only Patton and Smith (Reference Patton and Smith2020), in their analysis of justice speaking time, examine differences at the level of individual justices. Our analysis therefore diverges from this prior work by shifting the focus to the individual justice and their interactions with litigants, better specifying the causal pathways and relationships, improving the modeling and measurement of conversational dynamics through modeling of a justice-level token-normalized interruption rate, and limiting parametric assumptions. Of course, doing so requires several important empirical advances, and we outline these at length below.

Table 1. Summary of Prior Work Examining Interruption Behavior During U.S. Supreme Court Oral Argument. We compare this to the contribution of this work in the final line in which Justices are interrupting Advocates (Justice $ \to $ Advocate).

Before doing so, though, we note that we root our analysis in this rich prior work and build our expectation that two factors – beyond gender – might influence interruption behavior: whether a justice agrees or disagrees with the argument being made (or ideological alignment with the advocate) and the stylistic quality of the argument. Unfortunately, both factors are potentially intertwined with advocate gender. In terms of style, the Court publicly lays out guidance. The Guide for Counsel notes advocates should “speak in a clear, distinct manner” and “[b]e careful to use precise language” (Guide for Counsel, 10). Further, justices have beliefs about the quality of oral argument; former Justice Blackmun even graded the quality of an advocate’s oral argument (Johnson, Wahlbeck, and Spriggs Reference Johnson, Wahlbeck and Spriggs2006). Thus, justices may be more likely to interrupt when the stylistic quality of the argument is low. On this front, there are two ways we might expect female counsel to be different than male. First, given the barriers we note above, women may, in general, have higher stylistic quality than men arguing before the Court. By way of comparison, prior work on Congress has noted that women work harder and better represent their constituents (Lazarus and Steigerwalt Reference Lazarus and Steigerwalt2018). Considered in the context of a legal profession where they are consistently excluded from high-prestige positions, women participating in oral argument are likely to make arguments of exceptionally high quality. That being the case, quality differences may mask underlying gender differences.

Turning to the entwining of ideology and gender, the issue is nicely illustrated by Jacobi and Schweers (Reference Jacobi and Schweers2017), who note in their analysis of justice interruptions of other justices that “ideology and gender seem to work together in an interactive effect” (1487). Disentangling these is therefore critical to identifying the gender effect, as “results confirm that there is a clear ideological direction to interruptions on the Supreme Court” (Jacobi and Schweers Reference Jacobi and Schweers2017, 1379). The issue is further complicated because there is a substantial gender effect on the type of cases that women lawyers represent at the appellate level (St. Eve and Luguri Reference St. Eve and Luguri2021). As an analysis of the Seventh Circuit (St. Eve and Luguri Reference St. Eve and Luguri2021, 21) revealed,

Women argued more often in criminal cases and other cases that involved the government – like immigration and habeas cases – and at lower rates in civil cases. And among civil cases, the gender gap was particularly pronounced in cases that may be perceived as complex due to the structure of the case (class actions), its content (e.g., antitrust), or money at stake (as indicated through the involvement of a major law firm). Women were more likely to argue cases that involved the government, especially federal and local governments. An attorney representing the government was twice as likely to be a woman (compared to attorneys who represented non-governmental entities).

The gender disparity in representation means that women are more likely to represent particular types of arguments at the Supreme Court, arguments that might make them systematically more likely to be ideological allies (or opponents) of the justices.

In all, prior work is constrained by the struggle to disentangle the influence of gender on interruptions from other factors. The challenge we confront echoes many others encountered by scholars studying hypothetical influences on judicial behavior, as justice’s ideological preferences are entwined with other justice- and case-level characteristics. As a result, we have preliminary evidence from prior work that gender matters, but little understanding of how and how much it matters. To address this gap, we introduce a causal framework that lays out how gender influences interruption behavior at the Court. Our approach demonstrates that the magnitude of the effect of gender on interruption behavior is well beyond the magnitude of the effect of ideology (and style) on interruption. We turn now to describing our approach.

Research Design

Our goal of quantifying the effect of advocate gender in oral argument is inherently a causal goal. As Lundberg, Johnson, and Stewart Reference Lundberg, Murray and Stewart2021 explain, in the social sciences, the “dominant mode of quantitative inquiry” has been “hypotheses about regression coefficients.” However, the focus on regression coefficients limits a researcher to a particular class of models and parametric assumptions. In contrast, stating an explicit theoretical causal estimand allows researchers to thoroughly investigate what the aim of the statistical analysis is and make explicit the assumptions behind the analysis.

In our setting, we ideally want to estimate a counterfactual: if everything else in an oral argument remained the same, but we swapped a female advocate for a male advocate, would judges behave differently? In causal terminology, we are interested in the average treatment effect of advocate gender on a justice interrupting that advocate. Following causal frameworks for quantitative social science (Morgan and Winship Reference Morgan and Winship2015; Lundberg, Murray, and Stewart Reference Lundberg, Murray and Stewart2021), we discuss the decisions behind our research design – from the idealized causal experiment to causal estimands that can be estimated from data.

Unit of Analysis. In order to precisely state a theoretical causal estimand – for example, the average treatment effect of gender on interruption – we first need to define our unit of analysis, the quantity that is our idealized counterfactual and over which we aggregate.

Definition of a valid chunk. Oral arguments are composed of utterances – a continuous piece of speech from a single speaker – made by both advocates and justices. We are interested in aggregating them to short multi-utterance chunks of oral arguments, each representing an exchange between justice and advocate, to investigate our causal question about justices interrupting advocates. We define a valid chunk as four or more contiguous utterances in which there are exactly two speakers – a single justice and a single advocate – where the advocate makes the first utterance, and each speaker has two or more utterances.Footnote 3 Valid chunks are extracted with a greedy algorithm, iterating through utterances in the transcript, adding each utterance to the current chunk as long as they are from the same advocate–justice dyad, otherwise starting a new candidate chunk. We require a chunk to begin with the advocate so there is an opportunity to be interrupted by the justice and to have a minimum of 20 tokens total across all advocate utterances (for definition of $ Y $ below). The earlier exchange between General Prelogar and Justice Alito is a valid chunk.

Formally, we define our unit of analysis for a chunk with Chunk ID $ i $ , having Justice ID $ {j}_i $ , and Advocate ID $ {a}_i $ . For convenience, we often refer to chunk $ i $ with notation $ i\mid j $ to also indicate its justice $ j $ , since our primary analysis is of justice’s interruption behavior within chunks. Although we report aggregate measures, most of our analysis focuses on individual justices since we know each justice has their own background, experience, and implicit or explicit biases. Because our unit of analysis consists of a single advocate, we can condition on specific attributes of the advocate, including their gender and ideology.

Operationalization of variables. One contribution of this paper is precise, interpretable, and careful operationalization of the following variables: gender signal of an advocate ( $ T $ ), ideological alignment of an advocate and justice ( $ A $ ), and token-normalized interruption rates (Y).

Gender signal of an advocate ( $ T $ ). Thus far, we have been relatively loose in our description of the “gender” of an advocate and what it means to define a causal quantity of gender. We formally conceptualize “gender” as the binaryFootnote 4 gender signal given by an advocate’s outward appearance. As previous researchers have discussed at length, we should be wary of estimating the causal effects of “immutable characteristics” such as gender or race (Berk, Li, and Hickman Reference Berk, Li and Hickman2005; Holland Reference Holland2008; Sen and Wasow Reference Sen and Wasow2016). However, we follow previous work that argues that gender signal is a variable we could hypothetically manipulate. For example, we could hire an actor to say the exact same script but flip the gender presentation of the actor. We operationalize the gender signal of advocates through a two-stage process, detailed in the Supplementary material.

Ideological alignment of advocate and justice ( $ A $ ). To capture the ideological alignment of the justice and the advocate, we require measures of advocates’ and justices’ ideological preferences. For the justice, we measure ideology using the average (arithmetic mean) of the justice’s time-varying Martin-Quinn score (Martin and Quinn Reference Martin and Quinn2002), and treat any values less than zero as a liberal justice and values greater than zero as a conservative justice. For the advocate, we rely on the Supreme Court Database’s (Spaeth et al. Reference Spaeth, Epstein, Martin, Segal, Ruger and Benesh2021) coding of the ideological direction of the decision of the Court and whether or not the advocate won; if the advocate won and the direction of the decision is conservative, we indicate the advocate was making a conservative argument to the Court and vice versa for liberal arguments. With a measure of justice ideological proclivities and a measure of the ideological direction of the attorney’s argument, we create our ideological alignment variable for each chunk $ i $ :

(1) $$ {A}_i=\{\begin{array}{ll}1& \mathrm{i}\mathrm{f}\ \mathrm{a}\mathrm{d}\mathrm{v}\mathrm{o}\mathrm{c}\mathrm{a}\mathrm{t}\mathrm{e}\ {a}_i\ \mathrm{a}\mathrm{n}\mathrm{d}\ \mathrm{j}\mathrm{u}\mathrm{s}\mathrm{t}\mathrm{i}\mathrm{c}\mathrm{e}\ {j}_i{}^{\prime}\mathrm{s}\ \mathrm{i}\mathrm{d}\mathrm{e}\mathrm{o}\mathrm{l}\mathrm{o}\mathrm{g}\mathrm{i}\mathrm{c}\mathrm{a}\mathrm{l}\ \mathrm{p}\mathrm{r}\mathrm{e}\mathrm{f}\mathrm{e}\mathrm{r}\mathrm{e}\mathrm{n}\mathrm{c}\mathrm{e}\mathrm{s}\ \mathrm{m}\mathrm{a}\mathrm{t}\mathrm{c}\mathrm{h}\\ {}0& \mathrm{o}\mathrm{t}\mathrm{h}\mathrm{e}\mathrm{r}\mathrm{w}\mathrm{i}\mathrm{s}\mathrm{e}\end{array}\operatorname{} $$

We define ideological alignment for an advocate and justice in the context of a specific case;an advantage of this approach is that attorneys – many of whom appear multiple times before the Court – can be ideologically aligned with a justice in one case but not in another case, as the measure is argument-based.

Token-normalized interruption rates ( $ Y $ ). In creating the outcome variable, $ Y $ , we aim to account for variation in the speaking rates of various advocates. Thus, for a single valid chunk $ i $ , we define the token-normalized interruption rate as

(2) $$ {Y}_{i\mid j}=\frac{\mathrm{number}\ \mathrm{of}\ \mathrm{advocate}\ \mathrm{utterances}\ \mathrm{in}\mathrm{terrupted}\hskip0.32em \mathrm{by}\hskip0.32em \mathrm{justice}\hskip0.32em j\hskip0.32em \mathrm{in}\ \mathrm{chunk}\hskip0.32em i}{\left(\mathrm{number}\ \mathrm{of}\ \mathrm{advocate}\ \mathrm{tokens}\ \mathrm{in}\ \mathrm{chunk}\;i\right)/1000} $$

Intuitively, this represents the following: if an advocate is trying to say 1,000 words during arguments, how many interruptions from a justice would the advocate endure (on average) by the time they got to 1,000 words?

In our data, interruptions are identified deterministically by two different markers in the transcript: double dashes and two dots. Note, these transcripts are manually created and follow standardized protocols for recording interruptions, a fact we leverage to automatically identify all interruptions. Later, we look at the robustness of the transcription of interruptions over the transcript years in Figure 1.

Figure 1. Number of Chunks and Cases per Supreme Court Term. Plots indicate the number of valid chunks (bars) and the number of cases (line) per October Term.

Unit specific quantity. Following Lundberg, Murray, and Stewart (Reference Lundberg, Murray and Stewart2021), we define a counterfactual quantity of interest for a single unit: for a specific justice and chunk, the difference in interruption rates if the advocate was female or male. We use potential outcomes notation (Rubin Reference Rubin1974) in which $ Y\left(T=0\right) $ and $ Y\left(T=1\right) $ represent the potential (counterfactual) outcomes given that the unit is treated, $ T=1 $ , and not treated, $ T=0 $ , respectively. Formally, for a given justice, $ j $ , and chunk, $ i $ , we are interested in the the following unit-specific quantity of interruption rate given counterfactual genders:

(3) $$ {Y}_{i\mid j}\left({T}_i=F\right)-{Y}_{i\mid j}\left({T}_i=M\right) $$

Target population. Lundberg, Murray, and Stewart (Reference Lundberg, Murray and Stewart2021) emphasize that a careful research design must also state a target population, “over whom or what do we aggregate [the] unit-specific quantity?” For the remainder of our analysis, we only include justices who participate in more than 1,000 valid chunks in order to ensure we have enough data to reliably support any research conclusions.Footnote 5 Given this set of valid justices, our target population is all advocates a justice has or would encounter.

Theoretical causal estimand. For a fixed justice, how much would intervening to change the gender signal of the advocate change the rate the justice interrupted the advocate? The target estimand, the average treatment effect for justice $ j $ , is the average of this unit-specific quantity over all advocates and chunks.

(4) $$ {\tau}_{\mathrm{Gender}}^j:= E\left({Y}_{i\mid j}\left({T}_i=F\right)-{Y}_{i\mid j}\left({T}_i=M\right)\right) $$

Causal identification assumptions. In order to estimate causal effects from data, one needs to state the causal identification assumptions that will link estimates from data to the theoretical causal estimand. We state these assumptions explicitly, as many of these assumptions cannot be measured empirically.

Markov assumption for conversational chunks. We make the assumption that the conversational dynamics between a justice and an advocate in one chunk do not influence the conversational dynamics in a subsequent chunk. This is a Markov independence assumption over conversational chunks. We acknowledge this is a strong assumption; however, if we do not make this assumption and instead assume previous interactions influence the current one, we need to account for all the text and interruptions of the previous chunks as potential confounders. In this scenario, we very quickly run into causal positivity violations, $ 0<\mathit{\Pr}\left(T=1|X=x\right)<1\forall x $ , where $ X $ is a variable representing previous conversational dynamics. This would make the causal effects not identified, and we would be unable to reach any conclusions. Thus, we opt for our Markov assumption.

No unmeasured confounding. We assume there are no unmeasured confounding variables that cause both treatment – gender signal – and outcome – interruption rates. See Table A1 of the Supplementary material for one measured confounder, the case topic. See our section on corroborative analyses for evidence results remain the same despite possible threats to this assumption such as speech disfluencies, advocate experience, and ideological alignment as causal mediators between gender signal and interruption.

Empirical estimands. Given the aforementioned causal identification assumptions, we translate the theoretical estimand – Equation 4 – to an empirical estimand that can be estimated from data. We use $ \tau $ to refer to theoretical estimands and $ \theta $ for their corresponding empirical estimands. In these empirical formulations, $ {T}_i $ refers to the observed advocate gender in chunk $ i $ , heuristically inferred from the text as described previously. The empirical estimate of average treatment effect for a given justice is simply the difference in mean interruption rates between female and male advocates with whom the justice interacts during valid chunks,

(5) $$ {\theta}_{\mathrm{Gender}}^j:= \left(\frac{1}{n_{j,{T}_i=F}}\hskip0.5em \sum \limits_{i:{j}_i=j,{\mathbf{T}}_{\mathbf{i}}=\mathbf{F}}{Y}_{i\mid j}\right)-\left(\frac{1}{n_{j,{T}_i=M}}\hskip0.5em \sum \limits_{i:{j}_i=j,{\mathbf{T}}_{\mathbf{i}}=\mathbf{M}}\hskip0.5em {Y}_{i\mid j}\right), $$

where, for example, $ {n}_{j,{T}_i=F} $ indicates the number of chunks for justice $ j $ where advocates’ gender is female.

We can compare this to an analogous empirical quantity about difference due to ideological alignment:

(6)

Then for each justice, we can compare the magnitudes of $ {\theta}_{\mathrm{Gender}}^j $ and $ {\theta}_{\mathrm{Ideological}\ \mathrm{Alignment}}^j $ .

Data

We acquired the Supreme Court Oral Arguments Corpus (Danescu-Niculescu-Mizil et al. Reference Danescu-Niculescu-Mizil, Lee, Pang and Kleinberg2012) from the Cornell Conversational Analysis Toolkit (ConvoKit) (Chang et al. Reference Chang, Chiam, Fu, Wang, Zhang and Danescu-Niculescu-Mizil2020). The corpus covers more than 8,000 oral argument transcripts from more than 7,700 cases decided between 1955 and 2019. Due to limitations related to the reliability and consistency of the transcription, particularly with respect to the identification of the speaking justices, we limit our analysis to 1982 through 2019. The corpus is organized at the level of an utterance. In all, we have 776,193 utterances across 3,424 cases. Of the 4,025 unique advocates, we identify 551 as female, or approximately 14% of all unique advocates. Overall, approximately 25% of all advocate utterances are interrupted, reflecting the extent to which the dialogue is driven by justices.

We segment the oral argument transcript utterances into valid chunks as described in the Unit of Analysis section above. In the bottom half of Table 2, we provide summary statistics on these chunks. We identify 65,768 valid chunks, accounting for more than 23 million tokens spoken across more than 520,000 utterances.

Table 2. Descriptive Statistics of the Corpus and Dataset

As evidence that the chunking approach offers a valid representation of oral argument, in Figure 1 we plot the number of valid chunks we identify each Court term (the bars) alongside the number of cases decided by the Court during that term (the line).Footnote 6 The number of chunks tracks closely with the number of cases over time. Our chunking approach reflects the drop in the number of cases heard by the Court during an average term over time and provides broad and consistent coverage of oral argument during the entire time period under study.

Notably, the rate at which women participate in these exchanges has changed significantly over time. In Figure 2, we plot the number of valid chunks featuring a woman either as an advocate (left panel) or as a justice (right panel). The participation of women as advocates increased marginally from the 1980s until about the turn of the century, but has generally leveled out to an average of around 12% of arguments more recently.

Figure 2. Participation of Women in Oral Argument Exchanges on the U.S. Supreme Court. These plots provide the proportion of exchange chunks featuring a female advocate (left panel) or female justice (right panel) by October Term.

On the other hand, the proportion of chunks featuring a female justice has shifted consistently, and unsurprisingly, with changes in the membership of the Court. The right panel of Figure 2 makes plain the shifts associated with the addition or subtraction of female justices. Indeed, the shifts are approximately proportional to the gender composition of the Court itself during each October Term, reflecting the extent to which the chunking approach validly captures the dynamics of oral argument on the Court.

Measuring Interruption Behavior

With the data in hand, we turn to measuring interruptions. We analyze the token-normalized interruption rate, where interruption is defined as the justice interrupting an advocate. By way of interpretation, a rate of 4.0 would indicate that for every 1,000 tokens of speech for the advocate, we would expect them to be interrupted four times.

In Figure 3, we plot over time the Court-level average of the token-normalized interruption rate (blue) and also, for comparison purposes, plot the token-normalized rate of justices being interrupted by advocates (red). Two dynamics bear mentioning and again speak to the validity of our measurement approach. First, there is a marked increase in the token-normalized interruption rate of advocates in the mid-1990s. This dynamic is consistent with Jacobi and Sag (Reference Jacobi and Sag2019) who identified this as the point justice behavior began to reflect the polarization of the political system. Second, the increase persists from that point forward. Indeed, the lowest rate during the 25-year span after the 1994 term is 9.9 in OT2001, an interruption rate greater than any of the preceding years. As the Court’s bar expanded to include marginally more female advocates, the rate of interruptions likewise increased. Of course, this also coincided with changes in the ideological composition of the Court as well as the gender composition of the Court, both factors that may contribute in different ways to the gendered differences in observed interruption behavior. In all, this further underscores the necessity of a careful causal research design to understand and disentangle the complex causal story.

Figure 3. Interruption Rate Over Time. This plot presents the Court-level average of token-normalized interruption rate (y-axis) by October Term (x-axis). The blue line is the advocate being interrupted by justices and the red line is justices being interrupted by advocates.

The Causal Effect of Gender on Interruptions

We begin with an analysis of the rates that justices interrupt male and female advocates differently.Footnote 7 In Table 3, we present the interruption effect of gender (column 2) and, for comparison, the interruption effect of ideological alignment (column 3). The rows calculate the interruption effects across different levels of justice aggregates: tabulatedFootnote 8 across all justices, across only male justices, and across only female justices. Recall, $ {\theta}_{\mathrm{Gender}} $ is equal to $ E\left[Y|\mathrm{Gender}=\mathrm{Female}\right]-E\left[Y|\mathrm{Gender}=\mathrm{Male}\right] $ , such that positive values indicate higher interruption rates for female advocates, and negative values indicate higher interruption rates for male advocates.

Table 3. Effects on advocate interruption rate, aggregated by justice gender. Positive $ {\theta}_{\mathrm{Gender}} $ indicates justices interrupt female advocates more often than they interrupt male advocates. Negative $ {\theta}_{\mathrm{Gender}} $ indicates justices interrupt ideologically opposed advocates more often than they interrupt ideologically aligned advocates. Here ± indicates the 95% confidence interval via bootstrap sampling of valid chunks. The last column shows the absolute ratio of point estimates to aid interpretation.

Across all justices, female advocates are interrupted more frequently, with a point estimate of 0.89 indicating they are interrupted on average nearly one additional time per 1,000 words in comparison to male advocates. The dynamic is more striking when the results are disaggregated by the gender of the justice; male justices interrupt female advocates more than one additional time per 1,000 tokens. In comparison, though female justices are also more likely to interrupt female advocates than male advocates, the effect size (0.43) is far smaller, well less than half the magnitude of the effect among male justices.Footnote 9

The substantive magnitude of the effect is made clear by a comparison to the effect of ideological alignment on interruption behavior. Across all justices, and particularly for male justices, the gender effect on interruption behavior dwarfs the effect of ideological alignment. Among all justices, the effect of gender is approximately 3.5 times greater than the effect of ideological alignment. Among just the male justices, though, the effect is well more than five times greater than the effect of ideological alignment. In contrast, the effect among female justices is nearly identical to the effect of ideological alignment. In all, we have clear evidence that – in the aggregate – advocate gender influences the interruption behavior of Supreme Court justices at oral argument, and the effect is largely concentrated among male justices.

Justice-Level Analysis

As opposed to analyzing aggregate rates of interruptions, justice-level analysis not only offers a more nuanced understanding of the intricate interpersonal dynamics at play during oral arguments but also offers a far superior approach to causal estimation. By shifting our focus to the individual justices, we gain a sharper lens through which to identify differences in individual behavior that collectively contribute to the observed aggregate results discussed above. In doing so, our individual-level approach acknowledges that aggregate rates are not homogenous entities but rather composite outcomes shaped by the diverse styles, preferences, and biases of each justice. An individual-level approach disentangles these multifaceted influences, empowering us to better understand all of the factors that drive a justice’s interruption behavior. Such an approach is therefore pivotal for unraveling the causal impact of gender on interruption behavior on the Court.

Our analytical framework entails calculating the effects of advocate gender and ideological alignment on the interruption rate at the level of the justice based on the same rigorous estimation approach outlined above, analyzing effects by focusing on justice-specific chunk subsets. This methodological approach is distinctive in its ability to dissect justice-specific interactions, allowing us to disentangle the unique behavioral patterns of each justice.

We present the results of the justice-level analyses in Figure 4. We order the justices along the y-axis, in descending order by $ {\theta}_{\mathrm{Gender}} $ , and indicate for each justice the ideological classification on the basis of the average Martin & Quinn scores (where L is Liberal and C is Conservative) along with the total number of valid chunks our estimates are based on for that justice (in parentheses).Footnote 10 The far left panel presents the rate justices are interrupting advocates in all valid chunks for which they participate, the center panel presents the gender effect on interruption rates (where higher values indicate the justice is more likely to interrupt a female advocate), and the right panel presents the effect of ideological alignment on the justice’s interruption behavior (where higher values indicate the justice is more likely to interrupt an advocate whose ideology aligns with the justice).

Figure 4. Plot of Justice Interruption Rates, Effect of Gender, and Effect of Ideological Alignment. Justice-specific estimates (points) and boostrapped confidence intervals (line segments) for justice interruption rate (left panel), gender effect (center panel), and ideological alignment (right panel). Higher values indicate more interruptions. A table of these results appears in the Supplementary material.

Four justices stand out in the analysis for the substantive and statistical significance of the gender effect in their exchanges with advocates at oral argument: Sandra Day O’Connor, John Roberts, Anthony Kennedy, and Antonin Scalia. The effects among this group range from 1.65 (for Justice Scalia) to 2.18 (for Justice O’Connor). For these two justices, the results indicate that for every 1,000 tokens spoken by a female advocate, relative to a male advocate she will be interrupted 1.65 times more frequently in exchanges with Justice Scalia, and 2.18 times more frequently by Justice O’Connor. For each, the gender effect is more than twice the magnitude of the ideological alignment effect.

However, only four justices out of the fifteen in our study interrupt male advocates more frequently than female advocates at any magnitude, and for each, the 95% confidence intervals cover zero. Therefore, we cannot confidently say, based on this data, that any justice interrupts male advocates more frequently in comparison to female advocates.

Another notable dynamic emerging is the extent to which the gender effect covers nearly the entirety of the ideological elements of the Court’s composition. Three of the five most liberal justices in our analysis interrupt male advocates more frequently than female advocates, though – to reiterate – the confidence intervals for each cross zero. To illustrate justices’ ideological preferences more clearly, in Figure 5, we plot the justice-level gender effects by the justice’s Martin & Quinn ideology score. First, seven of the eight conservative justices interrupt female advocates more frequently than male advocates, with Burger being the only exception. Among these, the effect sizes we identify are statistically significant at the 95% level for five of the eight. The largest point estimates are concentrated among the more moderate conservatives (Kennedy, O’Connor, and Roberts); interestingly, each was also often portrayed as the median justice during at least part of their tenure on the Court.

Figure 5. Justice Gender Effects $ {\theta}_{\mathrm{Gender}}^j $ (y-axis) by Martin & Quinn Ideology Scores (x-axis). Gender effects are calculated as in Figure 4.

Among the more liberal justices, two patterns bear mentioning. First, uncertainty around the point estimates in most of the cases cautions against firm conclusions of a gender effect. Of the seven justices rated as liberal, only two – Justices Ginsburg and Souter – demonstrate a statistically significant effect of advocate gender on their interruption behavior; in both cases, the justice interrupts female advocates more frequently than male advocates, and at rates generally commensurate with the group average among their more conservative colleagues.

Second, three liberal justices – Sonia Sotomayor, Thurgood Marshall, and Elena Kagan – interrupt men more frequently than they do women within our data. This group is particularly interesting in that it contains the two female justices (Sotomayor and Kagan) who have most recently joined the Court within our timeframe.Footnote 11 While O’Connor and Ginsburg were trailblazing women in the legal profession, within our data, both are more likely to interrupt female advocates than male advocates. In contrast, Sotomayor – who joined the Court in 2009 – and Kagan – who joined in 2010 – are more likely to interrupt male advocates. Taken together, we have clear evidence of a gender effect on interruption behavior that tracks across most justices on the Court.

Corroboration: Causal Mediation Analysis

The analysis above begs further corroboration to support the claim that the estimated effects relate to gender specifically. For instance, one might be concerned that the observed gender effect is not a function specifically of gender, but is instead a function of differences in the ideological orientations of male and female advocates and the members of the Court. Moreover, one might question whether the broader professionalization biases faced by women in the legal profession yields a qualitatively different pool of advocates, leading to differences in interruption behavior. In both cases, one might worry that the observed gender effect does not capture the causal effect of gender on interruption behavior, but rather that the observed effect of gender operates through ideological differences or differences in advocates’ speech quality. To address this, we turn to a causal mediation analysis (Pearl Reference Pearl2001; Imai, Keele, and Tingley Reference Imai, Keele and Tingley2010; VanderWeele Reference VanderWeele2016).

Our analysis focuses on two pathways through which the gender effect may operate: differences in the ideological orientation of advocates and justices, and differences in the quality of arguments. In the former case, as discussed above, professionalization and prospects for career progression may be different for women representing particular types of arguments and cases before the Court, leading to different patterns of representation at the Court for particular arguments that may be more or less likely to yield interruption by justices. If female advocates are more likely to represent liberal arguments during the period under study while the Court is consistently composed of a majority of conservative justices, the effect of gender may operate through the ideological differences between advocate and justice. We maintain the same approach to measuring ideological alignment and incorporate it now through a causal mediation approach.

Further, we analyze two dimensions related to argument quality: speaking fluidity and advocate experience. Consider first speaking fluidity, or the extent to which an advocate can seamlessly present their argument. At a basic level, if an advocate is proceeding seamlessly through their argument, the justice must be more aggressive to interrupt; alternatively, if an advocate is stumbling over their words, there are more opportunities for an interruption. If we expect differences in the presentation styles of women and men at the Court, whether because of differences in the legal profession or because of the efforts of advocates to conform with gender schemas (e.g., Gleason and Smart Reference Gleason and Smart2022), then we would expect the effect of gender to operate through differences in oral argument quality.

To capture speaking fluidity, we turn to counts of speech disfluencies, or breaks in the flow of a speech. Though there are many types of speech disfluencies, one of the most visible is stuttering or stopping and starting in order to repeat words or phrases. In oral argument transcripts, these types of speech disfluencies are indicated by the presence of a dash followed by a repeat of the word or phrase that had preceded the dash. We automatically extracted these at the utterance level. In the process of doing so, we found that such disfluencies are unreliably indicated in some years of the digital versions of the transcripts, and therefore, we limit our analysis to the years 2007 through 2019.Footnote 12

Beyond speaking fluidity, we also address advocate experience as a variable on the causal path between advocate gender and interruption. Again, given differences in the participation of women as advocates before the Court, women advocates may have less experience on average. If justices are more likely to interrupt advocates who are inexperienced, then the effect of gender would operate through an experience path. To capture experience, we employ a dichotomous indicator of prior experience arguing before the Court. The first time an advocate argues before the Court, the indicator takes on the value of 0. For any additional arguments, the indicator takes on the value of 1. Though our causal mediation analysis starts in 2007, we search for prior advocate experience back to 1980 to ensure full coverage.

With the updated data, we undertake a causal mediation analysis, including speech disfluencies, experience, and ideological alignment as mediators. We calculate the natural direct effects (NDE) and natural indirect effects (NIE) via the assumptions and formulas presented in Keith, Rice, and O’Connor (Reference Keith, Rice and O’Connor2021); these estimates are intentionally non-parametric (i.e., not based on the assumptions of linear regression) and simplify to the conditional means from the data. We report the results in Table 4. We find all the natural indirect effects are near zero, indicating that the effect of gender directly to interruption is the main driving force for the differences in interruption we observe.

Table 4. Corroborative analyses. Causal mediation estimates aggregated across justices of the natural direct effect (NDE) from gender to interruption and the natural indirect effect (NIE) from gender to interruption through one of the following mediators: speech disfluencies, ideological alignment, or advocate experience. The data is from 2007-2019 and $ n $ is the number of chunks.

Discussion

The U.S. Supreme Court, currently characterized by a conservative supermajority, is at the center of battles over power and politics in the United States. This article tackles a fundamental question regarding the Court’s business: can men and women advocates participate equally? We address the complex causal challenges involved in this question by disentangling the effect of an advocate’s signaled gender from other legal, political, and social factors affecting representation before the Court. Our approach elucidates the gender effects on justice interruption behavior and allows us to examine differentials in interruption behavior at the level of the individual justice, providing insights on the behavioral underpinnings of bias in interruption behavior at the Court.

We find a pronounced gender effect, with most justices interrupting female advocates more frequently than male advocates during oral arguments. This effect, which significantly outpaces other potential explanations of interruption behavior, scales with the length of oral arguments, revealing a substantial impact over cases and terms. That said, the dynamic reverses in areas traditionally thought of as women’s issues, including the right to privacy. Moreover, we find that conservative justices exhibit the bias more strongly, while some liberal justices tend to interrupt male advocates more frequently. In all, the work conclusively demonstrates that conversational dynamics on the nation’s highest court are significantly shaped by the genders of the lawyers appearing before the Court.

The results have stark implications going forward. The concentration of gendered interruption effects among conservative justices, coupled with the probability of a conservative super-majority going forward, indicates a major challenge for gender equity before the Court going forward. Given that more recent liberal additions to the Court have shown a tendency to operate in the opposite direction, interrupting male advocates more frequently than female advocates, future research should build from prior work (e.g., Haire and Moyer Reference Haire and Moyer2015) to examine how the Court’s polarization affects advocates ability to participate equitably. Moreover, while we have considered interruptions generally, we have not explored whether the interruptions were to ask questions or to make statements, though the different types are associated with different probabilities of winning or losing the case (Sorenson Reference Sorenson2023). Therefore, future work could build to better understand how the types of interruptions vary across the advocate gender by each justice.

On this, our analyses highlight significant differences in interruption behavior among female justices. Trailblazers like Justice O’Connor and, to a lesser extent, Justice Ginsburg frequently interrupted female attorneys, contrary to more recent appointees Justice Sotomayor and Justice Kagan. This indicates generational shifts and evolving norms within the Court (see, e.g., Haire and Moyer Reference Haire and Moyer2015). With two more recent female appointees, analysis of these generational differences going forward could provide deeper insights into the intersection of gender and judicial behavior at oral argument. Such longitudinal studies can also better capture how societal attitudes relate to changes in behavior at the Court.

Beyond the justices, future work might also consider how different advocacy styles and strategies influence the interruption behavior of the justices, considering for instance the extent to which advocates behaving in particular ways might discourage or encourage interruptions from particular justices (Gleason and Smart Reference Gleason and Smart2022). Likewise, our work has not considered other intersectional disadvantages that attorneys might face, including how race, ethnicity, or other social identities might intersect with gender to influence interruption behavior.

On these above opportunities, our approach is generalizable and, along with our findings, offers clear pathways for such future research, including how recent appointments have changed these dynamics and how the changing composition of the federal judiciary relates to the dynamics observed at the U.S. Supreme Court. The approach is also generalizable beyond the United States, and future comparative work, both in the context of the states and internationally, could contextualize whether the dynamics we observe are more general or are specific to particular places and courts. Likewise, the approach generalizes to other forms of marginalization in the courtroom beyond interruptions – for instance, changes in the tone (Rice and Zorn Reference Rice and Zorn2021) or vocal pitch (Dietrich, Enos, and Sen Reference Dietrich, Enos and Sen2019b) of justices in their interactions, and other potential mediators (Goelzhauser, Kassow, and Rice Reference Goelzhauser, Kassow and Rice2021). Throughout these avenues for future work, our analysis suggests that to understand the U.S. Supreme Court, it is vitally important to understand the ways justice behavior is shaped by gender dynamics.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/jlc.2024.7.

Acknowledgement

Authors are in alphabetical order by last name. Authors contributed as follows: implementation of data pipeline by EC, AG, KK, DR; research design by KK, BO, DR; first draft writing of manuscript by KK, DR; editing manuscript by EC, AG, KK, BO, DR. The authors thank Mohit Iyyer, David Jensen, Paul Musgrave, Jesse Rhodes, Kelsey Shoub, Donny Snyder, members of the American Politics Working Group at UMass Amherst, and conference participants at the 2021 Causal Inference + NLP Workshop and 2022 New Directions in Analyzing Text as Data conference for comments and suggestions on earlier versions of this work. All data and materials necessary to replicate the results reported in this article are available on the Journal of Law & Courts Dataverse and at https://github.com/kakeith/interruptions-supreme-court.

Footnotes

1 Adam Liptak, “Supreme Court Tries to Tame Unruly Oral Arguments”, New York Times, November 1, 2021.

2 Dan Berman and Ariane de Vogue, “Supreme Court gives lawyers 2 minutes with no interruptions,” CNN, October 3, 2019.

3 Our approach deviates from that of Miller and Sutherland (Reference Miller and Sutherland2022) as we define chunks as exchanges between justice and advocate lasting at least four turns, whereas they define a chunk as an individual turn, or what we are calling an utterance.

4 While we understand there are more than two genders, the U.S. Supreme Court never introduces an advocate with an explicitly non-binary gender title like “Mx.”, and thus we restrict our analysis to binary gender.

5 Based on our filtering criteria, we considered fifteen justices: Anthony Kennedy, Antonin Scalia, Byron White, David Souter, Elena Kagan, John Roberts Jr., John Paul Stevens, Ruth Bader Ginsburg, Samuel Alito Jr., Sandra Day O’Connor, Sonia Sotomayor, Stephen Breyer, Thurgood Marshall, Warren Burger, and William Rehnquist.

6 The consistency of the chunking approach over time provides evidence that transcription issues related to identifying justices and advocates are not frequent during the period of analysis, as our chunking approach relies on identifying justice and attorney turns in specifying valid chunks.

7 In the supplemental material, we explore differences in interruption behavior between gendered and non-gendered issues, demonstrating that in women’s issue cases –which cover approximately 2.4% of chunks- – women are interrupted less frequently. Because they represent a unique dynamic and comprise such a small number of cases and chunks, we exclude these from the analyses below.

8 This is a simple mean over the per-justice effects, $ {\theta}_{\mathrm{Gender}}=\frac{1}{N}{\sum}_j{\theta}_{\mathrm{Gender}}^j $ and $ {\theta}_{\mathrm{Ideological}\ \mathrm{Alignment}}=\frac{1}{N}{\sum}_j{\theta}_{\mathrm{Ideological}\ \mathrm{Alignment}}^j $

9 In the supplemental material, we address and reject the possibility that differences in the rates of advocates interrupting justices may lead to different conversational dynamics and influence the differences in rates of justices interrupting advocates. We also address concerns that the results are driven by short, back-channel communications (“OK”, “Right”). The results are consistent when excluding these back-channel communications.

10 Confidence intervals are inferred using a non-parametric bootstrap, detailed in the Supplementary material.

11 Justices Amy Coney Barrett and Ketanji Brown Jackson joined more recently and offer an avenue for future exploration.

12 This analysis includes ten judges who were involved in greater than 1,000 chunks: Anthony Kennedy, Antonin Scalia, David H. Souter, Elena Kagan, John Paul Stevens, John Roberts Jr., Ruth Bader Ginsburg, Samuel Alito Jr., Sonia Sotomayor, and Stephen Breyer.

References

Barnes, Tiffany. 2016. Gendering legislative behavior: institutional constraints and collaboration. CambridgeUniversity Press.CrossRefGoogle Scholar
Berk, Richard, Li, Azusa, and Hickman, Laura J.. 2005. Statistical difficulties in determining the role of race in capital cases: a re-analysis of data from the state of maryland. Journal of Quantitative Criminology 21 (4): 365390.CrossRefGoogle Scholar
Black, Ryan, Treul, Sarah, Johnson, Timothy, and Goldman, Jerry. 2011. Emotions, oral arguments, and supreme court decision making. The Journal of Politics 73 (2): 572581.CrossRefGoogle Scholar
Black, Ryan C., Johnson, Timothy R., and Wedeking, Justin. 2012. Oral arguments and coalition formation on the u.s. supreme court: a deliberate dialogue. University of Michigan Press.Google Scholar
Boyd, Christina L., Epstein, Lee, and Martin, Andrew D.. 2010. Untangling the causal effects of sex on judging. American Journal of Political Science 54:389411.CrossRefGoogle Scholar
Chang, Jonathan P, Chiam, Caleb, Fu, Liye, Wang, Andrew Z, Zhang, Justine, and Danescu-Niculescu-Mizil, Cristian. 2020. Convokit: a toolkit for the analysis of conversations. Proceedings of SIGDIAL.CrossRefGoogle Scholar
Collins, Todd, and Moyer, Laura. 2008. Gender, race, and intersectionality on the federal appellate bench. Political Research Quarterly 62 (2): 219227.CrossRefGoogle Scholar
Danescu-Niculescu-Mizil, Cristian, Lee, Lillian, Pang, Bo, and Kleinberg, Jon. 2012. Echoes of power: language effects and power differences in social interaction. In Www.CrossRefGoogle Scholar
Dietrich, Bryce J., Enos, Ryan D., and Sen, Maya. 2019a. Emotional arousal predicts voting on the u.s. supreme court. Political Analysis 27 (2): 237243. https://doi.org/10.1017/pan.2018.47.CrossRefGoogle Scholar
Dietrich, Bryce J., Enos, Ryan D., and Sen, Maya. 2019b. Emotional arousal predicts voting on the u.s. supreme court. Political Analysis 27 (2): 237243.CrossRefGoogle Scholar
Feldman, Adam, and Gill, Rebecca D.. 2019. Power dynamics in supreme court oral arguments. The Justice System Journal 40 (3): 173195.CrossRefGoogle Scholar
Gleason, Shane A., and Smart, EmiLee. 2022. You think; therefore i am: gender schemas and context in oral arguments at the supreme court, 1979–2016. Political Research Quarterly.CrossRefGoogle Scholar
Goelzhauser, Greg, Kassow, Benjamin J., and Rice, Douglas. 2021. Measuring Supreme Court Case Complexity. The Journal of Law, Economics, and Organization 38, no. 1 (January): 92118.CrossRefGoogle Scholar
Gramlich, John. 2021. How trump compares with other recent presidents in appointing federal judges. Pew Research Center.Google Scholar
Hack, Jonathan S., and Jenkins, Clinton M.. 2022. The attorneys’ gender: exploring counsel success before the u.s. supreme court. Political Research Quarterly 75 (3): 632645.CrossRefGoogle Scholar
Haire, Susan, and Moyer, Laura. 2015. Diversity matters: judicial policy making in the u.s. courts of appeals. University of Virginia Press.Google Scholar
Hancock, Adrienne B., and Rubin, Benjamin A.. 2015. Influence of communication partner’s gender on language. Journal of Language and Social Psychology 34 (1): 4664.CrossRefGoogle Scholar
Holland, Paul W. 2008. Causation and race. ,White logic, white methods: Racism and methodology 93109.Google Scholar
Imai, Kosuke, Keele, Luke, and Tingley, Dustin. 2010. A general approach to causal mediation analysis. Psychological methods 15 (4): 309.CrossRefGoogle ScholarPubMed
Jacobi, Tonja, and Rozema, Kyle. 2018. Judicial conflicts and voting agreement: evidence from interruptions at oral argument. Boston College Law Review 59.Google Scholar
Jacobi, Tonja, and Sag, Matthew. 2019. The new oral argument: justices as advocates. Notre Dame Law Review 94.Google Scholar
Jacobi, Tonja, and Schweers, Dylan. 2017. Justice, interrupted: the effect of gender, ideology, and seniority at supreme court oral arguments. Virginia Law Review 103 (7): 13791496.Google Scholar
Johnson, Timothy R. 2001. Information, oral arguments, and supreme court decision making. American Politics Research 29:331351.CrossRefGoogle Scholar
Johnson, Timothy R. 2004. Oral arguments and decision making on the u.s. supreme court. SUNY Press.Google Scholar
Johnson, Timothy R., Wahlbeck, Paul J., and Spriggs, James F.. 2006. The influence of oral arguments on the u.s. supreme court. American Political Science Review 100 (1): 99113.CrossRefGoogle Scholar
Kanthak, Kristin, and Krause, George A.. 2012. The diversity paradox: political parties, legislatures, and the organizational foundations of representation in america. Oxford University Press.Google Scholar
Karpowitz, Christopher F., and Mendelberg, Tali. 2014. The silent sex: gender, deliberation, & institutions. Princeton University Press.Google Scholar
Keith, Katherine, Rice, Douglas, and O’Connor, Brendan. 2021. Text as causal mediators: research design for causal estimates of differential treatment of social groups via language aspects. In Proceedings of the first workshop on causal inference and nlp, 2132.CrossRefGoogle Scholar
Lazarus, Jeffrey, and Steigerwalt, Amy. 2018. Gendered vulnerability: how women work harder to stay in office. University of Michigan Press.CrossRefGoogle Scholar
Lindom, Tiffany, Gregory, Charles, and Johnson, Timothy R.. 2017. Gender dynamics and supreme court oral arguments. Michigan State Law Review, no. 5, 10331055.Google Scholar
Lundberg, Ian, Murray, Rebecca, and Stewart, Brandon M. 2021. What is your estimand? defining the target quantity connects statistical evidence to theory. American Sociological Review 86 (3): 532565.CrossRefGoogle Scholar
Martin, Andrew, and Quinn, Kevin. 2002. Dynamic ideal point estimation via markov chain monte carlo for the u.s. supreme court, 1953–1999. Political Analysis 10:134153.CrossRefGoogle Scholar
Mendelberg, Tali, and Karpowitz, Christopher F.. 2016. Women’s authority in political decision-making groups. The Leadership Quarterly 27 (3): 487503.CrossRefGoogle Scholar
Miller, Michael G., and Sutherland, Joseph L.. 2022. The effect of gender on interruptions at congressional hearings. American Political Science Review.Google Scholar
Morgan, Stephen L., and Winship, Christopher. 2015. Counterfactuals and causal inference. Cambridge University Press.Google Scholar
Patton, Dana, and Smith, Joseph. 2017. “lawyer, interrupted: gender bias in oral arguments at the u.s. supreme court”. Journal of Law and Courts, 337361.CrossRefGoogle Scholar
Patton, Dana, and Smith, Joseph L.. 2020. Gender, ideology, and dominance in supreme court oral arguments. Journal of Women, Politics, & Policy 41 (4): 393415.CrossRefGoogle Scholar
Pearl, Judea. 2001. Direct and indirect effects. In ,Proceedings of the seventeenth conference on uncertainty in artificial intelligence 411420.Google Scholar
Rice, Douglas R., and Zorn, Christopher. 2021. Corpus-based dictionaries for sentiment analysis of specialized vocabularies. Political Science Research and Methods 9 (1): 2035.CrossRefGoogle Scholar
Ringsmuth, Eve M., Bryan, Amanda C., and Johnson, Timothy R.. 2013. Voting fluidity and oral argument on the u.s. supreme court. Political Research Quarterly 66 (2): 426437.CrossRefGoogle Scholar
Roberts, John G. 2005. Oral advocacy and the re-emergence of a supreme court bar. Journal of Supreme Court History 30 (1): 6881.CrossRefGoogle Scholar
Rubin, Donald B. 1974. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology 66 (5): 688.CrossRefGoogle Scholar
Sen, Maya, and Wasow, Omar. 2016. Race as a bundle of sticks: designs that estimate effects of seemingly immutable characteristics. Annual Review of Political Science 19:499522.CrossRefGoogle Scholar
Sorenson, Mason W. 2023. Asking versus telling: the supreme court’s strategic use of questions and statements during oral arguments. Political Research Quarterly 76 (4): 15591572.CrossRefGoogle Scholar
Spaeth, Harold, Epstein, Lee, Martin, Andrew, Segal, Jeffrey, Ruger, Theodore, and Benesh, Sara. 2021. The supreme court database. http://Supremecourtdatabase.org.Google Scholar
St. Eve, Amy J., and Luguri, Jamie B.. 2021. How unappealing: an empirical analysis of the gender gap among appellate attorneys. Technical report. American Bar Association.Google Scholar
Sullivan, Barry, and Canty, Megan. 2015. Interruptions in search of a purpose: oral argument in the supreme court, october terms 1958-60 and 2010-12. Utah Law Review 2015 (5).Google Scholar
Szmer, John, Kaheny, Erin B., Sarver, Tammy A., and DeCamillis, Mason. 2013. The impact of lawyer gender on decision making in the united states courts of appeals. Journal of Women, Politics and Policy 34 (1): 72100.CrossRefGoogle Scholar
Szmer, John J., Sarver, Tammy A., and Kaheny, Erin B.. 2010. Have we come a long way, baby? the influence of lawyer gender on supreme court decision making. Politics and Gender 6:136.CrossRefGoogle Scholar
VanderWeele, Tyler J. 2016. Mediation analysis: a practitioner’s guide. Annual Review of Public Health 37:1732.CrossRefGoogle ScholarPubMed
Zimmermann, Don H., and West, Candace. 1996. Sex roles, interruptions, and silences in conversation. In Amsterdam studies in the theory and history of linguistic science, 211–36.Google Scholar
Zorn, Christopher, and Bowie, Jennifer. 2010. Ideological influences on decision making in the federal judicial hierarchy: an empirical assessment. Journal of Politics 72 (4): 12121221.CrossRefGoogle Scholar
Figure 0

Table 1. Summary of Prior Work Examining Interruption Behavior During U.S. Supreme Court Oral Argument. We compare this to the contribution of this work in the final line in which Justices are interrupting Advocates (Justice$ \to $Advocate).

Figure 1

Figure 1. Number of Chunks and Cases per Supreme Court Term. Plots indicate the number of valid chunks (bars) and the number of cases (line) per October Term.

Figure 2

Table 2. Descriptive Statistics of the Corpus and Dataset

Figure 3

Figure 2. Participation of Women in Oral Argument Exchanges on the U.S. Supreme Court. These plots provide the proportion of exchange chunks featuring a female advocate (left panel) or female justice (right panel) by October Term.

Figure 4

Figure 3. Interruption Rate Over Time. This plot presents the Court-level average of token-normalized interruption rate (y-axis) by October Term (x-axis). The blue line is the advocate being interrupted by justices and the red line is justices being interrupted by advocates.

Figure 5

Table 3. Effects on advocate interruption rate, aggregated by justice gender. Positive $ {\theta}_{\mathrm{Gender}} $ indicates justices interrupt female advocates more often than they interrupt male advocates. Negative $ {\theta}_{\mathrm{Gender}} $ indicates justices interrupt ideologically opposed advocates more often than they interrupt ideologically aligned advocates. Here ± indicates the 95% confidence interval via bootstrap sampling of valid chunks. The last column shows the absolute ratio of point estimates to aid interpretation.

Figure 6

Figure 4. Plot of Justice Interruption Rates, Effect of Gender, and Effect of Ideological Alignment. Justice-specific estimates (points) and boostrapped confidence intervals (line segments) for justice interruption rate (left panel), gender effect (center panel), and ideological alignment (right panel). Higher values indicate more interruptions. A table of these results appears in the Supplementary material.

Figure 7

Figure 5. Justice Gender Effects $ {\theta}_{\mathrm{Gender}}^j $ (y-axis) by Martin & Quinn Ideology Scores (x-axis). Gender effects are calculated as in Figure 4.

Figure 8

Table 4. Corroborative analyses. Causal mediation estimates aggregated across justices of the natural direct effect (NDE) from gender to interruption and the natural indirect effect (NIE) from gender to interruption through one of the following mediators: speech disfluencies, ideological alignment, or advocate experience. The data is from 2007-2019 and $ n $ is the number of chunks.

Supplementary material: File

Cai et al. supplementary material

Cai et al. supplementary material
Download Cai et al. supplementary material(File)
File 437.1 KB