Category Archives: Law and corpus linguistics

The BYU Law corpora (updated)

Posted on May 6, 2018 | Leave a comment

[Cross-posted at Language Log.]

I’d imagine that most people who’ve been actively involved with corpus linguistics are familiar with the BYU corpora—a collection of web-accessible corpora created by Brigham Young University linguistics professor Mark Davies. These corpora (and BYU’s corpus-linguistics program more generally) have played an essential part in the development of what I’ll call the corpus-linguistic turn in legal interpretation. The BYU corpora served as my entry-point into corpus linguistics, and they have provided the corpus data that has been used in most of the law-and-corpus-linguistics work that has been done to date. And beyond that, the BYU Law School has played an enormous role, in a variety of ways, in Law and Corpus Linguistics becoming a thing.

One of the things that the law school has been doing has been happening largely behind the scenes. For the past two or three years, people there have been developing the Corpus of Founding Era American English (COFEA)—a historical corpus that is intended as resource for studying language usage in the time leading up to the drafting and ratification of the U.S. Constitution. At this year’s conference on law and corpus linguistics (the third such conference, all of them hosted by the BYU Law School), we were given a preview of COFEA. And via a tweet by the law school’s dean, Gordon Smith, I’ve now learned that a beta version of COFEA is up and available for public playing-around-with, as are beta versions of two other corpora: the Corpus of Early Modern English and the Corpus of Supreme Court of the United States.

Continue reading →

Leave a comment

Posted in Corpus linguistics & constitutional interpretation, Law and corpus linguistics, Law and linguistics

Lucia v. SEC: Corpus linguistics and originalism

Posted on April 17, 2018 | Leave a comment

Over about the past year, there’s been a significant increase in the attention being paid to the idea of using corpus linguistics in legal interpretation. One of the most recent developments has occurred in a case that will be argued next week in the Supreme Court, in which two of the amicus briefs rely on corpus linguistics (Brief of Scholars of Corpus Linguistics; Brief of Prof. Jennifer L. Mascott).

The case in question is Lucia v. Securities and Exchange Commission, and it raises the question whether federal Administrative Law Judges are “officers of the United States” within the meaning of the Appointments Clause of the Constitution. This is the first of what will be two or three posts that are prompted by the filing of these briefs. However, none of the posts will deal with the substance of the legal or linguistic issues in the case.

Lucia is the first Supreme Court case I’m aware of in which anyone has relied on corpus analysis since FCC v. AT&T, Inc., in which I filed an amicus brief that was largely corpus-based. It’s also as far as I know the only case in any court where corpus analysis has been used in a brief in connection with an issue of constitutional interpretation.

Continue reading →

Leave a comment

Posted in -isms of interpretation, Intentionalism, Law and corpus linguistics, Law and linguistics, Lucia v. SEC: Corpus linguistics and originalism, Originalism

“Empirical” doesn’t necessarily mean “definitively verifiable”

Posted on April 2, 2018 | Leave a comment

Carissa Hessick and I have been debating the appropriateness of using empirical methods in legal interpretation. The debate began on PrawfsBlawg, then moved over here (with some continued discussion at Prawfs), and then spread to Twitter. The relevant tweets are collected in my previous post, and in this post I’ll respond to Hessick’s most recent points.

As I understand her, Hessick contends that the issue of ordinary meaning isn’t an “empirical question” because the question of how a reasonable person would understand the text is inherently qualitative rather than quantitative, and therefore can’t be answered in a way that is “provable or verifiable.” I accept Hessick’s characterization of the ordinary-meaning issue as being qualitative rather than quantitative, but it doesn’t follow that quantitative information is always irrelevant.

Continue reading →

Leave a comment

Posted in Corpus linguistics & lexicography, Corpus linguistics and statutory interpretation, Hessick, Law and corpus linguistics, Law and linguistics

Corpus linguistics and empiricism: A Twitter exchange

Posted on March 24, 2018 | Leave a comment

My last post, Corpus linguistics: Empiricism and frequency, prompted a Twitter exchange between Carissa Hessick and me, a lightly edited version of which I present here.

Hessick:

One question based on my quick read: Do you think most people would understand “relying on linguistic intuition” to be an empirical undertaking? I appreciate the insight into how people’s linguistic intuitions are formed. But don’t most people think that, if something is an empirical question, that means there is a demonstrably correct answer?

And if we often have different intuitions about what a word means (as the split decisions on ordinary meaning illustrate), and if judges resolve the Q of ordinary meaning by consulting their own intuitions, then how can ordinary meaning be an empirical Q? If I have one intuition and you have another, then how to we demonstrate which is correct and which is incorrect?

Me: Continue reading →

Leave a comment

Posted in Corpus linguistics & lexicography, Corpus linguistics and statutory interpretation, Hessick, Law and corpus linguistics, Law and linguistics

Corpus linguistics: Empiricism and frequency

Posted on March 22, 2018 | Leave a comment

This is the second in a series of posts about the essentially final version of Carissa Hessick’s article Corpus Linguistics and the Criminal Law. The first post dealt mainly with Hessick’s views about how corpus linguistics relates to ultimate purpose of legal interpretation, which is to determine the legal meaning of the text in dispute. This time around, I’ll be discussing her claim that incorporating corpus linguistics into legal interpretation would radically transform the process of determining the text’s ordinary meaning:

Corpus linguistics reframes the “plain” or “ordinary” meaning inquiry in two ways. First, it claims that ordinary meaning is an empirical question. Second, it tells us that this empirical question ought to be answered by how frequently a term is used in a particular way. Both of these analytical moves represent significant departures from current theories of statutory interpretation, including textualism, and they render statutory interpretation essentially unrecognizable.

This statement is a mixed bag. In one respect, it’s correct. Those who support the use of corpus linguistics in legal interpretation do regard ordinary meaning as an empirical question—or at least as involving empirical questions. In a different respect, it is partly correct but oversimplified. Analysis of frequency data is in fact central to corpus linguistics, but it is not necessarily decisive, and in some cases (perhaps in many cases) it will not be helpful at all. And in a third respect, Hessick’s statement is wrong. Neither the empiricism of corpus linguistics nor the attention it pays to frequency represents a “significant departure” from existing interpretive theories.

Empiricism Continue reading →

Leave a comment

Posted in Corpus linguistics & lexicography, Corpus linguistics and statutory interpretation, Hessick, Law and corpus linguistics, Law and linguistics, Ordinary meaning

Thinking like a linguist (some news)

Posted on March 16, 2018 | Leave a comment

I have two pieces of news I want to share.

First, I am very excited to say that I have received an appointment by the Georgetown University Law Center (aka Georgetown Law) as a Dean’s Visiting Scholar.

That appointment will provide me with a platform from which I’ll continue and expand on the kind of work that I’ve been doing here at LAWnLinguistics, in the amicus briefs in which I’ve drawn on linguistics, and in my paper A Lawyer’s Introduction to Meaning in the Framework of Corpus Linguistics: developing and promoting the idea that part of what it means to think like a lawyer is learning how to think like a linguist.

Continue reading →

Leave a comment

Posted in Law and corpus linguistics, Law and linguistics, Self-promotion

Artis v. District of Columbia, part 2: Units of meaning and dictionary definitions

Posted on February 27, 2018 | Leave a comment

Sometimes, it’s immediately obvious from the opinions that a case raises questions about interpretation that are interesting, important, or both. Smith v. United States, in which the question was whether trading a handgun for drugs amounts to “using” it, is a classic example. At first glance, the Supreme Court’s decision in Artis v. District of Columbia doesn’t seem to be in that category. It doesn’t offer interesting linguistic issues that call attention to themselves, except for someone who is familiar with the work of the linguist John Sinclair and the lexicographer Patrick Hanks. But with some digging, Artis yields some issues that I think are interesting and significant, having to do with new approaches to analyzing questions of word meaning and with how not to use dictionaries.

Continue reading →

Leave a comment

Posted in "toll" (v.), Alito, Artis v. District of Columbia, Corpus linguistics & lexicography, Corpus linguistics and statutory interpretation, Dictionaries, Ginsburg, Gorsuch, Law and corpus linguistics, Law and linguistics, Word meaning

Responding further to Hessick on corpus linguistics (The first in a series)

Posted on January 19, 2018 | 1 comment

Carissa Hessick has recently posted a near-final version of her forthcoming article Corpus Linguistics and the Criminal Law, which will appear in a special issue of the B.Y.U. Law Review devoted to the papers that were presented at the law-and-corpus-linguistics conference at Brigham Young about a year ago. Like the draft that Hessick posted in September, the new version argues against the use of corpus linguistics in statutory interpretation. And although the article deals specifically with the use of corpus linguistics in criminal cases, Hessick acknowledges that some of her criticisms may apply more broadly.

I blogged about the previous draft, outlining some of my disagreements with Hessick’s position, and also offered some comments in response to her trio of posts about corpus linguistics at PrawfsBlawg (link, link, link). My disagreements apply equally to the revised version.

In this post, I’ll have some further things to say about Hessick’s portrayal of corpus linguistics as “a radical break from current interpretive theories.” The targets of that claim are Stephen Mouritsen and Utah Supreme Court Justice Lee. But as I’ll discuss, Mouritsen disputes Hessick’s reading of both his individual work and the work he and Lee have done together. (Justice Lee has so far maintained radio silence; perhaps he and Mouritsen will respond to Hessick in their forthcoming article in the Yale Law Journal [draft].) And in two or three posts that will follow this one, I’ll address some of the other aspects of Hessick’s argument. (Part 2 is here.)

HESSICK’S THESIS HASN’T CHANGED SIGNIFICANTLY between her original draft and the revised version. So the new draft, like the previous one, paints what I believe is an inaccurate picture of how corpus linguistics relates to statutory interpretation, and of the views and goals of corpus linguistics’s proponents.

Continue reading →

1 Comment

Posted in Corpus linguistics and statutory interpretation, Hessick, Law and corpus linguistics, Law and linguistics, Law review articles, Lee, Statutory interpretation, Textualism

More on the relevance of frequency data: Responding to Steinberg

Posted on January 2, 2018 | Leave a comment

In a comment on one of Carissa Hessick’s posts about corpus linguistics at Prawfsblawg, Asher Steinberg expressed the view that relying on frequency data in deciding issues of ordinary meaning is misguided. (Steinberg blogs at The Narrowest Grounds, where he frequently writes intelligently about statutory interpretation.) Shortly after that, I posted Meaning in the framework of corpus linguistics here, in which I explained why I believe that frequency data can in fact be relevant in doing legal interpretation. And that post prompted a long comment by Steinberg, elaborating on his objection to using frequency data in legal interpretation.

Steinberg fears that if the courts were to draw on corpus linguistics in the way I that I advocate, statutory interpretation would “fall into fundamental error[.]”His point of departure is my analysis of the corpus data regarding the issue raised by Muscarello v. United States—whether driving somewhere with a gun in the trunk or glove compartment counts as carrying a firearm. (My conclusions are briefly summarized in the post Steinberg comments on; for the full analysis, see my forthcoming article A Lawyer’s Introduction to Meaning in the Framework of Corpus Linguistics (henceforth, A Lawyer’s Introduction)) Steinberg argues that frequency data—or at least the kind of frequency data that my analysis is based on— is inherently unreliable as evidence of ordinary meaning.

I beg to differ.

Continue reading →

Leave a comment

Posted in "carry", Corpus linguistics & lexicography, Corpus linguistics and statutory interpretation, Hessick, Law and corpus linguistics, Law and linguistics, Muscarello v. United States, Uncategorized

Another judicial endorsement of corpus linguistics

Posted on November 26, 2017 | Leave a comment

On Facebook, Stephen Mouritsen writes, “Justice Christine Durham [of the Utah Supreme Court] finally comes around to corpus linguistics . . . and then promptly retires. (Oh well. A win’s a win.)”

Mouritsen is referring to this, from footnote 9 in Justice Durham’s concurrence in Fire Insurance Exchange v. Oltmanns, 2017 UT 81 [paragraph break added]:

Even though we place great trust in a judge’s discernment, a “judge’s confidence in her linguistic intuition may be misplaced. . . . Though the human language faculty is very good at assessing which meanings are linguistically permissible in a given context, human intuition is less successful in selecting the most common meaning or common understanding.” Stephen C. Mouritsen, Hard Cases and Hard Data: Assessing Corpus Linguistics as an Empirical Path to Plain Meaning, 13 Colum. Sci. & Tech. L. Rev. 156, 160–61 (2012) [hereinafter Mouritsen, Hard Cases]. When terms are to “be interpreted according to their ordinary meaning, they implicate a set of empirical questions, many of which are amenable to different types of linguistic analysis. . . . [I]n the field of corpus linguistics, scholars . . . determine . . . those meanings that are consistent with common usage,” or “the term’s ordinary or most frequent meaning” based on empirical data rather than personal intuition. Id. at 161.

These tools for empirical analysis are readily available to lawyers and should be used when appropriate. See, e.g., Rasabout, 2015 UT 72, ¶¶ 57–134, (Lee, J., concurring); In re Adoption of Baby E.Z., 2011 UT 38, ¶¶ 86–105, 266 P.3d 702 (Lee, A.C.J., concurring); Brief for the Project On Government Oversight et al. as Amici Curiae Supporting Petitioners, FCC v. AT&T, Inc., 562 U.S. 397 (2011) (No. 09-1279) [link – NG]; 2017 BYU Law Review Symposium, Law & Corpus Linguistics, 2017 B.Y.U. L. Rev. (forthcoming), http://lawcorpus.byu.edu/; Neal Goldfarb, Words, Meanings, Corpora: A Lawyer’s Introduction to Meaning in the Framework of Corpus Linguistics, 2017 B.Y.U. L. REV. (forthcoming), https://ssrn.com/abstract=2907485; Stephen C. Mouritsen, The Dictionary is Not a Fortress: Definitional Fallacies and a Corpus-Based Approach to Plain Meaning, 2010 B.Y.U. L. REV. 1915; Mouritsen, Hard Cases, supra; Daniel Ortner, The Merciful Corpus: The Rule of Lenity, Ambiguity and Corpus Linguistics, 25 B.U. Pub. Int. L.J. 101 (2016); James C. Phillips, Daniel Ortner, & Thomas Lee, Corpus Linguistics & Original Public Meaning: A New Tool to Make Originalism More Empirical, 126 Yale L.J. Forum 20 (2016); Neal Goldfarb, LAWN LINGUISTICS, https://lawnlinguistics.wordpress.com/ (last visited May 16, 2017) (discussing many contemporary issues regarding corpus linguistics and the law and providing links to various online tools and resources).

Leave a comment

Posted in Corpus linguistics and statutory interpretation, Law and corpus linguistics, Law and linguistics, Self-promotion, Uncategorized

Category Archives: Law and corpus linguistics

The BYU Law corpora (updated)

Lucia v. SEC: Corpus linguistics and originalism

“Empirical” doesn’t necessarily mean “definitively verifiable”

Corpus linguistics and empiricism: A Twitter exchange

Corpus linguistics: Empiricism and frequency

Thinking like a linguist (some news)

Artis v. District of Columbia, part 2: Units of meaning and dictionary definitions

Responding further to Hessick on corpus linguistics (The first in a series)

More on the relevance of frequency data: Responding to Steinberg

Another judicial endorsement of corpus linguistics

Recent posts

Law, Linguistics, or Law & Linguistics (Broadly Construed)

Tools & stuff

Dictionaries, Lexicography, and All That

Categories

Archives

Meta