Category Archives: Corpus linguistics & lexicography

BYU Law Review: Special issue on law and corpus linguistics

The B.Y.U. Law Review has published its special issue devoted to the papers presented at the 2017 law-and-corpus-linguistics conference hosted by the B.Y.U. Law School.

One of the papers in the volume is mine: “A Lawyer’s Introduction to Meaning in the Framework of Corpus Linguistics” (abstract; pdf), which discusses a new way of thinking about the issue of word meaning that has developed as a result of the use of corpus linguistics in lexicography. A condensed version of that discussion (very condensed) can be found in my post Meaning in the Framework of Corpus Linguistics.

Of the other papers, there are three that I think will be of the most interest to readers (whether judges, lawyers, or legal academics) who want to learn more about what role corpus linguistics can play in legal interpretation. Two of those papers view the use of corpus linguistics positively; the other is critical of it.

Continue reading

“Empirical” doesn’t necessarily mean “definitively verifiable”

Carissa Hessick and I have been debating the appropriateness of using empirical methods in legal interpretation. The debate began on PrawfsBlawg, then moved over here (with some continued discussion at Prawfs), and then spread to Twitter. The relevant tweets are collected in my previous post, and in this post I’ll respond to Hessick’s most recent points.

As I understand her, Hessick contends that the issue of ordinary meaning isn’t an “empirical question” because the question of how a reasonable person would understand the text is inherently qualitative rather than quantitative, and therefore can’t be answered in a way that is “provable or verifiable.” I accept Hessick’s characterization of the ordinary-meaning issue as being qualitative rather than quantitative, but it doesn’t follow that quantitative information is always irrelevant.

Continue reading

Corpus linguistics and empiricism: A Twitter exchange

My last post, Corpus linguistics: Empiricism and frequency, prompted a Twitter exchange between Carissa Hessick and me, a lightly edited version of which I present here.

Hessick:

One question based on my quick read:  Do you think most people would understand “relying on linguistic intuition” to be an empirical undertaking?  I appreciate the insight into how people’s linguistic intuitions are formed.  But don’t most people think that, if something is an empirical question, that means there is a demonstrably correct answer?

And if we often have different intuitions about what a word means (as the split decisions on ordinary meaning illustrate), and if judges resolve the Q of ordinary meaning by consulting their own intuitions, then how can ordinary meaning be an empirical Q? If I have one intuition and you have another, then how to we demonstrate which is correct and which is incorrect?

Me: Continue reading

Corpus linguistics: Empiricism and frequency

This is the second in a series of posts about the essentially final version of Carissa Hessick’s article Corpus Linguistics and the Criminal Law. The first post dealt mainly with Hessick’s views about how corpus linguistics relates to ultimate purpose of legal interpretation, which is to determine the legal meaning of the text in dispute. This time around, I’ll be discussing her claim that incorporating corpus linguistics into legal interpretation would radically transform the process of determining the text’s ordinary meaning:

Corpus linguistics reframes the “plain” or “ordinary” meaning inquiry in two ways. First, it claims that ordinary meaning is an empirical question. Second, it tells us that this empirical question ought to be answered by how frequently a term is used in a particular way. Both of these analytical moves represent significant departures from current theories of statutory interpretation, including textualism, and they render statutory interpretation essentially unrecognizable.

This statement is a mixed bag. In one respect, it’s correct. Those who support the use of corpus linguistics in legal interpretation do regard ordinary meaning as an empirical question—or at least as involving empirical questions. In a different respect, it is partly correct but oversimplified. Analysis of frequency data is in fact central to corpus linguistics, but it is not necessarily decisive, and in some cases (perhaps in many cases) it will not be helpful at all. And in a  third respect, Hessick’s statement is wrong. Neither the empiricism of corpus linguistics nor the attention it pays to frequency represents a “significant departure” from existing interpretive theories.

Empiricism Continue reading

Artis v. District of Columbia, part 2: Units of meaning and dictionary definitions

Sometimes, it’s immediately obvious from the opinions that a case raises questions about interpretation that are interesting, important, or both. Smith v. United States, in which the question was whether trading a handgun for drugs amounts to “using” it, is a classic example. At first glance, the Supreme Court’s decision in Artis v. District of Columbia  doesn’t seem to be in that category. It doesn’t offer interesting linguistic issues that call attention to themselves, except for someone who is familiar with the work of the linguist John Sinclair and the lexicographer Patrick Hanks. But with some digging, Artis yields some issues that I think are  interesting and significant, having to do with new approaches to analyzing questions of word meaning and with how not to use dictionaries.

Continue reading

More on the relevance of frequency data: Responding to Steinberg

In a comment on one of Carissa Hessick’s posts about corpus linguistics at Prawfsblawg, Asher Steinberg expressed the view that relying on frequency data in deciding issues of ordinary meaning is misguided. (Steinberg blogs at The Narrowest Grounds, where he frequently writes intelligently about statutory interpretation.) Shortly after that, I posted Meaning in the framework of corpus linguistics here, in which I explained why I believe that frequency data can in fact be relevant in doing legal interpretation. And that post prompted a long comment by Steinberg, elaborating on his objection to using frequency data in legal interpretation.

Steinberg fears that if the courts were to draw on corpus linguistics in the way I that I advocate, statutory interpretation would “fall into fundamental error[.]”His point of departure is my analysis of the corpus data regarding the issue raised by Muscarello v. United States—whether driving somewhere with a gun in the trunk or glove compartment counts as carrying a firearm. (My conclusions are briefly summarized in the post Steinberg comments on; for the full analysis, see my forthcoming article A Lawyer’s Introduction to Meaning in the Framework of Corpus Linguistics (henceforth, A Lawyer’s Introduction)) Steinberg argues that frequency data—or at least the kind of frequency data that my analysis is based on— is inherently unreliable as evidence of ordinary meaning.

I beg to differ.

Continue reading

What is this thing called law and corpus linguistics?

I recently came across a new paper by Stephen Mouritsen: “Corpus Linguistics in Legal Interpretation—An Evolving Interpretive Framework” (pdf). Judging from the title, you might think that the paper is intended as an introduction to corpus linguistics as an interpretive tool, with the intended audience being lawyers, judges, and law professors. But if you thought that, you’d be wrong.

The paper traces the birth and development (so far) of law-and-corpus-linguistics as a field of practice and study. (Hmmm…“field of practice and study”? Doesn’t that sound a little overblown?) Because the paper was written for an audience that was already familiar with corpus linguistics, it doesn’t explain what corpus linguistics is. What it does instead is to give a fairly complete description of the work that has been done so far in this area of inquiry. (No, that’s worse.) It covers what’s happened in the courts, in the legal academy, and in the internet-o-sphere. And with more attention starting to be paid law-and-corpus-linguistics movement (oy), there has been a need for something that will help them get up to speed. This paper fills that need.

Continue reading

Meaning in the framework of corpus linguistics

At the end of my previous post discussing Carissa Hessick’s paper “Corpus Linguistics and the Criminal Law,” I said that I would follow up with another post “making the affirmative case for the relevance of frequency data in determining ordinary meaning.” This is that post.

Given that subject, you might wonder why I’ve titled this post “Meaning in the framework of corpus linguistics.” The answer is that corpus linguistics has not only provided a methodology for investigating meaning, it has also generated important insights about word meaning. (That was the subject of the paper I presented at the BYU symposium in February, which has been published, along with the other papers from the symposium, in the BYU Law Review.) I’ll draw on those insights when I talk about frequency analysis, and I thought it would be helpful to make them explicit.

THERE ARE A VARIETY OF DIFFERENT WAYS to think about word meanings. One of them is the way that I see as characteristic of how lawyers and judges tend to think: the meaning of a word is more or less equated with its dictionary definition, and then the definition is in effect read into the statute. If you’ve read a lot of cases, you’ll probably recognize the pattern:

The issue here is what “flood” means. Webster’s Dictionary defines “flood” to mean, “a great flow of water over what is usually dry land.” Therefore, the plaintiffs must show that the water in their basement resulted from a great flow of water over what is usually dry land.

Under this approach, the dictionary entry is treated as if what it defines is the concept FLOOD rather than the word flood. The dictionary entry is being used as stating the conditions determining whether a particular instance of water on the ground qualifies as a flood. Considering the role that dictionaries have come to play in legal interpretation, it is no small irony that many lexicographers would say that the definitions they write aren’t intended to serve that purpose.

Continue reading

Some comments on Hessick on corpus linguistics (updated)

UP UNTIL NOW, the use of corpus linguistics in legal interpretation has gotten almost entirely good press—probably because almost all the press it’s gotten has come from its advocates. That situation has now changed, though, with the posting on SSRN of a paper by UNC law professor Carissa Hessick, who was one of the participants at the BYU law-and-corpus-linguistics symposium this past February. (Hessick has blogged about her paper at Prawfsblawg, here and here.)

The paper, “Corpus Linguistics and the Criminal Law” (pdf), argues that corpus linguistics “is not an appropriate tool” for interpreting statutes. Although it deals specifically with using corpus linguistics in interpreting criminal statutes, and Hessick’s concerns may not be as strong as to other areas of the law, much of her criticism would apply across the board. In this post I am going to discuss some of the issues that the paper raises, and if you’ve followed this blog before, you won’t be surprised to find out that I disagree with Hessick’s conclusion.

Continue reading

The semantics of sleeping in railway stations

 

“I really, really like the work in Congress, I really do, but I love my family more. People may try to make it more than that, but it’s really that simple,” [Jason] Chaffetz said on MSNBC. “I just turned 50. I’m sleeping on a cot in my office.”

Chaffetz on No 2018 Run: ‘I Just Turned 50, I’m Sleeping on a Cot in My Office,’ Talking Points Memo xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxx

Everyone familiar with the academic literature on statutory interpretation is aware of the no-vehicles-in-the-park hypothetical. It was formulated by the legal philosopher H.L.A. Hart to illustrate the argument that the words in which a law is written must have “a core of settled meaning”—a set of “standard instance[s] in which no doubts are felt about [the law’s] application”—but will also have “a penumbra of debatable cases in which words are neither obviously applicable nor obviously ruled out.”  But Harvard law professor Lon Fuller denied the existence of any core area in which the law’s applicability was clear; for Fuller, the law’s applicability turned not on linguistic semantics but on the law’s purpose. Thus, he asked whether, under the hypothesized prohibition against vehicles in the park, “mount[ing] on a pedestal in the park a truck used in World War II…in perfect working order” would fall within the law’s core or its periphery.

Less well known than Hart’s thought experiment is a separate hypothetical offered by Fuller to support his challenge to  Hart. Fuller posits a law making it a misdemeanor “to sleep in any railway station.” He then supposes that two people have been arrested for violating this law: one who dozed off while waiting for a train, and another “who had brought a blanket and pillow to the station and had obviously settled himself down for the night[,]” but who had been arrested before he fell asleep. “Which of these cases,” Fuller asked, “presents the ‘standard instance’ of the word ‘sleep’?” And would it be faithful to the law to say that the law had been violated by the second person but not the first?

The hypothetical is thought-provoking because applying what is assumed to be the literal meaning of the law—that it prohibits being asleep in a railway station—would yield a conclusion that seems nonsensical: that the law was violated by the dozing passenger but not by the person who was bedded down but still awake. The hypothetical has been discussed by some very smart legal scholars and philosophers over the years, including Kent Greenawalt, Fred Schauer, John Manning, Scott Soames, and Andrei Marmor, and with few exceptions (mainly the linguist Robyn Carston) they have accepted that assumption. Schauer put it as well as anyone: “Sleep is a physiological state, and as a matter of physiology Fuller’s businessman was sleeping. Period.”

But in fact (you can guess where this is going, can’t you?), the assumption’s validity is doubtful at best. It is entirely consistent with actual usage to use sleep in a railway station to mean ‘use a railway station as a place to sleep’ rather than ‘be asleep in a railway station.’

Continue reading