Language Log

Archive for Language and the law

Mandarin translation issues impeding the courts in New York

June 20, 2024 @ 4:36 am· Filed by Victor Mair under Language and the law, Translation

"Mandarin Leaves a Manhattan Courtroom Lost in Translation: Trial of Guo Wengui shows how linguistic issues can trip up China-related cases", by James T. Areddy, WSJ (6/18/24)

———

The New York trial of a Chinese businessman is Exhibit A for how language issues are gumming up federal prosecutions of Mandarin-speaking defendants.

Nearly everyone in the lower Manhattan courtroom appears frustrated by a halting process that requires translation of Chinese-language videos, documents and witness testimony.

It is one in a series of high-profile China-linked cases that are similarly getting lost in translation. Chinese-language evidence is piling up, unintelligible to attorneys. Translations are slow, and sometimes wrong. There is a limited pool of top-tier Mandarin court interpreters, and they can disagree on English translations. And for both sides in a trial, the work of interpreters provides ammunition for legal wrangling, from gamesmanship to courtroom objections and possible appeals.

Read the rest of this entry »

Permalink Comments (23)

The ideology of legal corpus linguistics

June 4, 2024 @ 12:00 pm· Filed by Mark Liberman under Language and the law

Jonathan Weinberg sent in a link to this article — Molly Redden, "How A Luxury Trip For Trump Judges Doomed The Federal Mask Mandate", Huffington Post 6/3/2024:

Buried in the April 2022 ruling that struck down the Biden administration’s mask mandate was a section that was unusual for a court decision.

The outcome itself was far from surprising. Places all over the country were dropping local mask requirements, and the judge hearing this case — a challenge to the federal mandate to mask on planes and other public transportation — was a conservative Trump appointee, U.S. District Judge Kathryn Kimball Mizelle for the Middle District of Florida. Mizelle ruled that the Centers for Disease Control and Prevention’s mask requirement overstepped the agency’s legal authority.

What was eye-catching was her explanation of why. In her ruling, Mizelle wrote she had consulted the Corpus of Historical American English, an academic search engine that returns examples of how words and phrases are used in select historical texts. Mizelle searched “sanitation,” a crucial word in the 1944 statute that authorizes the CDC to issue disease-prevention rules, and found it generally was used to describe the act of making something clean. “Wearing a mask,” she wrote, “cleans nothing.”

Searching large linguistic databases is a relatively new approach to judicial analysis called legal corpus linguistics. Although it has gained in popularity over the last decade, it is barely discussed outside of an enthusiastic group of right-wing conservative legal scholars. Which raises the question: How did this niche concept wind up driving such a consequential decision in the country’s health policy?

Read the rest of this entry »

Permalink Comments (7)

LLMs for judicial interpretation of "ordinary meaning"

May 30, 2024 @ 4:14 pm· Filed by Victor Mair under Artificial intelligence, Language and the law

Kevin Newsom serves as a United States circuit judge of the United States Court of Appeals for the Eleventh Circuit (of which there are a total of 13 across the country; since they are the next level below the Supreme Court, their practices and opinions are of great importance).

Judge Suggests Courts Should Consider Using "AI-Powered Large Language Models" in Interpreting "Ordinary Meaning", Eugene Volokh, The Volokh Conspiracy | 5.30.2024

That's from Judge Kevin Newsom's concurrence yesterday in Snell v. United Specialty Ins. Co.; the opinion is quite detailed and thoughtful, so people interested in the subject should read the whole thing. Here, though, is the introduction and the conclusion:

Read the rest of this entry »

Permalink Comments (5)

Brown Revisited

May 15, 2024 @ 10:59 am· Filed by Mark Liberman under Artificial intelligence, Language and the law

A couple of months ago, I told you about a project to recreate the Supreme Court oral arguments associated with Brown v. Board of Education ("Spontaneous SCOTUS", 3/2/2024):

Years ago, Jerry Goldman (then at Northwestern) created the oyez.org website as

a multimedia archive devoted to making the Supreme Court of the United States accessible to everyone. It is the most complete and authoritative source for all of the Court’s audio since the installation of a recording system in October 1955. Oyez offers transcript-synchronized and searchable audio, plain-English case summaries, illustrated decision information, and full-text Supreme Court opinions

He rescued decades of tapes and transcripts from the National Archives, digitized and improved them, and arranged the website's interactive presentations of the available recordings. Jiahong Yuan and I played a role, by devising and validating a program to identify which justice was speaking when (See "Speaker Identification on the Scotus Corpus", 2008).

More recently, Jerry has inspired an effort to recreate oral arguments from famous cases that took place before the recording system was installed, starting with Brown v. Board of Education. Rejecting the idea of producing "deep fakes" using the existing transcripts and extant recordings of the justices involved, he and his colleagues decided to create what we might call "shallow fakes", where actors will perform (selections from) the transcripts, and a voice morphing system will then be used to make their recordings sound like the target speakers. The recreated clips will be embedded in explanatory material.

All the scripts have been written, and in a few months, you'll be able to hear the results — which I expect will be terrific.

And here it is, at https://brown.oyez.org!

Read the rest of this entry »

Permalink Comments off

Hendiadys and sleeping in parks

April 22, 2024 @ 5:01 pm· Filed by Mark Liberman under Language and the law

Samuel Bray, "Cruel AND Unusual?", Reason 4/21/2024:

On Monday, the Supreme Court will hear argument in an Eighth Amendment case, City of Grants Pass, Oregon v. Johnson. One thing I will be watching for is whether the justices in their questions treat "cruel and unusual" as two separate requirements, or as one.

The Eighth Amendment (to the U.S. Consitution) says that "Excessive bail shall not be required, nor excessive fines imposed, nor cruel and unusual punishments inflicted."

And the issue in the cited Supreme Court case is "Whether the enforcement of generally applicable laws regulating camping on public property constitutes “cruel and unusual punishment” prohibited by the Eighth Amendment." (More here, here, and elsewhere…)

Read the rest of this entry »

Permalink Comments (13)

When is research "deemed not research", and why?

March 12, 2024 @ 6:07 am· Filed by Mark Liberman under Language and the law

This post is going to delve into one aspect of a recently-published article from the Centers for Disease Contol: DeCuir J, Payne AB, Self WH, et al. Interim Effectiveness of Updated 2023–2024 (Monovalent XBB.1.5) COVID-19 Vaccines Against COVID-19–Associated Emergency Department and Urgent Care Encounters and Hospitalization Among Immunocompetent Adults Aged ≥18 Years — VISION and IVY Networks, September 2023–January 2024. MMWR Morb Mortal Wkly Rep 2024;73:180–188.

Read the rest of this entry »

Permalink Comments (11)

Unborn Alabama chickens

February 26, 2024 @ 2:05 pm· Filed by Victor Mair under Humor, Language and food, Language and the law

Read the rest of this entry »

Permalink Comments (11)

The non-culpability of ChatGPT in legal cases

February 2, 2024 @ 10:51 am· Filed by Victor Mair under Artificial intelligence, Language and the law

"Second Circuit Refers Lawyer for Disciplinary Proceedings Based on AI-Hallucinated Case in Brief", by Eugene Volokh, The Volokh Conspiracy, reason | 1.30.2024

From Park v. Kim, decided today by the Second Circuit (Judges Barrington Parker, Allison Nathan, and Sarah Merriam); this is the 13th case I've seen in the last year in which AI-hallucinated citations were spotted:

We separately address the conduct of Park's counsel, Attorney Jae S. Lee. Lee's reply brief in this case includes a citation to a non-existent case, which she admits she generated using the artificial intelligence tool ChatGPT. Because citation in a brief to a non-existent case suggests conduct that falls below the basic obligations of counsel, we refer Attorney Lee to the Court's Grievance Panel, and further direct Attorney Lee to furnish a copy of this decision to her client, Plaintiff-Appellant Park….

Read the rest of this entry »

Permalink Comments (10)

Butter chicken

January 27, 2024 @ 6:44 am· Filed by Victor Mair under Language and food, Language and the law, Names, Romanization, Translation

Who owns it?

It's sort of like who owns kimchee, Koreans (of course!) or Chinese — we've been through that many times — except that the question of who has the rights to claim they invented butter chicken is ostensibly internecine / intranational rather than international (but maybe not [see below]), as is the case with kimchee.

"India’s courts to rule on who invented butter chicken: Two Delhi restaurants both claim to have the right to call themselves the home of the original butter chicken recipe" by Hannah Ellis-Petersen, The Guardian (1/25/24)

Judging from the account in The Guardian, the squabbling between the two Delhi restaurants is both picayune and misplaced:

Read the rest of this entry »

Permalink Comments (5)

Normative language

January 23, 2024 @ 9:30 am· Filed by Victor Mair under Ethics, Grammar, Language and science, Language and the law, Pragmatics, Semantics, Sociolinguistics

A matter that requires nuancing: Jinyi Kuang and Cristina Bicchieri, "Language matters: how normative expressions shape norm perception and affect norm compliance", Philosophical Transactions of the Royal Society B: Biological Sciences, 2024:

Abstract: Previous studies have used various normative expressions such as ‘should’, ‘appropriate’ and ‘approved’ interchangeably to communicate injunctions and social norms. However, little is known about whether people's interpretations of normative language differ and whether behavioural responses might vary across them. In two studies (total n = 2903), we find that compliance is sensitive to the types of normative expressions and how they are used. Specifically, people are more likely to comply when the message is framed as an injunction rather than as what most people consider good behaviour (social norm framing). Behaviour is influenced by the type of normative expression when the norm is weak (donation to charities), not so when the norm is strong (reciprocity). Content analysis of free responses reveals individual differences in the interpretation of social norm messages, and heterogeneous motives for compliance. Messages in the social norm framing condition are perceived to be vague and uninformative, undermining their effectiveness. These results suggest that careful choice of normative expressions is in order when using messages to elicit compliance, especially when the underlying norms are weak.

Read the rest of this entry »

Permalink Comments (2)

Stepford authors

January 12, 2024 @ 6:41 am· Filed by Mark Liberman under Computational linguistics, Language and the law

The issues discussed in "AI plagiarism" (1/4/2024) are rapidly coming to a boil. But somehow I missed Margaret Atwood's take on the topic, published last summer — "Murdered by my replica", The Atlantic 8/26/2023:

Remember The Stepford Wives? Maybe not. In that 1975 horror film, the human wives of Stepford, Connecticut, are having their identities copied and transferred to robotic replicas of themselves, minus any contrariness that their husbands find irritating. The robot wives then murder the real wives and replace them. Better sex and better housekeeping for the husbands, death for the uniqueness, creativity, and indeed the humanity of the wives.

The companies developing generative AI seem to have something like that in mind for me, at least in my capacity as an author. (The sex and the housekeeping can be done by other functionaries, I assume.) Apparently, 33 of my books have been used as training material for their wordsmithing computer programs. Once fully trained, the bot may be given a command—“Write a Margaret Atwood novel”—and the thing will glurp forth 50,000 words, like soft ice cream spiraling out of its dispenser, that will be indistinguishable from something I might grind out. (But minus the typos.) I myself can then be dispensed with—murdered by my replica, as it were—because, to quote a vulgar saying of my youth, who needs the cow when the milk’s free?

To add insult to injury, the bot is being trained on pirated copies of my books. Now, really! How cheap is that? Would it kill these companies to shell out the measly price of 33 books? They intend to make a lot of money off the entities they have reared and fattened on my words, so they could at least buy me a coffee.

Read the rest of this entry »

Permalink Comments (9)

AI plagiarism

January 4, 2024 @ 4:56 pm· Filed by Mark Liberman under Language and the law

"The Times Sues OpenAI and Microsoft Over A.I. Use of Copyrighted Work", NYT 12/27/2023:

The New York Times sued OpenAI and Microsoft for copyright infringement on Wednesday, opening a new front in the increasingly intense legal battle over the unauthorized use of published work to train artificial intelligence technologies.

The Times is the first major American media organization to sue the companies, the creators of ChatGPT and other popular A.I. platforms, over copyright issues associated with its written works. The lawsuit, filed in Federal District Court in Manhattan, contends that millions of articles published by The Times were used to train automated chatbots that now compete with the news outlet as a source of reliable information.

The suit does not include an exact monetary demand. But it says the defendants should be held responsible for “billions of dollars in statutory and actual damages” related to the “unlawful copying and use of The Times’s uniquely valuable works.” It also calls for the companies to destroy any chatbot models and training data that use copyrighted material from The Times.

Read the rest of this entry »

Permalink Comments (18)

"Made from 100% VERBed X Y"?

December 29, 2023 @ 4:05 pm· Filed by Mark Liberman under Language and the law, Pragmatics

As discussed in "this post" from 9/5/2017, the label "100% grated Parmesan cheese" means only that the product's Parmesan cheese is 100% grated, or maybe that the cheese in it is 100% Parmesan — never mind the cellulose powder that's also in the mix.