Cheap talk (a formal model)

Jun 15, 2026

A formal model of cheap talk was first given in Crawford, Sobel 1982, in which a sender (“S”) sends a recipient (“R”) a message (“m”) about the state of the world (“𝜽”). “R” cannot observe “𝜽” directly, but updates his belief about “𝜽” given “m” (i.e., “Pr(𝜽 | m)” [Note 1]), then takes an action “a”. The payoffs depend on the action taken (“a”) and the state of the world (“𝜽”), and are represented by utility functions:

payoff to S: u(a, 𝜽, b);
payoff to R: v(a, 𝜽),

where the scalar “b” measures how much R’s and S’s interests diverge.
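To make the role of “b” concrete, here is a minimal sketch that assumes the quadratic-loss payoffs of the leading example in Crawford, Sobel 1982. The general model above does not fix a functional form, so the specific formulas, the function names, and the numbers chosen for theta and b are illustrative assumptions.

```python
def u_sender(a, theta, b):
    """Assumed sender payoff: highest when the action equals theta + b."""
    return -(a - (theta + b)) ** 2


def v_receiver(a, theta):
    """Assumed receiver payoff: highest when the action equals theta."""
    return -(a - theta) ** 2


def argmax_on_grid(payoff, grid):
    """Return the grid point with the highest payoff."""
    return max(grid, key=payoff)


theta, b = 0.4, 0.1                     # illustrative values
grid = [i / 1000 for i in range(1001)]  # candidate actions in [0, 1]

best_for_s = argmax_on_grid(lambda a: u_sender(a, theta, b), grid)
best_for_r = argmax_on_grid(lambda a: v_receiver(a, theta), grid)

# The gap between the two preferred actions is the bias b.
print(best_for_s, best_for_r, best_for_s - best_for_r)
```

Under this assumed form, S always wants an action that lies b above the one R would pick if he knew 𝜽, which is the sense in which b measures the divergence of interests.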

Each party maximises its own payoff, taking into account the other party’s behaviour. “S”, who observes “𝜽”, chooses “m” to maximise u(a, 𝜽, b); “R”, who does not, chooses “a” to maximise the expected value of v(a, 𝜽) under his updated belief “Pr(𝜽 | m)”. The outcome is taken to be an equilibrium in which no party has reason to deviate from their behaviour (a Nash equilibrium).
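R’s side of this can be sketched numerically. The snippet assumes that, after hearing “m”, R believes 𝜽 is uniformly distributed on an interval [lo, hi], and that his payoff is the quadratic loss -(a - 𝜽)^2; both are assumptions borrowed from the standard uniform-quadratic example, and the function names are hypothetical.

```python
def expected_v(a, lo, hi, samples=201):
    """Approximate R's expected payoff from action a when theta is
    uniform on [lo, hi], using an evenly spaced grid of theta values."""
    step = (hi - lo) / (samples - 1)
    thetas = [lo + k * step for k in range(samples)]
    return sum(-(a - t) ** 2 for t in thetas) / samples


def best_response(lo, hi, grid_size=401):
    """Search a grid of actions in [0, 1] for the one R likes best."""
    actions = [k / (grid_size - 1) for k in range(grid_size)]
    return max(actions, key=lambda a: expected_v(a, lo, hi))


lo, hi = 0.2, 0.6  # illustrative posterior support
print(best_response(lo, hi), (lo + hi) / 2)
```

With quadratic loss the best action is the posterior mean of 𝜽, which for a uniform belief is the midpoint of the interval; the grid search is only there to confirm that without relying on the closed form.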

In Crawford, Sobel 1982, “𝜽” is drawn from a continuum, and the messages “m” are discrete, each corresponding to one interval of a partition of “𝜽”. For example:

“m1” → “𝜽” lies between 𝜽0 and 𝜽1,
“m2” → “𝜽” lies between 𝜽1 and 𝜽2,
…,
“mN” → “𝜽” lies between 𝜽(N-1) and 𝜽N,

where “N” is the number of intervals in the partition of “𝜽”.
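The partition can be computed explicitly in the uniform-quadratic example of Crawford, Sobel 1982 (𝜽 uniform on [0, 1], quadratic losses, sender bias b > 0). In that case the cut points are 𝜽i = i/N + 2·b·i·(i − N), and an N-interval equilibrium exists only if 2·N·(N − 1)·b < 1. The sketch below assumes that special case; the function names are hypothetical.

```python
def boundaries(b, n):
    """Cut points theta_0..theta_n of the n-interval equilibrium,
    assuming theta ~ Uniform[0, 1] and quadratic losses."""
    if 2 * n * (n - 1) * b >= 1:
        raise ValueError("no equilibrium with this many intervals for this b")
    return [i / n + 2 * b * i * (i - n) for i in range(n + 1)]


def message_for(theta, cuts):
    """Index i of the message m_i whose interval contains theta."""
    for i in range(1, len(cuts)):
        if theta <= cuts[i]:
            return i
    raise ValueError("theta lies outside the partition")


b, n = 0.05, 3  # illustrative values
cuts = boundaries(b, n)
print([round(c, 4) for c in cuts])
print(message_for(0.3, cuts))

# Indifference check: a sender whose type sits exactly on a cut point
# is equally far (after adding b) from the two neighbouring actions.
for i in range(1, n):
    a_low = (cuts[i - 1] + cuts[i]) / 2
    a_high = (cuts[i] + cuts[i + 1]) / 2
    print(i, round((a_low + a_high) / 2 - (cuts[i] + b), 12))
```

Each interval is 4b longer than the one before it, so messages about high states are vaguer than messages about low states when S is biased upwards.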

The number of intervals depends on how far the parties’ interests diverge:

as “b → ∞” (i.e., full divergence), “N → 1” (i.e., “𝜽” is left as a single interval) so that “m” reveals no information (i.e., babbling equilibrium);
as “b → 0” (i.e., full alignment), “N → ∞” (i.e., “𝜽” is finely partitioned) so that “m” reveals “𝜽” (i.e., full information disclosure).
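How N varies with b can be tabulated for the uniform-quadratic example of Crawford, Sobel 1982, where the largest feasible N is the largest integer satisfying 2·N·(N − 1)·b < 1. The sketch assumes that special case only; `max_intervals` is a hypothetical helper name.

```python
def max_intervals(b):
    """Largest N with 2 * N * (N - 1) * b < 1, for a bias b > 0
    (uniform-quadratic example only)."""
    if b <= 0:
        raise ValueError("b must be positive; N is unbounded as b approaches 0")
    n = 1
    # Keep adding intervals while an equilibrium with n + 1 of them exists.
    while 2 * (n + 1) * n * b < 1:
        n += 1
    return n


for b in (0.5, 0.25, 0.1, 0.05, 0.01, 0.001):
    print(b, max_intervals(b))
```

In this example the babbling outcome does not require an infinite bias: only N = 1 is feasible as soon as b reaches 1/4.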

Crawford, Sobel 1982 shows that:

if the parties’ interests are fully aligned (b = 0), “m” reveals “𝜽” (“full information disclosure”), and “a” maximises both parties’ payoffs;
if the parties’ interests are widely divergent (b = ∞), “m” reveals no information (“babbling equilibrium”), and “R” chooses “a” on the basis of his prior belief about “𝜽” alone.
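Between these two extremes, the amount of information conveyed can be measured by how wrong R’s action is on average. The sketch below again assumes the uniform-quadratic example (𝜽 uniform on [0, 1], quadratic losses, R playing the midpoint of the announced interval); the helper names are hypothetical.

```python
def boundaries(b, n):
    """Cut points of the n-interval equilibrium (uniform-quadratic case)."""
    if 2 * n * (n - 1) * b >= 1:
        raise ValueError("no equilibrium with this many intervals for this b")
    return [i / n + 2 * b * i * (i - n) for i in range(n + 1)]


def receiver_expected_loss(b, n):
    """R's expected squared error when he plays the midpoint of each interval.

    An interval of length L occurs with probability L and leaves a
    within-interval variance of L**2 / 12, so it contributes L**3 / 12.
    """
    cuts = boundaries(b, n)
    lengths = [hi - lo for lo, hi in zip(cuts, cuts[1:])]
    return sum(length ** 3 for length in lengths) / 12


b = 0.05  # illustrative bias
for n in (1, 2, 3):
    print(n, round(receiver_expected_loss(b, n), 4))
```

N = 1 is the babbling case, where the loss equals the prior variance of 𝜽; each additional interval lowers the loss, and it would fall to zero only under full disclosure.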

******************

Reference: Crawford, Sobel 1982, “Strategic Information Transmission”. The presentation of their model in this Substack post is based on the one described in the Wikipedia page on cheap talk.
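Note 1: “Pr(𝜽 | m)” is the recipient’s belief about “𝜽” after receiving “m”. If there were only two possible states, for instance, Pr(𝜽 | m) = 0.5 would mean that the recipient has no view either way on which state holds.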

