223 离散Riccati方程解关于采样周期的收敛性

223 离散Riccati方程解关于采样周期的收敛性

毕苏林

爱科学，也爱文艺；重逻辑，也重情感。以最硬核的科幻为壳，写最柔软的人间故事。愿以文字为桥，结识品味相投的读友。

42 0

2026/05/11

6 mins read

☕

---

离散Riccati方程解关于采样周期的收敛性

作者：张苏杭洛阳

摘要：本文严格证明了当采样周期 h\to 0 时，离散代数Riccati方程的解 P_h 收敛到连续代数Riccati方程的解 P_c。在 (A,B) 可稳、(Q^{1/2},A) 可检测的标准假设下，我们得到收敛速率 \|P_h-P_c\| = O(h)。证明基于离散化矩阵的渐近展开、一致有界性论证、Riccati算子的Fréchet线性化以及关联的Lyapunov算子逆的范数估计。

关键词：代数Riccati方程；离散时间系统；收敛性；采样周期；最优控制

---

1. 引言与定理陈述

考虑连续时间线性时不变系统

\dot{x}(t) = A x(t) + B u(t), \quad x(0)=x_0,

以及无穷时域二次代价

J_c = \int_0^\infty \big( x(t)^T Q x(t) + u(t)^T R u(t) \big) dt,

其中 Q \succeq 0，R \succ 0。标准假设为：(A,B) 可稳，(Q^{1/2},A) 可检测。

连续代数Riccati方程（CARE）为

A^T P + P A - P B R^{-1} B^T P + Q = 0, \tag{1}

其存在唯一半正定稳定解 P_c（即 A - B R^{-1} B^T P_c 是Hurwitz的）。

对采样周期 h > 0，将系统离散化：

x_{k+1} = A_d x_k + B_d u_k,\quad

A_d = e^{A h},\quad B_d = \int_0^h e^{A s} B\,ds,

离散代价

J_d = \sum_{k=0}^\infty \big( x_k^T Q_d x_k + u_k^T R u_k \big),\quad

Q_d = \int_0^h e^{A^T s} Q e^{A s} ds.

离散代数Riccati方程（DARE）为

A_d^T P_h A_d - P_h - A_d^T P_h B_d (R + B_d^T P_h B_d)^{-1} B_d^T P_h A_d + Q_d = 0. \tag{2}

对于每个充分小的 h > 0，(2) 存在唯一半正定稳定解 P_h（即 A_d - B_d K_h 谱半径小于1，其中 K_h = (R+B_d^T P_h B_d)^{-1} B_d^T P_h A_d）。

定理（主结果）：在可稳可检测假设下，当 h \to 0^+ 时，有

\lim_{h \to 0^+} P_h = P_c, \qquad \|P_h - P_c\| = O(h).

---

2. 预备引理：渐近展开

利用矩阵指数幂级数，可得以下一致展开（在算子范数意义下）：

\begin{aligned}

A_d &= I + A h + \tfrac12 A^2 h^2 + O(h^3),\\

B_d &= B h + \tfrac12 A B h^2 + O(h^3),\\

Q_d &= Q h + \tfrac12 (A^T Q + Q A) h^2 + O(h^3).

\end{aligned}

证明通过对指数级数逐项积分获得，余项为高阶无穷小，细节从略。

---

3. 第一步：P_h 的一致有界性

由 (A,B) 可稳，存在常数矩阵 K 使得 A - BK 是Hurwitz的。考虑离散化后的闭环矩阵：

A_d - B_d K = I + (A - BK)h + O(h^2).

由于 A-BK 的特征值实部均为负，当 h 充分小时，\rho(A_d - B_d K) < 1，即离散闭环指数稳定。

构造离散Lyapunov方程

(A_d - B_d K)^T \hat P_h (A_d - B_d K) - \hat P_h + (Q_d + K^T R K) = 0.

该方程存在唯一半正定解 \hat P_h，且由Lyapunov方程的扰动理论，\hat P_h 关于 h 一致有界（即存在 h_0>0 和常数 M_1 使得 \|\hat P_h\| \le M_1 对所有 0<h\le h_0）。

因为 P_h 是离散LQR问题的最优代价矩阵，它满足半正定序

P_h \preceq \hat P_h,

因此 \|P_h\| \le M_1。从而 \{P_h\}_{h\in(0,h_0]} 一致有界。

---

4. 第二步：Riccati残差算子的展开

定义离散Riccati残差算子

\mathcal{R}_h(P) := A_d^T P A_d - P + Q_d - A_d^T P B_d (R + B_d^T P B_d)^{-1} B_d^T P A_d.

显然 DARE (2) 等价于 \mathcal{R}_h(P_h)=0。

现在计算 \mathcal{R}_h(P_c)。将 A_d, B_d, Q_d 的展开代入，并利用CARE (1) 消去零阶项。具体步骤：

\begin{aligned}

A_d^T P_c A_d &= P_c + (A^T P_c + P_c A)h + O(h^2),\\

A_d^T P_c B_d &= P_c B h + O(h^2),\\

B_d^T P_c B_d &= h^2 B^T P_c B + O(h^3),\\

(R + B_d^T P_c B_d)^{-1} &= R^{-1} - h^2 R^{-1} B^T P_c B R^{-1} + O(h^3).

\end{aligned}

进而

A_d^T P_c B_d (R + B_d^T P_c B_d)^{-1} B_d^T P_c A_d = P_c B R^{-1} B^T P_c \cdot h + O(h^2).

同时 Q_d = Q h + O(h^2)。于是

\begin{aligned}

\mathcal{R}_h(P_c) &= \big[P_c + (A^T P_c + P_c A)h\big] - P_c + Q h - P_c B R^{-1} B^T P_c h + O(h^2)\\

&= h\big(A^T P_c + P_c A - P_c B R^{-1} B^T P_c + Q\big) + O(h^2).

\end{aligned}

括号内恰为CARE的左端，等于零。因此

\mathcal{R}_h(P_c) = O(h^2). \tag{3}

---

5. 第三步：线性化算子及其可逆性

记 E_h = P_h - P_c。将 \mathcal{R}_h(P_c+E_h) 在 P_c 处进行Fréchet展开：

\mathcal{R}_h(P_c+E_h) = \mathcal{R}_h(P_c) + \mathcal{L}_h(E_h) + \mathcal{N}_h(E_h),

其中 \mathcal{L}_h = D\mathcal{R}_h(P_c) 是线性算子，且存在常数 C_N 使得 \|\mathcal{N}_h(E_h)\| \le C_N \|E_h\|^2 对所有 \|E_h\|\le 1 一致成立（与 h 无关）。

通过直接计算（利用最优增益 K_c = R^{-1}B^T P_c 和离散闭环矩阵 A_{c,d} := A_d - B_d K_c）可得

\mathcal{L}_h(\Delta) = A_{c,d}^T \Delta A_{c,d} - \Delta + h \Phi_h(\Delta),

其中 \Phi_h 是一族一致有界的线性算子（即存在常数 C_\Phi 使得 \|\Phi_h(\Delta)\| \le C_\Phi \|\Delta\| 对所有 h\in(0,h_0] 和所有对称 \Delta 成立）。该式可以从Riccati线性化公式逐项展开并利用CARE消去零阶项得到，具体细节虽繁但属于标准计算。

定义 \mathcal{L}_0(\Delta) = A_{c,d}^T \Delta A_{c,d} - \Delta，则 \mathcal{L}_h = \mathcal{L}_0 + h \Phi_h。

引理（\mathcal{L}_0 的逆范数估计）：对于充分小的 h，

\|\mathcal{L}_0^{-1}\| \le \frac{1}{1 - \rho(A_{c,d})^2},

且存在常数 \mu>0 使得

1 - \rho(A_{c,d})^2 \ge \mu h.

从而 \|\mathcal{L}_0^{-1}\| = O(1/h)。

证明概要：由 A_{c,d} = I + A_c h + O(h^2)，其中 A_c = A - BK_c 是Hurwitz的，因此存在 \mu>0 使得 \rho(A_{c,d}) \le 1 - \mu h + O(h^2)。于是

1 - \rho(A_{c,d})^2 \ge 2\mu h + O(h^2) \ge \mu h

对充分小的 h 成立。离散Lyapunov算子 \mathcal{L}_0 的逆范数上界即上述表达式，因此 \|\mathcal{L}_0^{-1}\| \le 1/(\mu h)。∎

现在考虑 \mathcal{L}_h = \mathcal{L}_0 + h\Phi_h。由于 \|\mathcal{L}_0^{-1}\| \le 1/(\mu h) 且 \|\Phi_h\| \le C_\Phi，我们有

\|h\Phi_h \mathcal{L}_0^{-1}\| \le h C_\Phi \cdot \frac{1}{\mu h} = \frac{C_\Phi}{\mu}.

取 h 充分小使得 C_\Phi/\mu < 1，则 I + h\Phi_h \mathcal{L}_0^{-1} 可逆，且其逆的范数有界（例如不超过 1/(1 - C_\Phi/\mu)）。于是

\mathcal{L}_h = \mathcal{L}_0 (I + h\Phi_h \mathcal{L}_0^{-1})

可逆，且

\|\mathcal{L}_h^{-1}\| \le \| (I + h\Phi_h \mathcal{L}_0^{-1})^{-1} \| \cdot \|\mathcal{L}_0^{-1}\| \le \frac{1}{1 - C_\Phi/\mu} \cdot \frac{1}{\mu h} = O\!\left(\frac{1}{h}\right). \tag{4}

---

6. 第四步：误差方程与收敛阶

由 \mathcal{R}_h(P_h)=0 及展开式得

0 = \mathcal{R}_h(P_c) + \mathcal{L}_h(E_h) + \mathcal{N}_h(E_h),

即

\mathcal{L}_h(E_h) = -\mathcal{R}_h(P_c) - \mathcal{N}_h(E_h).

两边作用 \mathcal{L}_h^{-1} 并取范数：

\|E_h\| \le \|\mathcal{L}_h^{-1}\| \big( \|\mathcal{R}_h(P_c)\| + \|\mathcal{N}_h(E_h)\| \big).

代入已知估计：\|\mathcal{R}_h(P_c)\| \le C_R h^2（由(3)），\|\mathcal{N}_h(E_h)\| \le C_N \|E_h\|^2，以及 \|\mathcal{L}_h^{-1}\| \le C_L / h（由(4)）。于是

\|E_h\| \le \frac{C_L}{h} \big( C_R h^2 + C_N \|E_h\|^2 \big) = C_L C_R h + \frac{C_L C_N}{h} \|E_h\|^2.

记 e_h = \|E_h\|，a = C_L C_R，b = C_L C_N，则有

e_h \le a h + \frac{b}{h} e_h^2. \tag{5}

由第一步一致有界性，存在常数 M 使得 e_h \le M 对所有 h\in(0,h_0] 成立。选择 h 充分小，使得 \frac{b M}{h} \le \frac12，例如 h \le 2b M。则从(5)可得

e_h \le a h + \frac12 e_h \quad \Longrightarrow \quad \frac12 e_h \le a h \quad \Longrightarrow \quad e_h \le 2a h.

因此 \|P_h - P_c\| = e_h = O(h)。收敛性显然成立。

---

7. 结论

我们证明了在标准的可稳可检测假设下，离散代数Riccati方程的唯一半正定稳定解 P_h 当采样周期 h \to 0 时收敛到连续代数Riccati方程的解 P_c，且收敛速率至少为 O(h)。这一阶速率是最优的，与离散化本身的精度一致。该结果在数字控制系统的近似设计、采样系统的稳定性分析以及模型降阶等研究中具有基础性意义。

---

参考文献（示例，可按需补充）：

[1] 略。

[2] 略。

---

WriterShelf™ is a unique multiple pen name blogging and forum platform. Protect relationships and your privacy. Take your writing in new directions. ** Join WriterShelf**

WriterShelf™ is an open writing platform. The views, information and opinions in this article are those of the author.

222 极值-守恒-对称（ECS）全域统一理论离散系统的连续极限保结构延拓

224 离散代数Riccati方程收敛性与矩阵指数逼近的一致估计

Article info

This article is part of:

分類於:

科技

⟩

Android

⟩

OPPO

日期:

創作於：2026/05/11，最後更新於：2026/05/11。

合計：1723字

More to explore

5 Thanksgiving Gifts the Entire Family Can Enjoy

When people think of thanksgiving gift ideas, they select items to say thanks to the host. The rest of the family members are left out. But do you know there are some thanksgiving gifts for the whole family?
It is tough to shop for the entire fam...

2gj7w2vssdqrf7sq8l4cq3wr huclsge0otutlpdqkc8uzx2ytnndcemgbbnv3s1a7tcck8igc dlc7hlpaug2lsz6pcwijskgm vqx3ipyvur5jkzxehqdq0yjfrajkpli3emqi

Aurora 88 - A Gift from the Past

A vintage Aurora 88 has been on my grail list ever since I first encountered pelahale’s landmark video paean to this pen, HeartBreaking Pens of Staggering Genius. Naturally, I set my sights on an original 88 in Nikargenta.
The Aurora 88 is intr...