Optimization of Conditional Value-at-Risk

Authors: R. Tyrrell Rockafellar, Stanislav Uryasev · Year: 2000 · Venue: Journal of Risk, 2(3):21–41

Raw full-text conversion pending — the LAN/marker cluster is down (2026-07-04). This page was built from the full text of the author-hosted PDF read via firecrawl (sites.math.washington.edu/~rtr/papers/rtr179-CVaR1.pdf), not a marker conversion. File the PDF to Docs/raw/pdf/rockafellar2000optimization.pdf and run marker when the cluster returns.

Summary

This is the foundational paper that turns Conditional Value-at-Risk (CVaR) from a descriptive tail statistic into a tractable optimization objective. Rockafellar and Uryasev introduce an auxiliary function $F_{β} (x, α)$ whose minimization over the extra scalar $α$ recovers the $β$ -CVaR of a loss, and whose joint minimization over the decision $x$ and $α$ is equivalent to minimizing CVaR directly. Because $F_{β}$ is convex (and, under sampling, piecewise-linear), CVaR minimization reduces to convex programming — an elementary linear program for scenario-sampled linear losses — while the $β$ -VaR is obtained for free as a by-product. The construction requires no assumption of normality and works for any loss distribution given as samples.

Key Claims

CVaR minimization needs no prior VaR computation. The $β$ -CVaR $ϕ_{β} (x)$ equals $min_{α} F_{β} (x, α)$ (Theorem 1), where the awkward, quantile-dependent definition of CVaR is replaced by minimizing a smooth convex function of one extra variable; the minimizing $α$ is the $β$ -VaR.
Joint convex program. Minimizing $β$ -CVaR over $x \in X$ is equivalent to minimizing $F_{β} (x, α)$ jointly over $(x, α) \in X \times R$ (Theorem 2). $F_{β}$ is jointly convex in $(x, α)$ whenever the loss $f (x, y)$ is convex in $x$ ; then, for convex $X$ , the whole thing is convex programming.
Sampling ⇒ linear programming. Approximating the expectation by $q$ Monte-Carlo (or quasi-random) samples makes $\tilde{F}_{β}$ convex and piecewise-linear in $α$ ; introducing one auxiliary variable $u_{k}$ per scenario reduces the minimization to an LP — independent of the sample distribution (no normality needed).
CVaR dominates VaR and is coherent. By construction $ϕ_{β} (x) \geq α_{β} (x)$ , so low CVaR forces low VaR; unlike VaR, CVaR is subadditive/convex (citing Pflug 2000; Artzner et al. 1999), so it avoids VaR’s multiple local extrema that make VaR hard to optimize.
Numerically validated. On a 3-instrument portfolio under a normal model, the LP-based min-CVaR solution matches the Markowitz min-variance benchmark (a Proposition they prove coincides for $β \geq 0.5$ under normality with an active return constraint); on a non-normal NIKKEI options hedge, min-CVaR and min-variance diverge and CVaR captures tail risk that VaR misses.

Method

Let $f (x, y)$ be the loss for decision $x \in X \subseteq R^{n}$ and random vector $y \in R^{m}$ with density $p (y)$ . The loss CDF is $Ψ (x, α) = \int_{f (x, y) \leq α} p (y) d y$ . In the paper’s notation $β \in (0, 1)$ is the confidence level (typically 0.90/0.95/0.99), so the risk lives in the upper $(1 - β)$ tail; $β$ -VaR is $α_{β} (x)$ and $β$ -CVaR is $ϕ_{β} (x)$ , the conditional expectation of loss at or above the VaR. The proof (Appendix) rests on a Lemma (from Shapiro & Wardi 1994): with $G (α) = E [f (x, y) - α]^{+}$ , $G$ is convex, $C^{1}$ , with $G^{'} (α) = Ψ (x, α) - 1$ ; hence $\partial_{α} F_{β} = (1 - β)^{- 1} [Ψ (x, α) - β]$ , whose zero set is exactly the VaR interval. The continuity/no-jump assumption on $Ψ$ is stated as a simplification (the general/atomic case is deferred — later handled in the 2002 follow-up, CVaR for General Loss Distributions, J. Banking & Finance 26:1443–1471).

Regime. This is a finance/stochastic-programming paper; it is regime-agnostic (no manipulator dynamics, no free-flying vs free-floating distinction). Its relevance is entirely at the risk/optimization layer — it supplies the machinery by which any planner’s scalar loss can be turned into a convex CVaR objective.

Convention clash with notation.md / conditional_value_at_risk

R&U use $β$ for the confidence level and place the tail at $1 - β$ , writing $β$ -CVaR $= ϕ_{β}$ . The wiki’s conditional_value_at_risk page uses $α$ for the confidence level and writes $CVaR_{1 - α}$ (subscript = tail mass). They denote the same quantity under $β \leftrightarrow α$ . Equations below are transcribed in R&U’s own symbols.

Relevance to thesis

This is the primary provenance for every CVaR computation in the risk layer of the thesis. For risk-aware view scoring on the free-flying manipulator, whatever scalar loss a viewpoint/trajectory emits (e.g. a versine pointing error, a singularity-proximity margin, a collision-distance shortfall), Theorem 2 says we can constrain or minimize its CVaR by adding one scalar variable $α$ and solving a convex program — reducing to an LP once the loss is sampled from the sim. Crucially, the reduction needs no distributional assumption, which matters because our inspection uncertainty (estimation error, thruster/contact disturbance) is not Gaussian. This is the load-bearing “how” behind CVaR; the “why CVaR over a chance constraint” is the tail-severity + coherence argument (majumdar2017how) and the tightest-convex-approximation result (nemirovski2006convex).

Connections

Topics: conditional_value_at_risk · value_at_risk · coherent_risk_measures · chance_constraints Sources: nemirovski2006convex (uses this CVaR form as the tightest convex approximation of a chance constraint) · majumdar2017how · dixit2023risk · ren2022chance (all use the R&U optimization form)

Key Equations / Quotes

$β$ -VaR and $β$ -CVaR (Eqs. 2–3):

α_{β} (x) = min {α \in R : Ψ (x, α) \geq β} ϕ_{β} (x) = (1 - β)^{- 1} \int_{f (x, y) \geq α_{β} (x)} f (x, y) p (y) d y

Auxiliary function (Eq. 4) and the two theorems (Eqs. 5, 10):

F_{β} (x, α) = α + (1 - β)^{- 1} \int_{y \in R^{m}} [f (x, y) - α]^{+} p (y) d y

ϕ_{β} (x) = α \in R min F_{β} (x, α) (Thm 1) x \in X min ϕ_{β} (x) = (x, α) \in X \times R min F_{β} (x, α) (Thm 2)

Sampled (LP-ready) approximation with $q$ scenarios (Eq. 9):

\tilde{F}_{β} (x, α) = α + \frac{1}{q ( 1 - β )} k = 1 \sum q [f (x, y_{k}) - α]^{+}

“The $β$ -VaR is never more than the $β$ -CVaR, so portfolios with low CVaR must have low VaR as well.” (Introduction)

“ $β$ -CVaR can be calculated without first having to calculate the $β$ -VaR on which its definition depends… The $β$ -VaR may be obtained instead as a byproduct.” (§2, after Thm 1)

Open Questions

The paper assumes the loss CDF $Ψ (x, α)$ is continuous (no atoms). Sample-based sim losses are discrete — does the atomic-distribution refinement of the 2002 follow-up (weighted VaR) change the LP for our scenario counts?
Theorem 2’s convexity needs $f (x, y)$ convex in $x$ ; a collision/pointing loss routed through the nonlinear circumcentroidal dynamics (ffsm_dynamics) is generally non-convex. What local/linearized surrogate keeps the CVaR reduction convex?
R&U optimize CVaR of a single-period loss; multi-step inspection guidance raises the time_consistency issue flagged by majumdar2017how — does a per-step nested CVaR still admit the $F_{β}$ LP reduction?

Inspection with a Free-Flying Space Manipulator

Explorer

rockafellar2000optimization

Optimization of Conditional Value-at-Risk

Summary

Key Claims

Method

Relevance to thesis

Connections

Key Equations / Quotes

Open Questions

Graph View

Table of Contents

Backlinks