Optimization of Conditional Value-at-Risk

Authors: R. Tyrrell Rockafellar, Stanislav Uryasev · Year: 2000 · Venue: Journal of Risk, 2(3):21–41

Raw full-text conversion pending — the LAN/marker cluster is down (2026-07-04). This page was built from the full text of the author-hosted PDF read via firecrawl (sites.math.washington.edu/~rtr/papers/rtr179-CVaR1.pdf), not a marker conversion. File the PDF to Docs/raw/pdf/rockafellar2000optimization.pdf and run marker when the cluster returns.

Summary

This is the foundational paper that turns Conditional Value-at-Risk (CVaR) from a descriptive tail statistic into a tractable optimization objective. Rockafellar and Uryasev introduce an auxiliary function whose minimization over the extra scalar recovers the -CVaR of a loss, and whose joint minimization over the decision and is equivalent to minimizing CVaR directly. Because is convex (and, under sampling, piecewise-linear), CVaR minimization reduces to convex programming — an elementary linear program for scenario-sampled linear losses — while the -VaR is obtained for free as a by-product. The construction requires no assumption of normality and works for any loss distribution given as samples.

Key Claims

  • CVaR minimization needs no prior VaR computation. The -CVaR equals (Theorem 1), where the awkward, quantile-dependent definition of CVaR is replaced by minimizing a smooth convex function of one extra variable; the minimizing is the -VaR.
  • Joint convex program. Minimizing -CVaR over is equivalent to minimizing jointly over (Theorem 2). is jointly convex in whenever the loss is convex in ; then, for convex , the whole thing is convex programming.
  • Sampling ⇒ linear programming. Approximating the expectation by Monte-Carlo (or quasi-random) samples makes convex and piecewise-linear in ; introducing one auxiliary variable per scenario reduces the minimization to an LP — independent of the sample distribution (no normality needed).
  • CVaR dominates VaR and is coherent. By construction , so low CVaR forces low VaR; unlike VaR, CVaR is subadditive/convex (citing Pflug 2000; Artzner et al. 1999), so it avoids VaR’s multiple local extrema that make VaR hard to optimize.
  • Numerically validated. On a 3-instrument portfolio under a normal model, the LP-based min-CVaR solution matches the Markowitz min-variance benchmark (a Proposition they prove coincides for under normality with an active return constraint); on a non-normal NIKKEI options hedge, min-CVaR and min-variance diverge and CVaR captures tail risk that VaR misses.

Method

Let be the loss for decision and random vector with density . The loss CDF is . In the paper’s notation is the confidence level (typically 0.90/0.95/0.99), so the risk lives in the upper tail; -VaR is and -CVaR is , the conditional expectation of loss at or above the VaR. The proof (Appendix) rests on a Lemma (from Shapiro & Wardi 1994): with , is convex, , with ; hence , whose zero set is exactly the VaR interval. The continuity/no-jump assumption on is stated as a simplification (the general/atomic case is deferred — later handled in the 2002 follow-up, CVaR for General Loss Distributions, J. Banking & Finance 26:1443–1471).

Regime. This is a finance/stochastic-programming paper; it is regime-agnostic (no manipulator dynamics, no free-flying vs free-floating distinction). Its relevance is entirely at the risk/optimization layer — it supplies the machinery by which any planner’s scalar loss can be turned into a convex CVaR objective.

Convention clash with notation.md / conditional_value_at_risk

R&U use for the confidence level and place the tail at , writing -CVaR . The wiki’s conditional_value_at_risk page uses for the confidence level and writes (subscript = tail mass). They denote the same quantity under . Equations below are transcribed in R&U’s own symbols.

Relevance to thesis

This is the primary provenance for every CVaR computation in the risk layer of the thesis. For risk-aware view scoring on the free-flying manipulator, whatever scalar loss a viewpoint/trajectory emits (e.g. a versine pointing error, a singularity-proximity margin, a collision-distance shortfall), Theorem 2 says we can constrain or minimize its CVaR by adding one scalar variable and solving a convex program — reducing to an LP once the loss is sampled from the sim. Crucially, the reduction needs no distributional assumption, which matters because our inspection uncertainty (estimation error, thruster/contact disturbance) is not Gaussian. This is the load-bearing “how” behind CVaR; the “why CVaR over a chance constraint” is the tail-severity + coherence argument (majumdar2017how) and the tightest-convex-approximation result (nemirovski2006convex).

Connections

Topics: conditional_value_at_risk · value_at_risk · coherent_risk_measures · chance_constraints Sources: nemirovski2006convex (uses this CVaR form as the tightest convex approximation of a chance constraint) · majumdar2017how · dixit2023risk · ren2022chance (all use the R&U optimization form)

Key Equations / Quotes

-VaR and -CVaR (Eqs. 2–3):

Auxiliary function (Eq. 4) and the two theorems (Eqs. 5, 10):

Sampled (LP-ready) approximation with scenarios (Eq. 9):

“The -VaR is never more than the -CVaR, so portfolios with low CVaR must have low VaR as well.” (Introduction)

-CVaR can be calculated without first having to calculate the -VaR on which its definition depends… The -VaR may be obtained instead as a byproduct.” (§2, after Thm 1)

Open Questions

  • The paper assumes the loss CDF is continuous (no atoms). Sample-based sim losses are discrete — does the atomic-distribution refinement of the 2002 follow-up (weighted VaR) change the LP for our scenario counts?
  • Theorem 2’s convexity needs convex in ; a collision/pointing loss routed through the nonlinear circumcentroidal dynamics (ffsm_dynamics) is generally non-convex. What local/linearized surrogate keeps the CVaR reduction convex?
  • R&U optimize CVaR of a single-period loss; multi-step inspection guidance raises the time_consistency issue flagged by majumdar2017how — does a per-step nested CVaR still admit the LP reduction?