Optimization of Conditional Value-at-Risk
Authors: R. Tyrrell Rockafellar, Stanislav Uryasev · Year: 2000 · Venue: Journal of Risk, 2(3):21–41
Raw full-text conversion pending — the LAN/marker cluster is down (2026-07-04). This page was built from the full text of the author-hosted PDF read via firecrawl (sites.math.washington.edu/~rtr/papers/rtr179-CVaR1.pdf), not a marker conversion. File the PDF to
Docs/raw/pdf/rockafellar2000optimization.pdfand run marker when the cluster returns.
Summary
This is the foundational paper that turns Conditional Value-at-Risk (CVaR) from a descriptive tail statistic into a tractable optimization objective. Rockafellar and Uryasev introduce an auxiliary function whose minimization over the extra scalar recovers the -CVaR of a loss, and whose joint minimization over the decision and is equivalent to minimizing CVaR directly. Because is convex (and, under sampling, piecewise-linear), CVaR minimization reduces to convex programming — an elementary linear program for scenario-sampled linear losses — while the -VaR is obtained for free as a by-product. The construction requires no assumption of normality and works for any loss distribution given as samples.
Key Claims
- CVaR minimization needs no prior VaR computation. The -CVaR equals (Theorem 1), where the awkward, quantile-dependent definition of CVaR is replaced by minimizing a smooth convex function of one extra variable; the minimizing is the -VaR.
- Joint convex program. Minimizing -CVaR over is equivalent to minimizing jointly over (Theorem 2). is jointly convex in whenever the loss is convex in ; then, for convex , the whole thing is convex programming.
- Sampling ⇒ linear programming. Approximating the expectation by Monte-Carlo (or quasi-random) samples makes convex and piecewise-linear in ; introducing one auxiliary variable per scenario reduces the minimization to an LP — independent of the sample distribution (no normality needed).
- CVaR dominates VaR and is coherent. By construction , so low CVaR forces low VaR; unlike VaR, CVaR is subadditive/convex (citing Pflug 2000; Artzner et al. 1999), so it avoids VaR’s multiple local extrema that make VaR hard to optimize.
- Numerically validated. On a 3-instrument portfolio under a normal model, the LP-based min-CVaR solution matches the Markowitz min-variance benchmark (a Proposition they prove coincides for under normality with an active return constraint); on a non-normal NIKKEI options hedge, min-CVaR and min-variance diverge and CVaR captures tail risk that VaR misses.
Method
Let be the loss for decision and random vector with density . The loss CDF is . In the paper’s notation is the confidence level (typically 0.90/0.95/0.99), so the risk lives in the upper tail; -VaR is and -CVaR is , the conditional expectation of loss at or above the VaR. The proof (Appendix) rests on a Lemma (from Shapiro & Wardi 1994): with , is convex, , with ; hence , whose zero set is exactly the VaR interval. The continuity/no-jump assumption on is stated as a simplification (the general/atomic case is deferred — later handled in the 2002 follow-up, CVaR for General Loss Distributions, J. Banking & Finance 26:1443–1471).
Regime. This is a finance/stochastic-programming paper; it is regime-agnostic (no manipulator dynamics, no free-flying vs free-floating distinction). Its relevance is entirely at the risk/optimization layer — it supplies the machinery by which any planner’s scalar loss can be turned into a convex CVaR objective.
Convention clash with notation.md / conditional_value_at_risk
R&U use for the confidence level and place the tail at , writing -CVaR . The wiki’s conditional_value_at_risk page uses for the confidence level and writes (subscript = tail mass). They denote the same quantity under . Equations below are transcribed in R&U’s own symbols.
Relevance to thesis
This is the primary provenance for every CVaR computation in the risk layer of the thesis. For risk-aware view scoring on the free-flying manipulator, whatever scalar loss a viewpoint/trajectory emits (e.g. a versine pointing error, a singularity-proximity margin, a collision-distance shortfall), Theorem 2 says we can constrain or minimize its CVaR by adding one scalar variable and solving a convex program — reducing to an LP once the loss is sampled from the sim. Crucially, the reduction needs no distributional assumption, which matters because our inspection uncertainty (estimation error, thruster/contact disturbance) is not Gaussian. This is the load-bearing “how” behind CVaR; the “why CVaR over a chance constraint” is the tail-severity + coherence argument (majumdar2017how) and the tightest-convex-approximation result (nemirovski2006convex).
Connections
Topics: conditional_value_at_risk · value_at_risk · coherent_risk_measures · chance_constraints Sources: nemirovski2006convex (uses this CVaR form as the tightest convex approximation of a chance constraint) · majumdar2017how · dixit2023risk · ren2022chance (all use the R&U optimization form)
Key Equations / Quotes
-VaR and -CVaR (Eqs. 2–3):
Auxiliary function (Eq. 4) and the two theorems (Eqs. 5, 10):
Sampled (LP-ready) approximation with scenarios (Eq. 9):
“The -VaR is never more than the -CVaR, so portfolios with low CVaR must have low VaR as well.” (Introduction)
“-CVaR can be calculated without first having to calculate the -VaR on which its definition depends… The -VaR may be obtained instead as a byproduct.” (§2, after Thm 1)
Open Questions
- The paper assumes the loss CDF is continuous (no atoms). Sample-based sim losses are discrete — does the atomic-distribution refinement of the 2002 follow-up (weighted VaR) change the LP for our scenario counts?
- Theorem 2’s convexity needs convex in ; a collision/pointing loss routed through the nonlinear circumcentroidal dynamics (ffsm_dynamics) is generally non-convex. What local/linearized surrogate keeps the CVaR reduction convex?
- R&U optimize CVaR of a single-period loss; multi-step inspection guidance raises the time_consistency issue flagged by majumdar2017how — does a per-step nested CVaR still admit the LP reduction?