Volatility
Volatility is the annualized standard deviation of a portfolio's returns. It measures how much returns swing around their average, and is the standard single-number proxy for risk in modern finance.
What is Volatility?
Volatility describes the dispersion of a return series. A portfolio that returns 10 percent every year with no variation has zero volatility. A portfolio that returns 50 percent one year and loses 30 percent the next, even if it averages out to the same compound return, has high volatility. The standard way to make that idea precise is to compute the standard deviation of periodic returns and scale it up to an annual figure, which is what the term volatility almost always refers to in finance.
There are two related but distinct flavors to keep separate. Realized (or historical) volatility is measured from observed returns, which is the version that appears in a backtest result. Implied volatility is inferred from option prices, which is what option traders quote when they say a stock is trading at 25 vol. The two usually move together but differ in important moments. A backtest only ever produces realized volatility, because there is no live options market to read implied vol from inside a historical simulation.
As rough calibration, the S&P 500 has had annualized realized volatility in the 15 to 20 percent range over most multi-decade windows, with brief spikes above 40 percent in 2008 and March 2020. A diversified equity portfolio under 12 percent vol is unusually calm, and over 25 percent is unusually rough. Single stocks can run 30 to 60 percent vol routinely. Concentrated factor portfolios often land between a broad index and a single stock. Volatility shows up as the denominator of the Sharpe ratio, which is why most discussions of risk-adjusted performance start by anchoring on it.
Formula
The monthly standard deviation is computed with the (n minus 1) divisor, which is the convention in finance software and in academic literature. Returns are simple percentage changes between successive monthly portfolio values, net of any transaction costs already deducted in the simulation. The mean used inside the standard deviation calculation is the sample mean of the same monthly returns, not a hardcoded baseline.
The square-root-of-time annualization assumes that monthly returns are roughly independent and identically distributed. That assumption is reasonable for diversified equity portfolios at monthly frequency. It breaks down for daily returns in markets with serial correlation (where today's return depends on yesterday's), which is one reason monthly returns are the more common base for annualized volatility. If a portfolio is tested on daily data instead, the same idea applies with a √252 multiplier (252 trading days in a year), but the result will typically be lower than the monthly-derived figure because daily autocorrelation gets understated.
Why Volatility Matters in Backtesting
Volatility is the building block of most risk-adjusted metrics. The Sharpe ratio divides excess return by volatility. The Sortino ratio uses a variant focused on downside volatility. Information ratios use tracking-error volatility. Reading a strategy's volatility in isolation tells you how rough the ride was. Reading it next to the benchmark's volatility tells you whether the strategy is taking more or less risk than the index it claims to beat.
For position sizing, volatility is the central input. A portfolio targeting 10 percent annual volatility scales positions differently than one targeting 20 percent. For risk parity strategies, weights are explicitly set inversely proportional to component volatilities. Even for investors who do not formally vol-target, knowing a strategy's volatility is what allows them to scale it appropriately against the rest of their portfolio.
Volatility also correlates with, but is not the same as, drawdown. Higher-vol strategies generally have larger drawdowns, but the relationship is not deterministic. A strategy can have moderate volatility and still produce an outsized drawdown if its bad months cluster, or it can have high volatility and small drawdowns if its big moves are spread out and reverse quickly. Reading volatility and Max Drawdown together is the cleanest way to see whether a strategy's risk shows up as steady churn or as concentrated shocks.
How SledgeKey Implements Volatility
Volatility is reported as an annualized percentage on the backtest results page, sitting alongside Total Return, Annual Return, Sharpe Ratio, and Max Drawdown. It is derived from the simulated portfolio value series sampled at month ends, converted to monthly simple returns, then run through a sample standard deviation and multiplied by the square root of 12.
The benchmark's volatility is computed the same way using the benchmark's monthly returns over the identical backtest window. The two figures sit on matching methodology, which makes the side-by-side comparison meaningful. If a tested strategy posted 14 percent annualized volatility while SPY posted 16 percent in the same window, that is a real apples-to-apples observation, not a methodology artifact.
Because monthly returns are derived from end-of-month portfolio values, intra-month volatility is invisible to the reported figure. A strategy that gyrates wildly inside a month but lands flat at each month end will show a calm volatility number. This is a feature of monthly volatility, not a bug, because it lines up with how long-horizon investors typically experience their portfolio. If you want to see the bumps inside a month, look at Max Drawdown and the drawdown episode list, not at annualized vol.
Common Pitfalls
The most common pitfall is treating volatility as the whole picture of risk. Volatility captures average dispersion but says nothing about skew (whether bad months are bigger than good ones) or kurtosis (whether tails are fatter than a normal distribution would predict). A strategy can run for years with calm vol and then produce a single month so bad it erases the prior cumulative return. The Sharpe ratio of that strategy will look terrific right up until the bad month and terrible immediately after. Pair volatility with skewness, kurtosis, and Max Drawdown to round out the risk view.
A second pitfall is comparing volatilities computed over different windows or frequencies. A strategy's vol from a calm decade like the 2010s is mechanically lower than the same strategy's vol from a turbulent decade like 2000 to 2010. Daily vol annualized via √252 is almost always lower than monthly vol annualized via √12 for the same portfolio. Whenever you compare two vol numbers, confirm they are computed on the same window length and the same return frequency before drawing conclusions.
A third pitfall is forgetting that volatility penalizes upside variability the same as downside. A portfolio that occasionally pops up 15 percent in a month adds to its volatility figure just as much as a portfolio that occasionally drops 15 percent. Most investors do not actually mind upside surprises, which is why downside-only measures like Sortino exist. When a strategy has clearly asymmetric returns (a value strategy that has rare large drawdowns, or a momentum strategy that has rare large rallies), volatility will misstate the actual investor experience.
Volatility is not risk; it is one dimension of risk. A strategy with low volatility and a single catastrophic month is riskier in any reasonable sense than a strategy with steady high vol and no blowups. Always pair volatility with Max Drawdown and a glance at skew before judging a strategy on dispersion alone.
See volatility in your own backtest
Run a backtest on any screening strategy and see annualized volatility computed on point-in-time data alongside the benchmark, free.
Run a Backtest