Discover how to measure and interpret the relationships among multiple asset returns, and learn practical applications for portfolio construction.
Let’s chat about something that’s crucial yet sometimes glossed over: how assets in a portfolio move relative to each other. We’re not just talking about whether they’re up or down by themselves, but how they interact when grouped together. This is where the concepts of covariance and correlation take center stage.
In day-to-day language, we might say, “Hmm, these two stocks always seem to rise and fall together. That’s weird.” In statistical-financial terms, we’d respond, “Yes, that’s because the correlation is quite high.” Maybe that’s not how your next dinner party conversation will go, but if you’re building or managing a portfolio, understanding these measures matters, big time.
Below, we’ll explore covariance and correlation in a practical, multi-asset setting. By the end, you should have a solid handle on how they’re calculated, why they matter, and how to use them when making investment decisions.
Covariance between two random variables (like returns on two different assets) essentially tracks how they move together around their means. If you’re comfortable with the idea of variance (the average of squared deviations from the mean for a single asset), you can think of covariance as the next step—only now, we’re looking at the interplay of two assets:
• If \(\mathrm{Cov}(R_i, R_j) \gt 0\), then when \(R_i\) (the return of asset \(i\)) is above its average, \(R_j\) (the return of asset \(j\)) also tends to be above its average.
• If \(\mathrm{Cov}(R_i, R_j) \lt 0\), they move in opposite directions; when \(R_i\) is above its average, \(R_j\) is typically below its average.
I remember once working with a friend who was balancing two fairly different investments: a technology growth stock fund and a gold mining stock. We plugged in about a year’s worth of weekly returns into a spreadsheet and calculated the covariance. Sure enough, it came out negative, which indicated that these two asset returns often moved in opposite directions. It wasn’t a foolproof guarantee, but it was a clue that these two investments might provide some risk diversification when combined in a single portfolio.
While covariance is a useful concept, the values you get can be a bit messy—positive or negative, large or small, depending on the scale of returns. To tidy things up, we use correlation, a standardized measure of how returns move together:
where \(\sigma_i\) is the standard deviation of \(R_i\), and \(\sigma_j\) is the standard deviation of \(R_j\).
• \(\rho_{i,j} = +1\) means perfect positive correlation: asset \(i\) and \(j\) move in lockstep.
• \(\rho_{i,j} = -1\) means perfect negative correlation: they move in perfectly opposite directions.
• \(\rho_{i,j} = 0\) means that, on average, there’s no linear relationship between the two assets.
Correlation is super handy for portfolio managers because it immediately signals the diversification potential of pairing assets together. For instance, if two assets are perfectly correlated (\(\rho=+1\)), diversifying among them won’t reduce risk (they move together). Conversely, if they’re negatively correlated (\(\rho<0\)), combining them might reduce the combined volatility, because when one zigs, the other zags.
Most portfolios aren’t just two assets. Once you go beyond a pair, everything you’ve learned about correlation and covariance expands into what’s called a variance–covariance (var-cov) matrix or a correlation matrix. Imagine you’ve got four assets. Then you have six unique pairwise covariances (for each pair of assets) that you can arrange in a tabular form. That’s what we typically call the covariance matrix. The correlation matrix is the same idea, except every element is standardized to fall between –1 and +1.
A covariance matrix for \(n\) assets is an \(n \times n\) table. The diagonal elements are just the variances of each asset. Off-diagonal elements are the pairwise covariances. A quick layout might look like this for three assets (\(A\), \(B\), \(C\)):
Because \(\mathrm{Cov}(A,B) = \mathrm{Cov}(B,A)\), the covariance matrix is symmetric.
A correlation matrix is very similar, except each off-diagonal entry is \(\rho_{i,j}\) rather than \(\mathrm{Cov}(i,j)\). The diagonal is all 1.0, since an asset is perfectly correlated with itself. That might look like:
We can have a small diagram to illustrate how these pairwise relationships connect:
graph LR A["Asset A <br/> Var(A) on diagonal"] -- "Cov(A,B)" --> B["Asset B <br/> Var(B) on diagonal"] A["Asset A <br/> Var(A) on diagonal"] -- "Cov(A,C)" --> C["Asset C <br/> Var(C) on diagonal"] B["Asset B <br/> Var(B) on diagonal"] -- "Cov(B,C)" --> C["Asset C <br/> Var(C) on diagonal"]
While a real matrix might contain dozens or even hundreds of assets, conceptually it’s the same: each pair of assets gets its own covariance (or correlation) entry.
A correlation close to +1.0 signals that two assets’ returns move in a highly synchronized way. If you’re hooking them into a portfolio, that means they’ll amplify each other’s movements—good or bad!
Meanwhile, a correlation close to –1.0 represents a near–perfect hedge. While these pairs are rare in practice, especially through varied market conditions, it’s a portfolio manager’s dream scenario because the combined variance can be dramatically reduced.
Even if correlations aren’t extremely negative, any correlation below +1.0 offers some diversification benefit. For example, if large-cap US equities have a correlation of 0.55 with small-cap international equities, that 0.55 might still help reduce overall portfolio volatility compared to holding only one equity asset.
Let’s recall that the variance of a two-asset portfolio is:
Replacing covariance with correlation:
In a multi-asset environment, you expand that formula to account for more terms. Essentially, you sum over all pairwise covariances, weighed by the proportion of each asset in the portfolio.
Covariance and correlation capture how assets move in tandem. But each asset also carries its own unique (idiosyncratic) risk. This is risk that’s not explained by broad market movements. A high correlation often hints a greater amount of shared or systematic risk, which might be driven by the overall market, macroeconomic factors, or sector influences.
• Systematic risk: Driven by broad factors (e.g., interest rates, inflation expectations, GDP trends).
• Idiosyncratic risk: Specific to individual assets, such as a corporate scandal or a product recall.
When you build a diversified portfolio, you’re typically trying to reduce idiosyncratic risk by holding multiple assets that don’t move in tandem. Meanwhile, you can’t fully dodge systematic risk—though you can sometimes hedge or mitigate bits and pieces of it (for instance, by adding certain derivatives).
• Use consistent data intervals: If you’re measuring monthly returns for correlation, do that for every asset in your sample. Mixing daily for one asset and monthly for another can lead to misleading estimates.
• Pick a relevant timeframe: Correlations can change over longer horizons, or in different market regimes. Watch out if you’re using data from the wrong historical period.
• Consider regime shifts: Sometimes, correlations spike in a market crisis (often called “risk-on/risk-off” behavior), undermining what looked like a sweet diversification benefit in stable times.
• Monitor correlation drift: Real-world correlations aren’t static. Keep checking them periodically.
• Don’t over-optimize: Searching for “perfect” negative correlations might make your portfolio too reliant on a single historical pattern that won’t hold in the future.
Once, I spent months building a multi-factor model that predicted future correlations among certain commodity and equity indexes. I was excited when back-tests showed that the correlations would remain low, offering a squeaky-clean diversification result. But then a sudden geopolitical event hit, and correlations among everything—equities, commodities, even certain bond sectors—skyrocketed practically overnight. The lesson: data analysis is great, but the market doesn’t read our spreadsheets. Always be prepared for new patterns to emerge.
• Watch Out for Leptokurtic Distributions: If returns are not normally distributed (for instance, if they have fat tails and frequent outliers), correlation might not tell the whole story.
• Spurious Correlation: Some assets might show correlation for random reasons. Always see if there’s a fundamental or market-based rationale.
• Overfitting: In advanced modeling (e.g., building large-scale covariance matrices using factor models), you can introduce noise or rely too heavily on short data samples.
• Arch/GARCH: For advanced analysis of volatility clustering, more nuanced approaches (like Arch/GARCH) can be used to capture changes in correlation over time.
If you’re curious how calculations might look in Python:
1import numpy as np
2import pandas as pd
3
4# Each column in df_returns is the return series for an asset
5
6cov_matrix = df_returns.cov() # Covariance matrix
7corr_matrix = df_returns.corr() # Correlation matrix
8
9print("Covariance Matrix:")
10print(cov_matrix)
11
12print("\nCorrelation Matrix:")
13print(corr_matrix)
Just keep in mind that real-world data cleaning and time alignment can be more involved.
• Formulas: Memorize the correlation formula inside-out so you can apply it quickly under exam conditions.
• Interpretation: Be ready to interpret correlation and covariance in words: “A positive covariance means these assets tend to move in the same direction, which can limit diversification.”
• Application: In multi-asset portfolio questions, pay special attention to how each correlation or covariance factor influences overall portfolio risk.
• Pitfalls: Remember that correlation doesn’t imply causation. The exam might require you to identify spurious relationships or watch out for different time horizons.
• Market Regimes: The CFA Program increasingly emphasizes scenario analysis and market stress. Correlations often rise in turbulent markets, so mention that if you see scenario-based questions.
• Bodie, Z., Kane, A., & Marcus, A. (2019). “Investments.” McGraw-Hill Education.
• CFA Institute Level I Curriculum, “Quantitative Methods: Correlation and Covariance.”
• Tsay, R.S. (2010). “Analysis of Financial Time Series.” Wiley.
Important Notice: FinancialAnalystGuide.com provides supplemental CFA study materials, including mock exams, sample exam questions, and other practice resources to aid your exam preparation. These resources are not affiliated with or endorsed by the CFA Institute. CFA® and Chartered Financial Analyst® are registered trademarks owned exclusively by CFA Institute. Our content is independent, and we do not guarantee exam success. CFA Institute does not endorse, promote, or warrant the accuracy or quality of our products.