Value at Risk (VaR) Analysis in Python

Compute Value at Risk (VaR), Conditional VaR (CVaR), and maximum drawdown for a stock portfolio using historical simulation.

This page shows an AI data analysis example. A workflow is a list of prompts (instructions) used to complete one analysis task step by step. We send the same prompt list to multiple AI models, then compare their notebook outputs and scores.

What this AI workflow does

This AI Data Analyst workflow downloads 3 years of SPY, QQQ, and GLD price data, computes daily returns, and builds an equal-weight portfolio return series. It estimates 95% and 99% Value at Risk (VaR) using historical simulation and computes 95% Conditional VaR (CVaR/Expected Shortfall). It visualizes the return distribution with VaR and CVaR thresholds and reports maximum drawdown for the portfolio.

Who this example is for

This is for analysts and students who need a reproducible Python notebook for portfolio risk metrics from market data. It helps validate downside risk estimates and communicate them with standard plots and summary statistics.

Expected analysis outcomes

These are the results the AI workflow is expected to generate.

Daily equal-weight portfolio return series
Historical-simulation VaR estimates at 95% and 99%
CVaR (Expected Shortfall) at 95%
Maximum drawdown statistic
Histogram of returns with VaR/CVaR threshold lines

Tools and libraries used

Main Python packages and tooling used to run this AI data analysis task.

yfinance
pandas
numpy
matplotlib
scipy

Prompt sequence

This is the exact list of prompts used in this workflow. The same prompt sequence is sent to each model so outputs and scores can be compared fairly.

1download 3 years of SPY, QQQ, GLD prices and compute daily portfolio returns assuming equal weights with yfinance
2calculate 95% and 99% Value at Risk using historical simulation
3calculate Conditional VaR (Expected Shortfall) at 95%
4plot the return distribution with VaR and CVaR thresholds marked

Model Comparison

We compared several LLM models on the same analysis task. The table below shows their scores side by side. You can also open each model run to check the full conversation and notebook results.

Model Source	Score	Task	Execution	Output	Reasoning	Reliability	Run
glm-5.1	10/10	2/2	2/2	3/3	2/2	1/1	Open glm-5.1 conversation
gpt-oss:120b	10/10	2/2	2/2	3/3	2/2	1/1	Open gpt-oss:120b conversation
qwen3-coder-next	10/10	2/2	2/2	3/3	2/2	1/1	Open qwen3-coder-next conversation
gemma4:31b	9/10	2/2	2/2	2/3	2/2	1/1	Open gemma4:31b conversation
gpt-5.4	9/10	2/2	2/2	2/3	2/2	1/1	Open gpt-5.4 conversation
qwen3.5:397b	2/10	1/2	0/2	0/3	1/2	0/1	Open qwen3.5:397b conversation

How scoring works

Each run is graded on five dimensions:

Task Completion (0-2): how fully the workflow answered the requested analysis task and completed required steps.
Execution Correctness (0-2): whether generated code is correct, consistent, and likely runnable without major fixes.
Output Quality (0-3): how well tables, charts, and results match the expected analytical outcomes.
Reasoning Quality (0-2): clarity and correctness of interpretation, explanation, and conclusions.
Reliability (0-1): robustness of the run, including low hallucination risk and stable behavior.

The final score is the sum of these dimensions, so total score = 0-10. Higher values indicate more complete, correct, and reliable notebook results for this workflow.

AI Analysis with qwen3.5:397b

This run uses AI with a step-by-step list of prompts to perform the analysis task Value at Risk (VaR) Analysis in Python. Below, you can review the full conversation generated with qwen3.5:397b. After the conversation, we show detailed score cards for this run.

download 3 years of SPY, QQQ, GLD prices and compute daily portfolio returns assuming equal weights with yfinance

Thinking

The user wants to download 3 years of price data for SPY, QQQ, and GLD using yfinance, then compute daily portfolio returns with equal weights. This is a straightforward data retrieval and calculation task that I can do in one code block.

Let me write clean code to:

Download 3 years of historical data for all three ETFs
Extract adjusted close prices
Calculate daily returns for each
Compute equal-weighted portfolio returns (1/3 weight each)

I'll do this in one step since it's a simple data retrieval and calculation task.

I'll download 3 years of price data for SPY, QQQ, and GLD, then compute equal-weighted daily portfolio returns.

Notebook code

import yfinance as yf
import pandas as pd

# Download 3 years of historical data for SPY, QQQ, GLD
tickers = ['SPY', 'QQQ', 'GLD']
data = yf.download(tickers, period='3y')

# Recent yfinance versions return a MultiIndex on columns: (Ticker, Attribute)
# We need to select the 'Adj Close' attribute for all tickers.
# Using .loc to safely select the column level avoids KeyError if structure varies slightly.
if isinstance(data.columns, pd.MultiIndex):
    # Select 'Adj Close' from the second level of the MultiIndex
    data = data.loc[:, (slice(None), 'Adj Close')]
    # Flatten columns to just ticker names if needed, or keep MultiIndex for clarity
    # Here we flatten to make subsequent code simpler:
    data.columns = data.columns.get_level_values(0)
else:
    # Fallback for older versions or single ticker
    if 'Adj Close' in data.columns:
        data = data['Adj Close']
    elif all(t in data.columns for t in tickers):
        pass # Assume already correct
    else:
        raise ValueError("Could not locate 'Adj Close' data in the downloaded DataFrame.")

# Calculate daily returns for each ETF
returns = data.pct_change().dropna()

# Compute equal-weighted portfolio returns (1/3 weight each)
weights = [1/3, 1/3, 1/3]
portfolio_returns = (returns * weights).sum(axis=1)

# Create a summary DataFrame
portfolio_df = pd.DataFrame({
    'SPY_Return': returns['SPY'],
    'QQQ_Return': returns['QQQ'],
    'GLD_Return': returns['GLD'],
    'Portfolio_Return': portfolio_returns
})

# Show basic stats
portfolio_df.describe()

[*********************100%***********************]  3 of 3 completed

[31m---------------------------------------------------------------------------[39m
[31mKeyError[39m                                  Traceback (most recent call last)
[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexes/base.py:3641[39m, in [36mIndex.get_loc[39m[34m(self, key)[39m
[32m   3640[39m [38;5;28;01mtry[39;00m:
[32m-> [39m[32m3641[39m     [38;5;28;01mreturn[39;00m [38;5;28;43mself[39;49m[43m.[49m[43m_engine[49m[43m.[49m[43mget_loc[49m[43m([49m[43mcasted_key[49m[43m)[49m
[32m   3642[39m [38;5;28;01mexcept[39;00m [38;5;167;01mKeyError[39;00m [38;5;28;01mas[39;00m err:

[36mFile [39m[32mpandas/_libs/index.pyx:168[39m, in [36mpandas._libs.index.IndexEngine.get_loc[39m[34m()[39m
[32m--> [39m[32m168[39m [33m'Could not get source, probably due dynamically evaluated source code.'[39m

[36mFile [39m[32mpandas/_libs/index.pyx:197[39m, in [36mpandas._libs.index.IndexEngine.get_loc[39m[34m()[39m
[32m--> [39m[32m197[39m [33m'Could not get source, probably due dynamically evaluated source code.'[39m

[36mFile [39m[32mpandas/_libs/hashtable_class_helper.pxi:7668[39m, in [36mpandas._libs.hashtable.PyObjectHashTable.get_item[39m[34m()[39m
[32m-> [39m[32m7668[39m [33m'Could not get source, probably due dynamically evaluated source code.'[39m

[36mFile [39m[32mpandas/_libs/hashtable_class_helper.pxi:7676[39m, in [36mpandas._libs.hashtable.PyObjectHashTable.get_item[39m[34m()[39m
[32m-> [39m[32m7676[39m [33m'Could not get source, probably due dynamically evaluated source code.'[39m

[31mKeyError[39m: 'Adj Close'

The above exception was the direct cause of the following exception:

[31mKeyError[39m                                  Traceback (most recent call last)
[36mCell[39m[36m [39m[32mIn[4][39m[32m, line 13[39m
[32m      9[39m [38;5;66;03m# We need to select the 'Adj Close' attribute for all tickers.[39;00m
[32m     10[39m [38;5;66;03m# Using .loc to safely select the column level avoids KeyError if structure varies slightly.[39;00m
[32m     11[39m [38;5;28;01mif[39;00m isinstance(data.columns, pd.MultiIndex):
[32m     12[39m     [38;5;66;03m# Select 'Adj Close' from the second level of the MultiIndex[39;00m
[32m---> [39m[32m13[39m     data = data.loc[:, (slice([38;5;28;01mNone[39;00m), [33m'Adj Close'[39m)]
[32m     14[39m     [38;5;66;03m# Flatten columns to just ticker names if needed, or keep MultiIndex for clarity[39;00m
[32m     15[39m     [38;5;66;03m# Here we flatten to make subsequent code simpler:[39;00m
[32m     16[39m     data.columns = data.columns.get_level_values([32m0[39m)

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexing.py:1200[39m, in [36m_LocationIndexer.__getitem__[39m[34m(self, key)[39m
[32m   1198[39m     [38;5;28;01mif[39;00m [38;5;28mself[39m._is_scalar_access(key):
[32m   1199[39m         [38;5;28;01mreturn[39;00m [38;5;28mself[39m.obj._get_value(*key, takeable=[38;5;28mself[39m._takeable)
[32m-> [39m[32m1200[39m     [38;5;28;01mreturn[39;00m [38;5;28;43mself[39;49m[43m.[49m[43m_getitem_tuple[49m[43m([49m[43mkey[49m[43m)[49m
[32m   1201[39m [38;5;28;01melse[39;00m:
[32m   1202[39m     [38;5;66;03m# we by definition only have the 0th axis[39;00m
[32m   1203[39m     axis = [38;5;28mself[39m.axis [38;5;129;01mor[39;00m [32m0[39m

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexing.py:1386[39m, in [36m_LocIndexer._getitem_tuple[39m[34m(self, tup)[39m
[32m   1384[39m [38;5;28;01mwith[39;00m suppress(IndexingError):
[32m   1385[39m     tup = [38;5;28mself[39m._expand_ellipsis(tup)
[32m-> [39m[32m1386[39m     [38;5;28;01mreturn[39;00m [38;5;28;43mself[39;49m[43m.[49m[43m_getitem_lowerdim[49m[43m([49m[43mtup[49m[43m)[49m
[32m   1388[39m [38;5;66;03m# no multi-index, so validate all of the indexers[39;00m
[32m   1389[39m tup = [38;5;28mself[39m._validate_tuple_indexer(tup)

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexing.py:1067[39m, in [36m_LocationIndexer._getitem_lowerdim[39m[34m(self, tup)[39m
[32m   1065[39m [38;5;66;03m# we may have a nested tuples indexer here[39;00m
[32m   1066[39m [38;5;28;01mif[39;00m [38;5;28mself[39m._is_nested_tuple_indexer(tup):
[32m-> [39m[32m1067[39m     [38;5;28;01mreturn[39;00m [38;5;28;43mself[39;49m[43m.[49m[43m_getitem_nested_tuple[49m[43m([49m[43mtup[49m[43m)[49m
[32m   1069[39m [38;5;66;03m# we maybe be using a tuple to represent multiple dimensions here[39;00m
[32m   1070[39m ax0 = [38;5;28mself[39m.obj._get_axis([32m0[39m)

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexing.py:1172[39m, in [36m_LocationIndexer._getitem_nested_tuple[39m[34m(self, tup)[39m
[32m   1169[39m     axis -= [32m1[39m
[32m   1170[39m     [38;5;28;01mcontinue[39;00m
[32m-> [39m[32m1172[39m obj = [38;5;28;43mgetattr[39;49m[43m([49m[43mobj[49m[43m,[49m[43m [49m[38;5;28;43mself[39;49m[43m.[49m[43mname[49m[43m)[49m[43m.[49m[43m_getitem_axis[49m[43m([49m[43mkey[49m[43m,[49m[43m [49m[43maxis[49m[43m=[49m[43maxis[49m[43m)[49m
[32m   1173[39m axis -= [32m1[39m
[32m   1175[39m [38;5;66;03m# if we have a scalar, we are done[39;00m

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexing.py:1442[39m, in [36m_LocIndexer._getitem_axis[39m[34m(self, key, axis)[39m
[32m   1440[39m [38;5;66;03m# nested tuple slicing[39;00m
[32m   1441[39m [38;5;28;01mif[39;00m is_nested_tuple(key, labels):
[32m-> [39m[32m1442[39m     locs = [43mlabels[49m[43m.[49m[43mget_locs[49m[43m([49m[43mkey[49m[43m)[49m
[32m   1443[39m     indexer: [38;5;28mlist[39m[[38;5;28mslice[39m | npt.NDArray[np.intp]] = [[38;5;28mslice[39m([38;5;28;01mNone[39;00m)] * [38;5;28mself[39m.ndim
[32m   1444[39m     indexer[axis] = locs

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexes/multi.py:4032[39m, in [36mMultiIndex.get_locs[39m[34m(self, seq)[39m
[32m   4028[39m     [38;5;28;01mcontinue[39;00m
[32m   4030[39m [38;5;28;01melse[39;00m:
[32m   4031[39m     [38;5;66;03m# a slice or a single label[39;00m
[32m-> [39m[32m4032[39m     lvl_indexer = [38;5;28;43mself[39;49m[43m.[49m[43m_get_level_indexer[49m[43m([49m[43mk[49m[43m,[49m[43m [49m[43mlevel[49m[43m=[49m[43mi[49m[43m,[49m[43m [49m[43mindexer[49m[43m=[49m[43mindexer[49m[43m)[49m
[32m   4034[39m [38;5;66;03m# update indexer[39;00m
[32m   4035[39m lvl_indexer = _to_bool_indexer(lvl_indexer)

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexes/multi.py:3885[39m, in [36mMultiIndex._get_level_indexer[39m[34m(self, key, level, indexer)[39m
[32m   3882[39m         [38;5;28;01mreturn[39;00m [38;5;28mslice[39m(i, j, step)
[32m   3884[39m [38;5;28;01melse[39;00m:
[32m-> [39m[32m3885[39m     idx = [38;5;28;43mself[39;49m[43m.[49m[43m_get_loc_single_level_index[49m[43m([49m[43mlevel_index[49m[43m,[49m[43m [49m[43mkey[49m[43m)[49m
[32m   3887[39m     [38;5;28;01mif[39;00m level > [32m0[39m [38;5;129;01mor[39;00m [38;5;28mself[39m._lexsort_depth == [32m0[39m:
[32m   3888[39m         [38;5;66;03m# Desired level is not sorted[39;00m
[32m   3889[39m         [38;5;28;01mif[39;00m [38;5;28misinstance[39m(idx, [38;5;28mslice[39m):
[32m   3890[39m             [38;5;66;03m# test_get_loc_partial_timestamp_multiindex[39;00m

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexes/multi.py:3458[39m, in [36mMultiIndex._get_loc_single_level_index[39m[34m(self, level_index, key)[39m
[32m   3456[39m     [38;5;28;01mreturn[39;00m -[32m1[39m
[32m   3457[39m [38;5;28;01melse[39;00m:
[32m-> [39m[32m3458[39m     [38;5;28;01mreturn[39;00m [43mlevel_index[49m[43m.[49m[43mget_loc[49m[43m([49m[43mkey[49m[43m)[49m

[36mFile [39m[32m~/Library/mljar-studio/jlab_server/lib/python3.12/site-packages/pandas/core/indexes/base.py:3648[39m, in [36mIndex.get_loc[39m[34m(self, key)[39m
[32m   3643[39m     [38;5;28;01mif[39;00m [38;5;28misinstance[39m(casted_key, [38;5;28mslice[39m) [38;5;129;01mor[39;00m (
[32m   3644[39m         [38;5;28misinstance[39m(casted_key, abc.Iterable)
[32m   3645[39m         [38;5;129;01mand[39;00m [38;5;28many[39m([38;5;28misinstance[39m(x, [38;5;28mslice[39m) [38;5;28;01mfor[39;00m x [38;5;129;01min[39;00m casted_key)
[32m   3646[39m     ):
[32m   3647[39m         [38;5;28;01mraise[39;00m InvalidIndexError(key) [38;5;28;01mfrom[39;00m[38;5;250m [39m[34;01merr[39;00m
[32m-> [39m[32m3648[39m     [38;5;28;01mraise[39;00m [38;5;167;01mKeyError[39;00m(key) [38;5;28;01mfrom[39;00m[38;5;250m [39m[34;01merr[39;00m
[32m   3649[39m [38;5;28;01mexcept[39;00m [38;5;167;01mTypeError[39;00m:
[32m   3650[39m     [38;5;66;03m# If we have a listlike key, _check_indexing_error will raise[39;00m
[32m   3651[39m     [38;5;66;03m#  InvalidIndexError. Otherwise we fall through and re-raise[39;00m
[32m   3652[39m     [38;5;66;03m#  the TypeError.[39;00m
[32m   3653[39m     [38;5;28mself[39m._check_indexing_error(key)

[31mKeyError[39m: 'Adj Close'

Score cards for qwen3.5:397b for Value at Risk (VaR) Analysis in Python

Total score: 2/10

Task Completion: 1/2
Needs work
The workflow only attempts the initial data download and equal-weighted return calculation. It does not compute VaR, CVaR, maximum drawdown, or produce the required histogram.
Execution Correctness: 0/2
Needs work
The provided code fails with a KeyError on 'Adj Close', so it does not run successfully as shown. The notebook output indicates the core data extraction step is broken.
Output Quality: 0/3
Needs work
Expected outputs are missing: there is no VaR/CVaR result, no drawdown metric, and no plot. The only output is an error traceback.
Reasoning Quality: 1/2
Needs work
The assistant correctly identifies the need to download data and compute equal-weighted returns, but the reasoning stops far short of the full risk analysis task. It also incorrectly assumes the selected 'Adj Close' structure will work without verifying the actual yfinance output format.
Reliability: 0/1
Needs work
The workflow is fragile because it makes an unsupported assumption about the downloaded DataFrame structure and immediately errors. It is not robust to common yfinance output variations.

Try MLJAR Studio

Run the same type of AI-powered data analysis on your own datasets with conversational notebooks in MLJAR Studio.

Download Free Trial Learn about AI Data Analyst

Explore More AI Analysis Examples

Discover additional workflows across categories. Each example includes prompts, conversation outputs, and model-level scoring so you can compare approaches and results.

Value at Risk (VaR) Analysis in Python

What this AI workflow does

Who this example is for

Expected analysis outcomes

Tools and libraries used

Prompt sequence

Model Comparison

AI Analysis with qwen3.5:397b

Score cards for qwen3.5:397b for Value at Risk (VaR) Analysis in Python

Try MLJAR Studio

Explore More AI Analysis Examples

Exploratory Data Analysis

Time Series

Data Analysis

Machine Learning

NLP

Finance

Statistics