Energy Consumption Forecasting with Prophet
Analyze hourly energy consumption data, explore daily and weekly patterns, and build a forecasting model using Prophet.
What this AI workflow does
This AI Data Analyst workflow loads the AEP hourly energy consumption CSV and plots the full time series to inspect the overall trend. It computes and visualizes average consumption patterns by hour of day and by day of week to highlight recurring seasonality. It fits a Prophet model to the historical data, forecasts the next 30 days, and displays forecast components including trend and seasonal effects.
Who this example is for
This is for analysts and data scientists who need a quick, reproducible baseline forecast for hourly electricity demand. It is also useful for practitioners learning how to translate exploratory time-series patterns into a Prophet forecasting workflow.
Expected analysis outcomes
These are the results the AI workflow is expected to generate.
- Line chart of the full hourly consumption time series
- Bar chart of average consumption by hour of day
- Bar chart of average consumption by day of week
- 30-day Prophet forecast plot with uncertainty intervals
- Prophet components plot showing trend and seasonalities
Tools and libraries used
Main Python packages and tooling used to run this AI data analysis task.
- pandas
- numpy
- matplotlib
- seaborn
- prophet
Prompt sequence
This is the exact list of prompts used in this workflow. The same prompt sequence is sent to each model so outputs and scores can be compared fairly.
- 1load AEP hourly energy data from https://raw.githubusercontent.com/pplonski/datasets-for-start/refs/heads/master/aep-hourly-energy-consumption/AEP_hourly.csv and plot overall trend
- 2show average consumption by hour of day and by day of week
- 3fit a hourly Prophet model and forecast for the next 30 days
- 4show the forecast components: trend, weekly, yearly seasonality
Model Comparison
We compared several LLM models on the same analysis task. The table below shows their scores side by side. You can also open each model run to check the full conversation and notebook results.
| Model Source | Score | Task | Execution | Output | Reasoning | Reliability | Run |
|---|---|---|---|---|---|---|---|
| gemma4:31b | 10/10 | 2/2 | 2/2 | 3/3 | 2/2 | 1/1 | Open gemma4:31b conversation |
| glm-5.1 | 10/10 | 2/2 | 2/2 | 3/3 | 2/2 | 1/1 | Open glm-5.1 conversation |
| gpt-5.4 | 10/10 | 2/2 | 2/2 | 3/3 | 2/2 | 1/1 | Open gpt-5.4 conversation |
| gpt-oss:120b | 10/10 | 2/2 | 2/2 | 3/3 | 2/2 | 1/1 | Open gpt-oss:120b conversation |
| qwen3.5:397b | 10/10 | 2/2 | 2/2 | 3/3 | 2/2 | 1/1 | Open qwen3.5:397b conversation |
| qwen3-coder-next | 3/10 | 1/2 | 0/2 | 1/3 | 1/2 | 0/1 | Open qwen3-coder-next conversation |
How scoring works
Each run is graded on five dimensions:
- Task Completion (0-2): how fully the workflow answered the requested analysis task and completed required steps.
- Execution Correctness (0-2): whether generated code is correct, consistent, and likely runnable without major fixes.
- Output Quality (0-3): how well tables, charts, and results match the expected analytical outcomes.
- Reasoning Quality (0-2): clarity and correctness of interpretation, explanation, and conclusions.
- Reliability (0-1): robustness of the run, including low hallucination risk and stable behavior.
The final score is the sum of these dimensions, so total score = 0-10. Higher values indicate more complete, correct, and reliable notebook results for this workflow.
AI Analysis with gpt-5.4
This run uses AI with a step-by-step list of prompts to perform the analysis task Energy Consumption Forecasting with Prophet. Below, you can review the full conversation generated with gpt-5.4. After the conversation, we show detailed score cards for this run.
Score cards for gpt-5.4 for Energy Consumption Forecasting with Prophet
Total score: 10/10
Task Completion: 2/2
ExcellentThe notebook covers all major requested steps: loading the data, plotting the full series, summarizing hourly and weekly patterns, fitting Prophet, forecasting 30 days, and showing components.
Execution Correctness: 2/2
ExcellentThe provided code appears runnable and produced outputs for each major step, including the Prophet fit and forecast. No major syntax or execution errors are evident in the notebook content.
Output Quality: 3/3
ExcellentOutputs match the expected outcomes semantically: a full time-series plot, hourly and weekly summaries, a 30-day forecast with uncertainty intervals, and a components plot. The hourly pattern peaks in the evening as expected.
Reasoning Quality: 2/2
ExcellentThe explanations are generally correct and tied to the outputs, including the observed daily and weekly seasonality and the handling of duplicate timestamps before Prophet fitting. The reasoning is clear and relevant.
Reliability: 1/1
ExcellentThe workflow is reasonably robust: it checks for duplicates and missing hours, aggregates duplicates before modeling, and handles Prophet installation if needed.
Try MLJAR Studio
Run the same type of AI-powered data analysis on your own datasets with conversational notebooks in MLJAR Studio.
Explore More AI Analysis Examples
Discover additional workflows across categories. Each example includes prompts, conversation outputs, and model-level scoring so you can compare approaches and results.