Dimensionality Assessment AI Prompt

Dimensionality Assessment is an advanced prompt for data exploration. This prompt helps the user understand the structure, meaning, and analytical potential of a dataset before moving into deeper work. It is designed to surface what is in the data, how trustworthy it looks, and which columns, relationships, or patterns deserve attention first. Use it early in an analysis workflow to reduce guesswork and create a shared understanding of the dataset. It is best suited for direct execution against a real dataset. The requested output should be comprehensive, methodical, and suitable for expert review or production-style work.

Prompt text
Assess the dimensionality and information density of this dataset:

1. How many features are there relative to the number of rows? Is this dataset wide, tall, or balanced?
2. Apply PCA to the standardized numeric features and report how many components explain 80%, 90%, and 95% of the variance; this estimates the effective (linear) dimensionality
3. Identify groups of highly correlated features that are effectively measuring the same thing
4. Flag any features that appear to be near-linear combinations of others (redundant features)
5. Identify features with near-zero variance — they carry almost no information
6. Recommend a minimum feature set that retains roughly 90% of the variance in the dataset, and state how that figure was measured

Return a dimensionality report with: original features, effective dimensions, redundant groups, and recommended feature set.
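
As a rough illustration of step 2, the sketch below counts how many principal components are needed to reach each variance threshold. It is not part of the prompt and is only one possible approach: it assumes an all-numeric matrix with no missing values, uses NumPy's SVD rather than any particular PCA library, and the function name `components_for_variance` and the synthetic data are hypothetical.

```python
import numpy as np

def components_for_variance(X, thresholds=(0.80, 0.90, 0.95)):
    """Return {threshold: number of principal components needed to reach it}."""
    X = np.asarray(X, dtype=float)
    # Standardize columns so no feature dominates purely because of its scale.
    std = X.std(axis=0)
    keep = std > 0  # constant columns cannot be standardized, so skip them
    Z = (X[:, keep] - X[:, keep].mean(axis=0)) / std[keep]
    # Squared singular values of the standardized matrix are proportional
    # to the variance captured by each principal component.
    s = np.linalg.svd(Z, compute_uv=False)
    cumulative = np.cumsum(s**2 / np.sum(s**2))
    # First index where cumulative variance reaches the threshold, plus one.
    # The small tolerance guards against floating-point round-off.
    return {t: int(np.searchsorted(cumulative, t - 1e-12) + 1) for t in thresholds}

# Synthetic example (assumed data): 6 observed features driven by 2 latent factors.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 2))
mixing = np.array([[1.0, 2.0, 0.0, 1.0, -1.0, 0.5],
                   [0.0, 1.0, 1.0, -1.0, 2.0, 1.5]])
X = latent @ mixing + 0.01 * rng.normal(size=(500, 6))

result = components_for_variance(X)
n_rows, n_features = X.shape
print(f"{n_rows} rows x {n_features} features, components needed: {result}")
```

In this synthetic case at most two components should be needed at every threshold, even though six features are present, because only two latent factors drive the data. PCA captures linear structure only, so the count is an estimate of effective dimensionality rather than a definitive figure.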

When to use this prompt

Use case 01

When you have a new dataset and need a fast but structured first assessment.

Use case 02

When you want to understand columns, grain, date coverage, or basic quality before analysis.

Use case 03

When you need to decide which variables are worth deeper investigation.

Use case 04

When you want a repeatable starting point for exploratory data analysis.

What the AI should return

The AI should return a structured analysis of the dataset, using clear headings, compact tables where useful, and a short narrative that explains the main takeaways. It should explicitly call out quality issues, notable patterns, and any assumptions it had to make about the data. Where the prompt asks for calculations or plots, those should be included with concise interpretation. The final answer should help the user understand both what the data contains and what to inspect next.
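
For orientation, here is a minimal sketch of how part of such a report could be assembled. It is an assumption about structure, not the format the AI is guaranteed to return: the function `redundancy_report`, its dictionary keys, and both thresholds are hypothetical, and it covers only the correlated-group and near-zero-variance checks (not PCA). It also assumes numeric data with no missing values and at least one non-constant column.

```python
import numpy as np

def redundancy_report(X, names, corr_threshold=0.9, var_threshold=1e-8):
    """Group highly correlated features and flag near-zero-variance ones."""
    X = np.asarray(X, dtype=float)
    variances = X.var(axis=0)
    # Note: a raw-variance cutoff depends on each feature's units (assumption).
    low_var = [n for n, v in zip(names, variances) if v <= var_threshold]
    active = [i for i, v in enumerate(variances) if v > var_threshold]
    corr = np.abs(np.atleast_2d(np.corrcoef(X[:, active], rowvar=False)))

    # Union-find: features linked by |r| >= threshold end up in the same group.
    parent = list(range(len(active)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path compression
            i = parent[i]
        return i

    for a in range(len(active)):
        for b in range(a + 1, len(active)):
            if corr[a, b] >= corr_threshold:
                parent[find(a)] = find(b)

    groups = {}
    for k, i in enumerate(active):
        groups.setdefault(find(k), []).append(names[i])

    return {
        "original_features": len(names),
        "near_zero_variance": low_var,
        "redundant_groups": [g for g in groups.values() if len(g) > 1],
        # Keep the first member of each group as its representative.
        "recommended_features": [g[0] for g in groups.values()],
    }

# Synthetic example (assumed data): one near-duplicate and one constant column.
rng = np.random.default_rng(1)
a = rng.normal(size=300)
b = rng.normal(size=300)
X = np.column_stack([a, 2 * a + 0.01 * rng.normal(size=300), b, np.full(300, 5.0)])
names = ["a", "a_scaled", "b", "constant"]

report = redundancy_report(X, names)
print(report)
```

Keeping the first member of each group is an arbitrary choice made for brevity. In practice the representative is better chosen by domain meaning, data quality, or missingness.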

How to use this prompt

1. Open your data context. Load your dataset, notebook, or working environment so the AI can operate on the actual project context.
2. Copy the prompt text. Use the copy button above and paste the prompt into the AI assistant or prompt input area.
3. Review the output critically. Check whether the result matches your data, assumptions, and desired format before moving on.
4. Chain into the next prompt. Once you have the first result, continue deeper with related prompts in Data Exploration.

Frequently asked questions

What does the Dimensionality Assessment prompt do?

It gives you a structured data exploration starting point for data analyst work and helps you move faster without starting from a blank page.

Who is this prompt for?

It is designed for data analyst workflows and marked as advanced, so it works well as a guided starting point for that level of experience.

What type of prompt is this?

Dimensionality Assessment is a single prompt. You can copy it as-is, adapt it, or use it as one step inside a larger workflow.

Can I use this outside MLJAR Studio?

Yes. The prompt text works in other AI tools too, but MLJAR Studio is the best fit when you want local execution, visible Python code, and reusable notebooks.

What should I open next?

Natural next steps from here are Bivariate Relationship Analysis, Categorical Column Profiling, and Column Relationship Map.