Cloud Data Engineer · 20 prompts (19 prompts, 1 chain) · 6 categories · Beginner → Advanced

Cloud Data Engineer AI Prompts

Cloud Data Engineer AI prompt library with 20 prompts across 6 categories, covering cloud architecture, orchestration, storage, warehousing, and streaming. Browse the categories and copy prompts you can use as-is or adapt to your stack.

Browse Cloud Data Engineer prompt categories


Cloud Architecture

5 prompts
Cloud Architecture · Beginner · Prompt
01

Cloud Data Platform Architecture

Design a cloud-native data platform architecture for this organization. Cloud provider: {{provider}} (AWS, GCP, Azure) Data sources: {{sources}} Users: {{users}} (analysts, data scientists, engineers) Scale: {{scale}} 1. AWS reference architecture: - Ingestion: Kinesis Data Streams (streaming) / AWS Glue (batch ETL) - Storage: S3 (data lake) + Redshift (warehouse) + RDS (operational) - Processing: AWS Glue / EMR (Spark) / Lambda (serverless) - Serving: Redshift / Athena (S3 queries) / DynamoDB (low-latency lookups) - Orchestration: Apache Airflow on MWAA / AWS Step Functions - Catalog: AWS Glue Data Catalog - BI: QuickSight / Tableau / Looker 2. GCP reference architecture: - Ingestion: Pub/Sub (streaming) / Cloud Dataflow / Cloud Composer (Airflow) - Storage: Cloud Storage (data lake) + BigQuery (warehouse) - Processing: Dataflow (Apache Beam) / Dataproc (Spark) - Serving: BigQuery / Bigtable (low-latency) / Cloud Spanner (transactional) - Catalog: Dataplex / Data Catalog - BI: Looker / Looker Studio / Tableau 3. Azure reference architecture: - Ingestion: Event Hubs (streaming) / Azure Data Factory (ETL/ELT) - Storage: ADLS Gen2 (data lake) + Synapse Analytics (warehouse) - Processing: Databricks / Azure Synapse Spark / Azure Stream Analytics - Serving: Synapse / Cosmos DB / Azure SQL - Catalog: Microsoft Purview - BI: Power BI 4. Lake House pattern (recommended default): - Single storage layer (cloud object storage) holds all data in open formats (Parquet, Delta, Iceberg) - Multiple compute engines query the same data (Spark, Athena, BigQuery Omni, Trino) - Delta Lake / Apache Iceberg: ACID transactions on the data lake - Eliminates data duplication between a separate data lake and warehouse 5. 
Cost optimization: - Separate storage and compute: scale them independently - Use spot/preemptible instances for batch processing - Implement data tiering: hot (SSD), warm (HDD/standard), cold (archival) Return: reference architecture diagram (text), component selection rationale, lake house vs traditional warehouse decision, and cost optimization approach.
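The data-tiering advice above can be sketched as a small cost model. The per-GB-month prices below are illustrative assumptions, not quoted rates for any provider:

```python
# Sketch: estimate monthly storage cost across hot/warm/cold tiers.
# Prices (USD per GB-month) are illustrative assumptions only.
TIER_PRICE_PER_GB = {"hot": 0.023, "warm": 0.0125, "cold": 0.004}

def monthly_storage_cost(gb_by_tier: dict) -> float:
    """Sum tiered storage cost; an unknown tier name raises KeyError."""
    return round(sum(TIER_PRICE_PER_GB[t] * gb for t, gb in gb_by_tier.items()), 2)
```

Running the model on, say, 1 TB hot and 10 TB cold shows why pushing rarely read data down-tier matters long before compute tuning does.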
Cloud Architecture · Advanced · Prompt
02

Data Mesh on Cloud

Design a data mesh architecture on this cloud platform. Organization size: {{org_size}} Domains identified: {{domains}} (finance, product, marketing, operations, etc.) Cloud provider: {{provider}} Current state: {{current_state}} (centralized data warehouse, fragmented silos, etc.) 1. Data mesh principles: - Domain ownership: each business domain owns and publishes its own data products - Data as a product: data is treated with product-quality standards (SLA, documentation, quality) - Self-serve data platform: a platform team provides the infrastructure; domain teams use it - Federated computational governance: global policies enforced automatically; local flexibility 2. Domain data product structure: Each domain publishes: - Input data: raw data from its systems - Transformed data: cleansed, enriched, domain-specific tables - Output data products: interfaces for other domains (S3 path, Snowflake share, BigQuery authorized dataset) - SLA: freshness, availability, schema stability guarantees - Documentation: data catalog entry with owner, description, quality metrics 3. Technical implementation on AWS: - Account per domain: separate AWS accounts for finance, product, marketing data - Cross-domain access: AWS Lake Formation data sharing; S3 bucket policies for cross-account access - Central catalog: AWS Glue Data Catalog federated with domain-level catalogs - Self-serve platform: reusable Terraform modules for each domain to provision standard infrastructure 4. Governance layer: - Global policies (applied everywhere): PII tagging, retention rules, access logging - Domain policies (domain-specific): schema standards, SLA definitions, quality thresholds - Policy engine: AWS SCP (service control policies), OPA (Open Policy Agent), Apache Ranger 5. 
Data product contract: interface_type: s3_parquet location: s3://finance-data-products/revenue/v1/ schema: {order_id: bigint, amount_usd: numeric, date: date} sla_freshness: 4 hours owner: finance-analytics@company.com version: 1.2.0 Return: domain architecture, AWS/GCP/Azure implementation approach, governance layer design, and data product contract schema.
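A minimal sketch of enforcing the contract fields listed above before a data product is published. The required-field names mirror the example contract; they are an assumption about how a team might flatten it, not a standard schema:

```python
# Fields every registered data product contract must carry,
# mirroring the example contract above (names are illustrative).
REQUIRED_FIELDS = {"interface_type", "location", "schema",
                   "sla_freshness_hours", "owner", "version"}

def missing_contract_fields(contract: dict) -> set:
    """Return the set of required fields absent from a contract dict."""
    return REQUIRED_FIELDS - contract.keys()
```

A CI check in the registration pipeline can fail the build whenever this set is non-empty.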
Cloud Architecture · Intermediate · Prompt
03

ELT vs ETL on Cloud

Design the data transformation strategy for this cloud data platform. Cloud warehouse: {{warehouse}} Data volume: {{volume}} Transformation complexity: {{complexity}} Team skills: {{team_skills}} 1. ETL (Extract, Transform, Load): - Transform data BEFORE loading into the warehouse - Transformation happens in an external processing engine (Spark, Python) - Use when: data must be transformed before it reaches the warehouse (privacy, compliance), large-scale transformations that the warehouse handles poorly, non-SQL transformations 2. ELT (Extract, Load, Transform): - Load raw data INTO the warehouse first, then transform using SQL - Leverage the warehouse's MPP engine for transformations - Default choice for modern cloud warehouses (BigQuery, Snowflake, Redshift) - Enables: instant access to raw data, auditability, re-transformation without re-extraction 3. ELT stack (recommended for most teams): - Extraction: Fivetran / Airbyte / Stitch (managed connectors) - Loading: load raw to the warehouse (Snowflake COPY INTO, BigQuery load jobs, Redshift COPY) - Transformation: dbt (SQL transformations, testing, documentation) 4. When to use a processing engine (Spark / Dataflow) alongside ELT: - Complex unstructured data: log parsing, NLP, image metadata extraction - Large-scale deduplication across billions of rows - ML feature computation that requires Python libraries - Data that must NOT enter the warehouse (PII that must be tokenized first) 5. Reverse ETL: - Push transformed data FROM the warehouse TO operational systems (CRM, ad platforms, email tools) - Tools: Census, Hightouch, Grouparoo - Use case: sync customer segments from the warehouse to Salesforce or Facebook Ads Return: ELT vs ETL recommendation, tool stack, processing engine use cases, and reverse ETL pattern.
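The decision rule in the prompt above reduces to a simple check: default to ELT on a modern cloud warehouse, and fall back to ETL only when data must be transformed before it reaches the warehouse or the transforms are not expressible in SQL. A sketch:

```python
def recommend_transform_strategy(needs_pre_load_masking: bool,
                                 non_sql_transforms: bool) -> str:
    """Encode the rule of thumb above: ELT is the modern-warehouse default;
    ETL only when data must not land raw (PII tokenization, compliance)
    or the transformation cannot be done in SQL."""
    if needs_pre_load_masking or non_sql_transforms:
        return "ETL"
    return "ELT"
```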
Cloud Architecture · Advanced · Chain
04

Full Cloud Data Engineering Chain

Step 1: Architecture design - choose the cloud data platform components (ingestion, storage, processing, serving, orchestration, catalog) for the given provider and requirements. Define the medallion zones and table format (Delta Lake or Iceberg). Step 2: Ingestion design - design the batch ingestion pipeline (ELT with managed connectors) and the streaming pipeline (CDC or event streaming). Define the landing zone schema and file format. Step 3: Transformation layer - set up dbt on the cloud warehouse. Design the staging, intermediate, and mart layers. Configure incremental models for large tables. Set up dbt tests and source freshness checks. Step 4: Orchestration - configure Airflow or the managed orchestrator. Define DAG structure, retry policies, and SLA alerts. Implement data-aware scheduling between upstream and downstream pipelines. Step 5: Security and governance - configure IAM roles, network security (private endpoints), data encryption, and audit logging. Tag PII columns. Set up the data catalog and ownership assignments. Step 6: Observability - implement pipeline monitoring (success rates, duration trends, SLA breaches), data quality monitoring (dbt test failures, row count anomalies), and cost monitoring (tagged resource spend). Step 7: IaC and CI/CD - provision all infrastructure via Terraform. Set up CI for dbt (slim builds on PR). Set up CD for pipeline deployment. Define the runbook for common failure scenarios.
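The seven steps above form a partial order rather than a strict sequence (security can start once the architecture exists, in parallel with the pipeline work). One hypothetical dependency map and its valid execution order, using the stdlib topological sorter:

```python
from graphlib import TopologicalSorter

# Hypothetical dependencies between the chain's seven steps:
# each step maps to the steps it requires first.
STEPS = {
    "architecture": set(),
    "ingestion": {"architecture"},
    "transformation": {"ingestion"},
    "orchestration": {"transformation"},
    "security": {"architecture"},
    "observability": {"orchestration"},
    "iac_cicd": {"security", "observability"},
}

def chain_order() -> list:
    """Return one valid execution order respecting the dependencies."""
    return list(TopologicalSorter(STEPS).static_order())
```

Any ordering the sorter emits keeps architecture first and IaC/CI-CD last, which matches the chain's framing.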
Cloud Architecture · Advanced · Prompt
05

Multi-Cloud Data Strategy

Design a multi-cloud data strategy that avoids vendor lock-in and leverages the strengths of multiple providers. Primary provider: {{primary}} Secondary provider: {{secondary}} Reason for multi-cloud: {{reason}} (regulatory, best-of-breed, M&A, risk) Data sharing requirements: {{sharing}} 1. Multi-cloud patterns: Primary + Burst: - All data lives in the primary cloud - Burst compute to secondary cloud for overflow workloads - Risk: data transfer costs between clouds Federated (query across clouds): - Data stays in each cloud; queries federate across them - BigQuery Omni: query S3/ADLS data from BigQuery - Snowflake: available on AWS, GCP, and Azure; same interface across clouds - Trino / Presto: open-source federated query across any data source Replicated (synchronized copy): - Mirror critical datasets between clouds for disaster recovery or locality - High cost and complexity; justified for active-active multi-region 2. Avoiding lock-in: - Open formats: Parquet, Delta Lake, Apache Iceberg — readable by any engine - Open protocols: S3-compatible APIs (all clouds support S3 API now) - Open orchestration: Apache Airflow (portable across all clouds) - Containerize processing: Docker + Kubernetes (runs on any cloud) 3. Data transfer cost management: - Data egress is expensive (AWS: $0.09/GB outbound) - Minimize cross-cloud data movement: process in the cloud where the data lives - Use direct connectivity: AWS Direct Connect ↔ Azure ExpressRoute peering - Snowflake / Databricks: same vendor platform across all clouds (no egress for SQL queries) 4. Governance across clouds: - Unified catalog: DataHub or Microsoft Purview can catalog assets across clouds - Unified IAM: OIDC federation between cloud providers - Unified monitoring: Datadog or Splunk for cross-cloud observability Return: multi-cloud architecture recommendation, lock-in avoidance strategy, data transfer cost analysis, and governance approach.
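To make the egress argument concrete, here is a one-line cost model using the $0.09/GB figure cited above (rates vary by provider, region, and volume tier, so treat the constant as an assumption to verify):

```python
EGRESS_PER_GB = 0.09  # AWS outbound figure cited above; verify current rates

def cross_cloud_egress_cost(gb_transferred: float) -> float:
    """Estimated USD cost of moving data between clouds at a flat rate."""
    return round(gb_transferred * EGRESS_PER_GB, 2)
```

Moving even 1 TB between clouds daily adds up fast, which is why "process where the data lives" is the default pattern.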

Orchestration

4 prompts
Orchestration · Intermediate · Prompt
01

Cloud Orchestration with Airflow

Design and implement an Airflow orchestration pattern for this data pipeline. Provider: {{provider}} (AWS MWAA, GCP Cloud Composer, Astronomer, self-hosted) Pipeline: {{pipeline_description}} Dependencies: {{dependencies}} SLA: {{sla}} 1. DAG design principles: - One DAG = one business process (not one per table) - Idempotent tasks: re-running any task produces the same result - No business logic in the DAG file; DAG file only defines the workflow - Use template variables for dates: {{ ds }}, {{ execution_date }}, {{ next_ds }} 2. Task types: BashOperator: shell commands PythonOperator: Python functions (keep functions small and focused) BigQueryInsertJobOperator: BigQuery SQL execution RedshiftSQLOperator: Redshift queries S3ToRedshiftOperator: load S3 files to Redshift DbtOperator / DbtCloudRunJobOperator: trigger dbt runs HttpSensor: wait for an API endpoint to be available ExternalTaskSensor: wait for a task in another DAG 3. Retry and SLA configuration: default_args = { 'retries': 3, 'retry_delay': timedelta(minutes=5), 'retry_exponential_backoff': True, 'email_on_failure': True, 'sla': timedelta(hours=4), } 4. Dynamic DAGs (for many similar pipelines): # Generate a DAG per source table from a config file for table in config['tables']: with DAG(f'sync_{table["name"]}', ...) as dag: globals()[f'dag_{table["name"]}'] = dag 5. Data-aware scheduling (Airflow 2.4+): # Trigger a downstream DAG when an upstream dataset is updated @dag(schedule=[Dataset('s3://bucket/processed/orders')]) def downstream_dag(): ... # Declarative dependency management without sensors 6. Testing Airflow DAGs: - DAG integrity test: ensure all DAGs parse without errors (dag.test_cycle()) - Task unit tests: test the Python function independently - Integration test: airflow dags test <dag_id> <execution_date> in a local environment Return: DAG template with retry configuration, dynamic DAG generation pattern, data-aware scheduling, and testing approach.
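The retry configuration above implies a concrete delay sequence. A pure-Python sketch of the doubling schedule that `retry_exponential_backoff` produces (Airflow itself also adds jitter and a maximum delay cap, omitted here):

```python
from datetime import timedelta

def retry_delays(retries: int, base: timedelta, exponential: bool = True) -> list:
    """Delay before each retry attempt: base, 2*base, 4*base, ...
    (or a flat base delay when exponential backoff is disabled)."""
    return [base * (2 ** i if exponential else 1) for i in range(retries)]
```

With `retries=3` and `retry_delay=timedelta(minutes=5)`, a task's last attempt fires roughly 35 minutes after the first failure, which is worth checking against the 4-hour SLA.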
Orchestration · Advanced · Prompt
02

Data Contracts and SLA Management

Implement data contracts and SLA management for data products in this cloud platform. Data producers: {{producers}} Data consumers: {{consumers}} Current issues: {{issues}} (schema breaking changes, missed SLAs, undocumented changes) 1. What is a data contract: A data contract is a formal agreement between a data producer (team that writes the data) and data consumers (teams that read it). It specifies: - Schema: column names, types, and semantic meaning - Quality: expected null rates, value distributions, referential integrity - SLA: freshness (max hours since last update), availability (uptime %) - Versioning: how schema changes are communicated and backward compatibility 2. Data contract specification (YAML): apiVersion: v1 kind: DataContract id: orders.v1 producer: payments-team owner: payments-data@company.com schema: - name: order_id type: bigint nullable: false unique: true - name: amount_usd type: numeric(10,2) nullable: false minimum: 0 sla: freshness_hours: 2 availability_percent: 99.5 versioning: current: 1.2.0 breaking_change_policy: 30-day notice required 3. Tooling: - Data Contract CLI (open-source): validate data against contracts, publish to catalog - Soda Core: run quality checks defined in contracts - dbt + Elementary: enforce schema contracts via model contracts; test quality via tests 4. SLA monitoring: - Freshness check: query MAX(updated_at) per table; alert if > SLA threshold - Availability: monitor pipeline success rate; alert on consecutive failures - Quality score: % of tests passing per data product; publish in the catalog - SLA breach report: weekly report of SLA breaches per team, with trend 5. 
Governance process: - Contract registration: new data products must register a contract before GA - Breaking change process: 30-day notice + migration guide for all consumers - Deprecation: deprecated products sunset after 90 days with active consumer notification Return: data contract YAML schema, tooling recommendation, SLA monitoring implementation, and governance process.
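The freshness check described in the SLA monitoring section (query the last update time, alert if it exceeds the contract threshold) is a few lines once the timestamps are in hand:

```python
from datetime import datetime, timedelta

def freshness_breached(last_updated: datetime, freshness_hours: float,
                       now: datetime) -> bool:
    """True when a table's age exceeds its contracted freshness SLA."""
    return now - last_updated > timedelta(hours=freshness_hours)
```

In practice `last_updated` would come from `MAX(updated_at)` on the monitored table, as described above.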
Orchestration · Intermediate · Prompt
03

Infrastructure as Code for Data

Implement Infrastructure as Code (IaC) for this cloud data platform. Cloud provider: {{provider}} IaC tool: {{iac_tool}} (Terraform, Pulumi, CDK, Bicep) Components to provision: {{components}} Team: {{team}} 1. Why IaC for data infrastructure: - Reproducible: dev, staging, and prod environments are identical - Version-controlled: infrastructure changes are reviewed like code - Self-documenting: the Terraform / Pulumi code IS the documentation - Auditable: every change is in git history with the author 2. Terraform for cloud data resources: S3 bucket with lifecycle and logging: resource "aws_s3_bucket" "data_lake" { bucket = "${var.env}-data-lake-${var.account_id}" tags = { Environment = var.env, Team = "data-engineering" } } Snowflake warehouse: resource "snowflake_warehouse" "analytics" { name = "ANALYTICS_WH" warehouse_size = "SMALL" auto_suspend = 60 auto_resume = true } 3. Module structure: modules/ data_lake/ # S3 bucket + lifecycle + IAM snowflake_env/ # databases, warehouses, roles airflow_mwaa/ # MWAA environment + networking monitoring/ # CloudWatch dashboards + alarms environments/ dev/main.tf # calls modules with dev variables prod/main.tf # calls modules with prod variables 4. State management: - Remote state: store in S3 + DynamoDB (AWS) or GCS (GCP) with locking - State locking: prevents concurrent runs from corrupting state - Separate state per environment: dev and prod should never share state 5. CI/CD for IaC: PR: terraform plan → post plan output as PR comment Merge to main: terraform apply (with approval gate for prod) Tool: Atlantis (open-source) or Terraform Cloud for automated plan/apply Return: Terraform module structure, resource examples, state management configuration, and CI/CD pipeline for IaC.
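To illustrate the state-management rules (S3 remote state, DynamoDB locking, separate key per environment), here is a sketch that renders a backend block per environment. Bucket, table, and region names are placeholders:

```python
def render_s3_backend(env: str, bucket: str, lock_table: str) -> str:
    """Render a Terraform S3 backend block with per-environment state keys
    and DynamoDB locking, as described above. Names are placeholders."""
    return (
        'terraform {\n'
        '  backend "s3" {\n'
        f'    bucket         = "{bucket}"\n'
        f'    key            = "{env}/terraform.tfstate"\n'
        '    region         = "us-east-1"\n'
        f'    dynamodb_table = "{lock_table}"\n'
        '    encrypt        = true\n'
        '  }\n'
        '}\n'
    )
```

Generating the block per environment (rather than hand-copying it) keeps dev and prod state guaranteed-separate.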
Orchestration · Advanced · Prompt
04

Pipeline Observability and Monitoring

Design an observability framework for this cloud data pipeline. Cloud provider: {{provider}} Orchestrator: {{orchestrator}} (Airflow, Prefect, Dagster, dbt Cloud) Pipeline count: {{pipeline_count}} SLA requirements: {{sla}} 1. What to monitor: Pipeline health: - Success/failure rate per DAG/job over time - Duration trend: is a job getting slower? (may indicate data volume growth or a query regression) - Retry rate: high retries indicate flaky upstream dependencies Data freshness: - Time since last successful run per table - SLA breach: alert if a critical table has not been updated within {{sla}} hours Data quality: - Test failure rate per dbt model - Row count anomalies: significant drop or spike vs rolling average Infrastructure: - Cloud service quotas: Airflow task concurrency, Snowflake credit consumption - Storage growth: S3/GCS bucket size trends 2. Observability stack: - Airflow: built-in metrics via StatsD → Prometheus → Grafana - dbt: elementary package → data observability dashboard - Cloud-native: AWS CloudWatch / GCP Cloud Monitoring / Azure Monitor for infrastructure - Data catalog: Dataplex / Purview / Atlan for data lineage and freshness 3. Alerting design: - Alert on pipeline failure: Slack + PagerDuty (for SLA-critical pipelines) - Alert on SLA breach (job did not complete on time): escalate based on tier - Alert on data quality failure: Slack with affected model, failure reason, and link to dbt docs - Avoid alert fatigue: start with few high-signal alerts; add gradually 4. Lineage tracking: - Column-level lineage: which source columns feed each output column - Tools: dbt + Elementary (column-level lineage), DataHub, Atlan, OpenLineage - OpenLineage standard: emit lineage events from Airflow/Spark/dbt → centralize in Marquez or DataHub 5. 
Runbook for common failures: - Source freshness failure: check source system → check connector logs → retry - dbt test failure: run `dbt test --select <model>` in dev → investigate SQL → fix upstream - Airflow DAG stuck: check Airflow scheduler logs → check DB connections → manually clear task Return: monitoring metric definitions, alerting configuration, lineage tooling recommendation, and runbook templates.
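The "row count anomalies" check above (significant drop or spike vs a rolling average) can be sketched as a relative-deviation test; the 30% tolerance is an illustrative default to tune per table:

```python
def row_count_anomaly(today: int, rolling_avg: float,
                      tolerance: float = 0.3) -> bool:
    """Flag today's row count if it deviates from the rolling average
    by more than the tolerance fraction (default 30%, an assumption)."""
    if rolling_avg <= 0:
        return today > 0  # any rows on a historically empty table is anomalous
    return abs(today - rolling_avg) / rolling_avg > tolerance
```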

Cloud Storage

3 prompts
Cloud Storage · Advanced · Prompt
01

Cloud Data Catalog and Metadata Management

Implement a data catalog and metadata management strategy for this cloud data platform. Cloud provider: {{provider}} Data assets: {{data_assets}} (tables, dashboards, ML models, data products) Users: {{users}} (data engineers, analysts, data scientists) Compliance: {{compliance}} 1. Why a data catalog: - Discoverability: users can find the data they need without asking Slack - Trust: users know who owns the data, when it was last updated, and its quality - Compliance: understand what PII data exists and where it lives - Lineage: understand the impact of changes before making them 2. Catalog tool selection: - AWS Glue Data Catalog: native AWS integration; good for Athena + Glue workflows; limited UI - Google Dataplex: unified GCP data governance + catalog - Microsoft Purview: enterprise governance for Azure + multi-cloud - DataHub (open-source): rich lineage, push-pull metadata; connects to any stack - Atlan / Alation (commercial): best-in-class UX; strong search and collaboration - dbt docs: good starting point; limited to dbt assets only 3. Metadata to capture per asset: - Technical: schema, data types, row count, size, freshness - Business: description, owner, domain, use cases, related assets - Operational: SLA, lineage (upstream sources, downstream consumers), quality scores - Governance: PII classification, retention policy, access controls, audit log 4. PII classification automation: - Tag PII columns automatically using regex patterns or NLP classifiers - AWS Macie: scans S3 for PII automatically - GCP DLP API: classifies data in BigQuery and Cloud Storage - Apply tags: pii_type=email, pii_type=ssn, pii_type=phone_number - Trigger: alert when untagged PII is detected in a new dataset 5. 
Catalog governance process: - Owner assignment: every table must have an owner before it goes to production - Description SLA: new tables must be documented within 5 business days - Freshness monitoring: catalog must show last update time for all production tables - Quarterly audit: review stale or orphaned assets and archive or document them Return: catalog tool recommendation, metadata schema, PII classification automation, and governance process.
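The regex branch of the PII-classification automation above can be sketched in a few lines. These patterns are deliberately naive illustrations; production scanners (Macie, the DLP API) use far richer detectors:

```python
import re

# Illustrative patterns only -- real PII scanners are far more robust.
PII_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "ssn": re.compile(r"^\d{3}-\d{2}-\d{4}$"),
    "phone_number": re.compile(r"^\+?\d{10,15}$"),
}

def classify_pii(sample_values) -> set:
    """Return the pii_type tags matched by any sampled column value."""
    return {name for v in sample_values
            for name, pattern in PII_PATTERNS.items() if pattern.match(v)}
```

A scheduled job sampling new columns and calling this would produce the `pii_type=...` tags the catalog section describes.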
Cloud Storage · Intermediate · Prompt
02

Data Lake Design on Cloud Object Storage

Design a well-organized, cost-effective data lake on cloud object storage. Provider: {{provider}} (S3, GCS, ADLS Gen2) Data types: {{data_types}} (raw events, processed tables, ML features, archived logs) Access patterns: {{access_patterns}} Retention: {{retention}} 1. Folder structure (medallion architecture): s3://company-data-lake/ ├── bronze/ # raw data, immutable, exactly as received │ ├── source_system=stripe/ │ ├── source_system=postgres/ │ └── source_system=salesforce/ ├── silver/ # cleaned, validated, enriched │ ├── domain=finance/ │ ├── domain=product/ │ └── domain=marketing/ ├── gold/ # business-ready aggregates, mart tables │ ├── reporting/ │ └── ml-features/ └── sandbox/ # exploratory work, not production 2. File format selection: - Parquet: columnar, compressed, best for analytical queries — use for all structured data - ORC: similar to Parquet, preferred in Hive/Hadoop ecosystems - Avro: row-oriented, schema evolution support — use for streaming and Kafka - JSON/CSV: only for bronze landing zone (raw source format) - Delta / Iceberg: Parquet + transaction log — use when ACID and schema evolution needed 3. Partitioning strategy: - Partition by ingestion date for time-series data: year=2024/month=01/day=15/ - Partition by business key for lookup data: tenant_id=abc/ - Avoid over-partitioning: < 10MB per partition file is too small (many small files problem) - Target: 100MB–1GB per partition file for Spark/Athena efficiency 4. Compaction (small file problem): - Streaming writes create many small files → poor query performance - Run a compaction job periodically: read partition, write as one large file - Delta Lake: OPTIMIZE command with Z-ORDER for layout optimization - AWS S3: S3 Intelligent-Tiering for cost optimization across file sizes 5. 
Lifecycle policies: - Bronze: retain forever (immutable raw data) - Silver: retain 3 years, move to Glacier after 1 year - Gold: retain 1 year, recreatable from silver - Sandbox: delete after 90 days Return: folder structure, file format recommendations, partitioning strategy, compaction schedule, and lifecycle policy configuration.
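The compaction trigger implied above (many files far below the 100MB–1GB target per partition) can be sketched as a simple scan of partition file sizes; the "more than one small file" threshold is an assumption to tune:

```python
def needs_compaction(file_sizes_mb, target_min_mb: int = 100) -> bool:
    """Flag a partition for compaction when it holds multiple files
    below the 100MB floor discussed above (small-files problem)."""
    small = [s for s in file_sizes_mb if s < target_min_mb]
    return len(small) > 1
```

A nightly job listing each partition and feeding sizes into this check decides which partitions the compaction (or Delta `OPTIMIZE`) run should target.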
Cloud Storage · Intermediate · Prompt
03

Delta Lake / Apache Iceberg

Implement an open table format (Delta Lake or Apache Iceberg) for ACID transactions on a data lake. Format choice: {{format}} (Delta Lake or Iceberg) Compute engine: {{engine}} (Spark, Trino, Flink, Databricks, BigQuery) Primary use case: {{use_case}} (upserts, time travel, schema evolution, multi-engine access) 1. Delta Lake vs Iceberg comparison: Delta Lake: - Best for: Databricks environments, Python/Spark workflows, simpler setup - ACID transactions via a JSON transaction log in _delta_log/ - Strong Spark integration; growing support for other engines - Optimize and Z-Order commands for layout optimization Apache Iceberg: - Best for: multi-engine environments (Spark + Trino + Flink + BigQuery) - ACID via metadata tree (manifest files + snapshot files) - Better multi-engine support (no engine lock-in) - Hidden partitioning: partition scheme can change without rewriting data 2. Core capabilities: ACID upserts (MERGE): MERGE INTO target USING source ON target.id = source.id WHEN MATCHED THEN UPDATE SET * WHEN NOT MATCHED THEN INSERT *; Time travel: -- Read data as of a point in time: SELECT * FROM orders TIMESTAMP AS OF '2024-01-15 10:00:00'; SELECT * FROM orders VERSION AS OF 42; -- specific snapshot ID Schema evolution: ALTER TABLE orders ADD COLUMN is_flagged BOOLEAN; ALTER TABLE orders RENAME COLUMN old_name TO new_name; -- Historical data is not rewritten; schema is evolved in the metadata 3. Optimize and compaction (Delta Lake): OPTIMIZE orders ZORDER BY (customer_id, order_date); -- Reorganizes file layout so related data is co-located for faster queries -- Run after bulk writes or on a daily schedule 4. Vacuum (removing old files): VACUUM orders RETAIN 168 HOURS; -- delete files older than 7 days -- Required to reclaim storage from deleted/updated rows -- Note: vacuuming too aggressively removes time travel history 5. 
Table maintenance schedule: - OPTIMIZE: daily, after the main load - VACUUM: weekly (retain at least 7 days for time travel) - Schema evolution: via PR with impact assessment Return: format selection rationale, MERGE pattern for upserts, schema evolution DDL, OPTIMIZE configuration, and maintenance schedule.
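The MERGE semantics above (`WHEN MATCHED THEN UPDATE / WHEN NOT MATCHED THEN INSERT`) can be mirrored in pure Python for intuition. This is a sketch of the upsert logic keyed on `id`, not a Delta or Iceberg API:

```python
def merge_upsert(target: dict, source_rows: list) -> dict:
    """Pure-Python sketch of MERGE keyed on 'id': update matched rows,
    insert unmatched ones. Returns a new dict; target is unchanged."""
    merged = dict(target)
    for row in source_rows:
        merged[row["id"]] = row  # matched -> update; unmatched -> insert
    return merged
```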

Cloud Warehouse

3 prompts
Cloud Warehouse · Intermediate · Prompt
01

BigQuery Optimization

Optimize BigQuery performance and cost for this workload. Workload: {{workload}} Current monthly cost: {{current_cost}} Query patterns: {{query_patterns}} Data volume: {{volume}} 1. BigQuery cost model: - On-demand: $5 per TB of data scanned (minimize bytes read) - Flat-rate / capacity pricing: reserved slot commitments for predictable workloads - Storage: $0.02/GB for active, $0.01/GB for long-term (not modified for 90 days) 2. Reducing bytes scanned: - Partition tables by date: PARTITION BY DATE(event_timestamp) Queries with WHERE event_date BETWEEN ... AND ... only scan relevant partitions - Cluster by frequently filtered columns: CLUSTER BY user_id, product_category Improves scan efficiency for queries filtering on these columns - Use INFORMATION_SCHEMA to find expensive queries: SELECT total_bytes_billed/POW(1024,3) AS gb_billed, query FROM `region-us`.INFORMATION_SCHEMA.JOBS_BY_PROJECT ORDER BY total_bytes_billed DESC LIMIT 20; 3. Schema design for BigQuery: - Prefer denormalized (nested and repeated) schemas over normalized schemas - Use STRUCT and ARRAY columns to store related data in one row - Avoids expensive cross-shard joins on normalized data - Nested repeated fields are stored in columnar format → efficient for column scans 4. Materialized views: CREATE MATERIALIZED VIEW daily_revenue AS SELECT DATE(order_date) AS d, SUM(amount) AS revenue FROM orders GROUP BY 1; - BigQuery automatically refreshes within 5 minutes of base table changes - Queries on the base table can transparently use the materialized view 5. Slot utilization: - Monitor: INFORMATION_SCHEMA.JOBS_TIMELINE for slot hours - Identify: queries with high slot_ms = compute-intensive (add partitioning) - BI Engine: in-memory acceleration for Looker and Looker Studio Return: partitioning and clustering DDL, cost investigation queries, materialized view setup, and schema design recommendation.
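Since the whole on-demand model is "pay per byte scanned," a tiny helper converting `total_bytes_billed` into dollars makes cost reviews concrete. The $5/TB rate is the figure used above; BigQuery list prices change, so verify before relying on it:

```python
ON_DEMAND_PER_TB = 5.0  # rate used in the prompt above; verify current pricing

def scan_cost(bytes_billed: int) -> float:
    """Estimated on-demand USD cost for a query's billed bytes."""
    tb = bytes_billed / (1024 ** 4)
    return round(tb * ON_DEMAND_PER_TB, 4)
```

Feeding in `total_bytes_billed` from the INFORMATION_SCHEMA query above turns the "top 20 expensive queries" list into a dollar-ranked backlog.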
Cloud Warehouse · Advanced · Prompt
02

Redshift Architecture and Tuning

Design and optimize a Redshift deployment for this workload. Workload: {{workload}} Data volume: {{volume}} Query patterns: {{query_patterns}} Cluster type: {{cluster_type}} (provisioned vs Serverless) 1. Redshift Serverless vs Provisioned: Serverless: auto-scales, pay per compute-second, no cluster management - Use for: unpredictable workloads, intermittent usage, cost optimization Provisioned: fixed cluster, predictable performance and cost - Use for: consistent heavy workloads, >$500/month sustained use 2. Table design: Distribution styles: - DISTSTYLE KEY (column): rows with the same key on the same slice — use for large JOIN tables - DISTSTYLE EVEN: round-robin — use for large tables with no clear join key - DISTSTYLE ALL: copy to every slice — use for small dimension tables (< 1M rows) Sort keys: - COMPOUND SORTKEY (col1, col2): range scan optimization on ordered columns (date) - INTERLEAVED SORTKEY: equal weight to all sort key columns — use for multiple filter patterns 3. COPY command for loading: COPY orders FROM 's3://bucket/data/orders/' IAM_ROLE 'arn:aws:iam::123456789:role/RedshiftRole' FORMAT AS PARQUET; - Use PARQUET (fastest) or CSV with GZIP compression - Parallel loading: split files into 1× number of slices for maximum parallelism 4. Vacuuming: VACUUM orders TO 100 PERCENT BOOST; -- Reclaims space from deleted rows and re-sorts unsorted rows -- Schedule weekly; automatic vacuum may not keep up with high-write tables 5. WLM (Workload Management): - Define query queues by user group or query group - Short query acceleration (SQA): auto-routes short queries to a fast lane - Concurrency scaling: auto-adds read capacity during peak periods Return: distribution and sort key design, COPY command, vacuum schedule, and WLM configuration.
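The distribution-style rules above reduce to a small decision function (small dimensions get ALL, large joinable tables get KEY, everything else EVEN):

```python
def choose_diststyle(row_count: int, has_join_key: bool) -> str:
    """Encode the heuristics above: ALL for small dimensions (<1M rows),
    KEY for large tables with a clear join column, EVEN otherwise."""
    if row_count < 1_000_000:
        return "ALL"
    return "KEY" if has_join_key else "EVEN"
```

Running this over the table inventory gives a first-pass DISTSTYLE plan to refine with actual join patterns.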
Cloud Warehouse · Intermediate · Prompt
03

Snowflake Architecture and Best Practices

Design and optimize a Snowflake deployment for this organization. Workload: {{workload}} (ELT, mixed, analytics, ML feature store) Team: {{team}} (data engineers, analysts, data scientists) Data volume: {{volume}} Cost target: {{cost_target}} 1. Snowflake architecture concepts: - Separation of storage and compute: storage billed per TB, compute per second - Virtual warehouses: compute clusters that scale independently - Databases → Schemas → Tables: same as traditional databases - Zero-copy cloning: instant clone of any table/schema/database for testing or branching 2. Virtual warehouse strategy: - One warehouse per workload type (not per team): LOADING_WH (X-Large): bulk data loading, ELT ANALYTICS_WH (Small-Medium): analyst queries TRANSFORM_WH (Medium): dbt runs ML_WH (Large): data science ad-hoc - Auto-suspend: 60 seconds (analysts), 300 seconds (loading) - Auto-resume: always enabled - Multi-cluster warehouses: for concurrent analyst workloads 3. Cost optimization: - Monitor: QUERY_HISTORY view for expensive queries - Use result cache: same query within 24h returns the cached result (free) - Use CLUSTERING KEYS on large tables filtered by a column: ALTER TABLE orders CLUSTER BY (TO_DATE(order_date)); - Use SEARCH OPTIMIZATION for high-cardinality point lookups - Tag warehouses with resource monitors to cap monthly spend 4. Snowflake roles hierarchy: ACCOUNTADMIN → SYSADMIN → custom roles - SYSADMIN: creates databases, warehouses - Custom roles: ANALYST_ROLE (SELECT), ENGINEER_ROLE (DML + DDL on own schema), LOADER_ROLE (COPY INTO) - Follow least-privilege; never use ACCOUNTADMIN for day-to-day operations 5. 
Data sharing: - Share data with other Snowflake accounts without copying (live read access) - CREATE SHARE; GRANT SELECT ON TABLE TO SHARE; ADD ACCOUNT TO SHARE - Use for: sharing data products with customers, cross-department data sharing Return: warehouse configuration, role hierarchy DDL, clustering key recommendations, cost monitoring queries, and data sharing setup.
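Because Snowflake bills per second of warehouse uptime, the auto-suspend advice above translates directly into credits. A sketch using the standard credit-per-hour doubling ladder (dollar cost is credits times your contracted rate, not shown):

```python
# Credits consumed per hour of uptime by warehouse size
# (standard doubling ladder: each size up doubles the burn).
CREDITS_PER_HOUR = {"XS": 1, "S": 2, "M": 4, "L": 8, "XL": 16}

def daily_credits(size: str, active_hours: float) -> float:
    """Estimate daily credit burn; with auto-suspend you pay only
    for hours the warehouse is actually resumed."""
    return CREDITS_PER_HOUR[size] * active_hours
```

Comparing, say, a Medium warehouse active 6 hours vs 24 hours shows the concrete value of aggressive auto-suspend settings.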

Streaming

3 prompts
Streaming · Advanced · Prompt
01

CDC Pipeline Design

Design a Change Data Capture (CDC) pipeline to replicate database changes to a cloud data platform. Source database: {{source_db}} (PostgreSQL, MySQL, SQL Server, Oracle) Target: {{target}} (Snowflake, BigQuery, Redshift, S3 Delta Lake) Volume: {{volume}} changes per second Latency requirement: {{latency}} 1. CDC methods: Log-based CDC (recommended): - Reads the database transaction log (WAL for Postgres, binlog for MySQL) - Zero impact on the source database (no queries) - Captures all changes: INSERT, UPDATE, DELETE - Tools: Debezium (open-source), AWS DMS, Airbyte, Fivetran Query-based CDC: - Periodically queries the source for rows changed since the last poll - Requires updated_at column; cannot detect deletes - Higher load on the source; simpler to set up Trigger-based CDC: - Database triggers write changes to a shadow table - Captures deletes; impacts source performance - Legacy approach; avoid for new designs 2. Debezium pipeline (log-based, Kafka): Source DB → Debezium Connector → Kafka → Sink Connector → Target PostgreSQL setup: wal_level = logical CREATE PUBLICATION debezium_pub FOR ALL TABLES; Debezium connector config: { "connector.class": "io.debezium.connector.postgresql.PostgresConnector", "database.hostname": "...", "database.port": "5432", "slot.name": "debezium_slot", "publication.name": "debezium_pub", "transforms": "unwrap", "transforms.unwrap.type": "io.debezium.transforms.ExtractNewRecordState" } 3. CDC event format: Each event contains: before (old row state), after (new row state), op (c/u/d/r for create/update/delete/snapshot) Use the after record for upserts into the target 4. Target landing pattern: - Stage all CDC events in S3/GCS as Parquet/Avro - Apply MERGE into the target table hourly: upsert based on primary key - Or: use Flink/Spark Structured Streaming to apply changes in near-real-time 5. 
Backfill / initial snapshot: - Debezium performs an initial snapshot of the full table before starting log-based CDC - For large tables: take a manual full dump, load it, then start CDC from the current LSN - Verify: row counts match between source and target after initial load Return: CDC method selection, Debezium configuration, Kafka topic design, target landing pattern, and initial snapshot strategy.
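The upsert/delete logic implied by the op codes above can be sketched in a few lines of Python. This is illustrative only: the in-memory dict stands in for the real MERGE target, and the flattened event shape assumes the ExtractNewRecordState-style record with before/after/op fields:

```python
def apply_cdc_events(table: dict, events: list[dict], pk: str = "id") -> dict:
    """Apply Debezium-style CDC events to an in-memory table keyed by primary key.

    op codes: 'c' create, 'u' update, 'd' delete, 'r' snapshot read.
    Upserts use the 'after' row image; deletes use the key from 'before'."""
    for ev in events:
        op = ev["op"]
        if op in ("c", "u", "r"):
            row = ev["after"]
            table[row[pk]] = row          # upsert from the new row image
        elif op == "d":
            table.pop(ev["before"][pk], None)  # delete by old row's key
    return table

events = [
    {"op": "c", "before": None, "after": {"id": 1, "status": "new"}},
    {"op": "u", "before": {"id": 1, "status": "new"}, "after": {"id": 1, "status": "paid"}},
    {"op": "d", "before": {"id": 1, "status": "paid"}, "after": None},
]
print(apply_cdc_events({}, events))   # → {} (row created, updated, then deleted)
```

Because the same logic is deterministic per primary key, replaying a batch of events is idempotent, which matters if the Kafka consumer redelivers after a failure.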
StreamingAdvancedPrompt
02

Real-Time Analytics Architecture

Design a real-time analytics system that can answer queries over streaming data. Use case: {{use_case}} (live dashboard, fraud detection, real-time recommendation, monitoring) Query latency requirement: {{latency}} (sub-second / seconds / minutes) Throughput: {{throughput}} events per second Cloud provider: {{provider}} 1. Architecture options by latency tier: Sub-second (operational analytics): - Pre-aggregate into a fast OLAP store (Apache Druid, ClickHouse, Apache Pinot) - These systems ingest from Kafka directly and support sub-second SQL - Trade-off: limited join support; pre-aggregation required at ingestion Seconds (near-real-time): - Streaming aggregation in Flink/Spark Streaming → Redis or DynamoDB for serving - Query latency: < 100ms from the serving layer - Useful for: live counters, session activity feeds, fraud scores Minutes (micro-batch): - Spark Structured Streaming or Flink with checkpointing every 1-5 minutes - Land in Delta Lake or Iceberg; query via Athena or BigQuery - Simpler operations than sub-second; good for most near-real-time dashboards 2. ClickHouse for real-time OLAP: - Ingests from Kafka natively (Kafka Engine table) - Columnar storage; billion-row aggregations in < 1 second - Materialized views update automatically as new data arrives - Self-managed or managed via ClickHouse Cloud / Altinity 3. Apache Pinot for real-time serving: - Designed for Uber/LinkedIn-scale user-facing analytics - Upserts supported; indexes optimized for filtering and aggregation - Real-time segment from Kafka + offline segment from S3 merged seamlessly 4. Lambda + materialized serving layer (simpler): - Batch layer: nightly aggregates materialized in the warehouse - Speed layer: streaming aggregates in Redis (last 15 minutes) - Serving layer: query combines batch + speed for a complete picture 5. 
Managed options: - BigQuery: streaming ingestion via the Storage Write API (preferred over legacy streaming inserts) for near-real-time; Bigtable for < 10ms lookups - Snowflake: Dynamic Tables (incremental refresh) for near-real-time Return: architecture for the latency tier, technology choices, ingestion and serving design, and operational considerations.
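The serving-layer merge in the Lambda option (batch aggregates plus a streaming speed layer) reduces to a keyed sum. A minimal sketch, assuming counter-style aggregates; in a real system the batch side would come from the warehouse and the speed side from Redis:

```python
def serve_counts(batch: dict[str, int], speed: dict[str, int]) -> dict[str, int]:
    """Combine nightly batch aggregates with speed-layer counts
    (e.g. the last 15 minutes) for a complete up-to-date picture."""
    merged = dict(batch)                      # start from the batch baseline
    for key, count in speed.items():
        merged[key] = merged.get(key, 0) + count  # overlay recent deltas
    return merged

print(serve_counts({"page_views:2024-01-01": 5000},
                   {"page_views:2024-01-01": 42}))
# → {'page_views:2024-01-01': 5042}
```

The key design constraint is that the speed layer must cover exactly the window the batch layer has not yet processed, otherwise events are double-counted or dropped at the boundary.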
StreamingIntermediatePrompt
03

Streaming Data Pipeline Design

Design a cloud streaming data pipeline for this use case. Cloud provider: {{provider}} Source: {{source}} (application events, CDC from database, IoT sensors, clickstream) Sink: {{sink}} (data warehouse, data lake, real-time dashboard, downstream service) Latency SLA: {{latency}} (sub-second, seconds, minutes) Throughput: {{throughput}} messages per second 1. Message queue selection: AWS Kinesis Data Streams: - Fully managed; integrates with Lambda, Firehose, Flink - Shard-based scaling: 1 shard = 1MB/s ingest, 2MB/s read - Retention: 24h default, extendable up to 365 days - Cost: per shard-hour + per PUT payload Google Pub/Sub: - Fully serverless (no shards to manage) - Auto-scales; guaranteed at-least-once delivery - Integrates tightly with Dataflow, BigQuery subscriptions Azure Event Hubs: - Kafka-compatible protocol (no code changes for Kafka producers) - Partition-based like Kinesis - Event Hubs Capture: auto-writes to ADLS Gen2 Kafka on Confluent Cloud or Amazon MSK: - Maximum flexibility and ecosystem integration - Best for: existing Kafka investment, complex routing, exactly-once semantics 2. Stream processing: - Apache Flink: stateful, exactly-once, low latency (< 1 second); best for complex CEP - Apache Spark Structured Streaming: micro-batch, easy to use, integrates with Delta Lake - Amazon Managed Service for Apache Flink (formerly Kinesis Data Analytics): fully managed Flink on AWS - Google Dataflow (Apache Beam): unified batch + streaming, serverless on GCP 3. Lambda vs Kappa architecture: Lambda: separate batch and streaming paths that merge in a serving layer - Pro: batch path can reprocess historical data; streaming path handles recent data - Con: two codebases, complexity in merging Kappa: one streaming pipeline handles everything (batch = bounded stream) - Pro: single codebase, simpler operations - Recommended for most modern architectures with replayable message queues 4. 
Exactly-once semantics: - At-least-once: messages may be reprocessed on failure → idempotent sinks required - Exactly-once: Kafka Transactions + idempotent producers + transactional sinks - For most use cases: design for at-least-once with idempotent writes Return: message queue recommendation, processing engine, Lambda vs Kappa decision, and exactly-once handling strategy.
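The "at-least-once with idempotent writes" recommendation can be sketched as a sink that deduplicates on a stable event id. Illustrative only: a production sink would bound the dedupe set with a watermark or TTL and persist it transactionally alongside the rows:

```python
class IdempotentSink:
    """Sink that tolerates at-least-once delivery: redelivered events
    (same event_id) are detected and skipped, so the effect of writing
    an event twice equals writing it once."""

    def __init__(self) -> None:
        self.seen: set[str] = set()   # ids already applied (bound by TTL in practice)
        self.rows: list[dict] = []    # stand-in for the real target table

    def write(self, event: dict) -> bool:
        """Apply the event; return False if it was a duplicate redelivery."""
        eid = event["event_id"]
        if eid in self.seen:
            return False              # duplicate: drop silently
        self.seen.add(eid)
        self.rows.append(event)
        return True

sink = IdempotentSink()
sink.write({"event_id": "evt-1", "amount": 10})
sink.write({"event_id": "evt-1", "amount": 10})   # redelivery after a retry
print(len(sink.rows))   # → 1
```

The same pattern shows why the event id must be assigned by the producer (not the consumer): a consumer-generated id would differ on redelivery and defeat the dedupe.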

Security and Governance

2 prompts
Security and GovernanceIntermediatePrompt
01

Cloud Cost Management

Implement cost monitoring and optimization for this cloud data platform. Provider: {{provider}} Current monthly spend: {{spend}} Main cost drivers: {{cost_drivers}} (compute, storage, data transfer, queries) Budget: {{budget}} 1. Cost visibility: AWS: - AWS Cost Explorer: visualize spend by service, tag, and time - Enable cost allocation tags: tag every resource with team, environment, project - AWS Budgets: set budget alerts at 50%, 80%, 100% of monthly budget - AWS Cost and Usage Report (CUR): detailed hourly billing data in S3 for analysis GCP: - BigQuery Billing export: export billing data to BigQuery for analysis - Labels on every resource (equivalent to AWS tags) - Budget alerts via Cloud Billing API Snowflake: - QUERY_HISTORY: identify expensive queries (total_elapsed_time, credits_used_cloud_services) - WAREHOUSE_METERING_HISTORY: credits consumed per warehouse - Resource monitors: cap spend per warehouse per day/week/month 2. Compute optimization: - Use spot/preemptible instances for fault-tolerant batch jobs (70-90% discount) - Right-size warehouse clusters: if avg cluster utilization < 30%, downsize - Auto-suspend warehouses when idle: 60-second suspension for transient workloads - Reserved instances / committed use discounts for stable baseline compute 3. Storage optimization: - S3 Intelligent-Tiering: auto-moves objects to cheaper tiers based on access patterns - Enforce lifecycle policies: delete temp/staging files after 7 days - Columnar formats: Parquet is 5-10x smaller than CSV → less storage and scan cost - Compression: snappy or zstd for Parquet (default in most tools) 4. Query cost optimization (BigQuery/Athena/Snowflake): - Partition pruning: WHERE clauses on the partition key - Column pruning: avoid SELECT *; project only needed columns - Result caching: identical queries hit the cache (free in Snowflake/BigQuery) - Materialized views: pre-compute expensive aggregations 5. 
FinOps process: - Monthly cost review: top 10 expensive resources, trends, anomalies - Showback / chargeback: allocate costs to teams using tags - Cost anomaly alerts: alert when spend > 150% of the 7-day rolling average Return: cost monitoring setup, tagging strategy, compute and storage optimizations, query cost reduction, and FinOps process.
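The anomaly rule above (alert when spend exceeds 150% of the 7-day rolling average) is simple enough to verify in a few lines. A sketch over a daily spend series; in practice the input would come from the billing export rather than a hard-coded list:

```python
def cost_anomalies(daily_spend: list[float], window: int = 7,
                   threshold: float = 1.5) -> list[int]:
    """Return indices of days whose spend exceeds `threshold` times the
    trailing `window`-day average. Days before a full window exists
    are skipped (no baseline to compare against)."""
    alerts = []
    for i in range(window, len(daily_spend)):
        baseline = sum(daily_spend[i - window:i]) / window
        if daily_spend[i] > threshold * baseline:
            alerts.append(i)
    return alerts

spend = [100, 100, 100, 100, 100, 100, 100, 160]
print(cost_anomalies(spend))   # → [7]  (160 > 150% of the 100/day baseline)
```

A trailing (not centered) window matters here: the anomalous day must be excluded from its own baseline, otherwise a gradual ramp-up suppresses the alert.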
Security and GovernanceIntermediatePrompt
02

Cloud Data Security

Implement security controls for this cloud data platform. Provider: {{provider}} Sensitive data types: {{sensitive_data}} (PII, PCI, PHI, financial) Compliance: {{compliance}} (SOC 2, HIPAA, GDPR, PCI-DSS) Access patterns: {{access_patterns}} 1. Identity and access management: - Use cloud IAM roles (not static credentials): EC2 instance profiles, GCP service accounts, Azure managed identities - Principle of least privilege: grant only the minimum permissions required for each service - Separate roles: data loader role, data reader role, admin role - Rotate credentials: automate rotation via AWS Secrets Manager, GCP Secret Manager, Azure Key Vault 2. Data encryption: - At-rest: cloud provider default encryption (AES-256); use customer-managed keys (CMK) for compliance - In-transit: TLS enforced for all connections to managed services - Column-level encryption: for PII fields that must be encrypted at the application layer - BigQuery: AEAD encryption functions for column-level encryption 3. Network security: - Private endpoints: connect services within a VPC without traversing the public internet - AWS: PrivateLink for S3, Redshift, and Glue - GCP: Private Google Access for Cloud Storage and BigQuery - VPC Service Controls (GCP): create security perimeters around data services 4. Data masking and tokenization: - Dynamic data masking: show masked values to non-privileged users - Snowflake: column masking policies based on role - BigQuery: authorized views with masked columns for analysts - PII tokenization: replace sensitive values with non-reversible tokens at ingestion 5. 
Audit logging: - Enable cloud provider data access logging: AWS CloudTrail, GCP Cloud Audit Logs, Azure Monitor - Log every: data access, configuration change, permission escalation - Centralize logs in a SIEM: Amazon Security Lake, Chronicle (GCP), Sentinel (Azure) - Retention: minimum 1 year for compliance Return: IAM role design, encryption configuration, network security setup, data masking policy, and audit logging configuration.
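The role-based dynamic masking described in point 4 can be sketched at the application layer. This is a toy stand-in for engine-native features like Snowflake masking policies or BigQuery authorized views; role names and the mask token are hypothetical:

```python
PRIVILEGED_ROLES = frozenset({"pii_reader", "admin"})  # hypothetical role names

def mask_row(row: dict, role: str, pii_columns: set[str]) -> dict:
    """Return a copy of the row with PII columns replaced by a mask
    token unless the caller's role is privileged. Non-PII columns
    pass through untouched for all roles."""
    if role in PRIVILEGED_ROLES:
        return dict(row)
    return {col: ("****" if col in pii_columns else val)
            for col, val in row.items()}

row = {"id": 7, "email": "a@example.com", "country": "DE"}
print(mask_row(row, "analyst", {"email"}))
# → {'id': 7, 'email': '****', 'country': 'DE'}
```

Engine-native masking policies are preferable in production because they apply at query time regardless of which tool issues the query; application-layer masking only protects one access path.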
