LLM Engineer20 prompts5 categoriesBeginner → Advanced19 prompts · 1 chains

LLM Engineer AI Prompts

LLM Engineer AI prompt library with 20 prompts in 5 categories. Copy templates for real workflows in analysis, modeling, and reporting. Browse 5 categories and copy prompts you can use as-is or adapt to your stack.

Browse LLM Engineer prompt categories

5 categories

LLM Infrastructure

AI prompts for LLM infrastructure design including model serving, latency optimization, cost controls, observability, and reliability operations.

5 promptsAgentic System DesignFull LLM Application Chain

→

Fine-tuning

AI prompts for fine-tuning LLM workflows including dataset curation, objective selection, validation strategy, and performance-risk tradeoff analysis.

4 promptsFine-tuning Data PreparationFine-tuning Evaluation

→

Prompt Engineering

AI prompts for prompt engineering including instruction design, constraint framing, output control, and iterative prompt quality improvement.

4 promptsChain-of-Thought and Reasoning PromptsPrompt Design Principles

→

RAG and Retrieval

AI prompts for RAG and retrieval systems including chunking strategy, embedding quality, retrieval tuning, and grounded answer generation.

4 promptsAdvanced RAG ArchitecturesRAG Evaluation Framework

→

Evaluation and Safety

AI prompts for LLM evaluation and safety including quality benchmarks, hallucination checks, guardrails, and risk-aware deployment criteria.

3 promptsLLM Benchmark and Evaluation SuiteLLM Hallucination Detection

→

Advanced search and filtering

Browse all prompts in this role with category, skill-level, type, and text filtering.

Showing 20 of 20 prompts

LLM Infrastructure

5 prompts

LLM InfrastructureAdvancedPrompt

Agentic System Design

Design a reliable LLM agent system that uses tools to complete multi-step tasks. Agent task: {{task}} Available tools: {{tools}} (web search, code execution, database query, API calls, file operations) Reliability requirement: {{reliability}} (best-effort or guaranteed completion) Human-in-the-loop: {{hitl}} (yes/no — is human approval required for certain actions?) 1. Agent architecture: ReAct loop (Reasoning + Acting): - Thought: the agent reasons about what to do next - Action: the agent selects and calls a tool - Observation: the agent receives the tool result - Repeat until the agent decides the task is complete Plan-and-execute (more reliable for complex tasks): - Planning step: decompose the task into a sequence of sub-tasks - Execution: execute each sub-task sequentially (or in parallel where possible) - Re-planning: if a step fails, re-plan from the current state 2. Tool design: - Each tool has: name, description (the agent reads this to decide when to use it), input schema, output schema - Tools must be: idempotent where possible (safe to retry), fast (< 5s for most tools), well-scoped (do one thing well) - Tool description quality is critical: the agent's tool selection depends entirely on the description - Validation: validate tool outputs before passing to the next step 3. Error handling and retries: - Transient failures: retry the tool call up to 3 times with backoff - Persistent failures: skip the step and log; reroute to a fallback tool if available - Maximum iterations: set a hard limit (e.g., 20 steps) to prevent infinite loops - Checkpoint saving: save the agent's state after each completed step; resume from the last checkpoint on failure 4. Safety for agentic systems: - Minimal footprint: request only the permissions needed for the current task - Human approval gates: require human confirmation before irreversible actions (sending emails, deleting data, making payments) - Sandboxed execution: run code in an isolated container (e.g., E2B sandbox) - Audit log: log every action the agent takes, every tool it calls, and every decision it makes 5. Frameworks: - LangGraph: production-grade graph-based agent framework with state management - LlamaIndex Agents: strong for RAG-augmented agents - AutoGen (Microsoft): multi-agent conversation framework - Pydantic AI: type-safe agent framework with validation - Anthropic's computer use: for agents that interact with GUIs Return: agent architecture selection, tool specification schema, error handling strategy, safety controls, and framework recommendation.

Browse LLM Engineer prompt categories

LLM Infrastructure

Fine-tuning

Prompt Engineering

RAG and Retrieval

Evaluation and Safety

Advanced search and filtering

LLM Infrastructure

Fine-tuning

Prompt Engineering

RAG and Retrieval

Evaluation and Safety

Other AI prompt roles