Prompt Engineering Techniques for Data Analysts

Most data analysts who use language models regularly have noticed something frustrating. Two people can ask what appears to be the same question and get dramatically different results. One analyst gets a precise SQL query that handles edge cases correctly and runs cleanly against their schema. Another gets a generic template that needs significant rewriting before it is useful. One analyst gets a thoughtful statistical interpretation of their findings. Another gets a surface-level summary that adds no value.

The difference is almost never the model. It is the prompt.

Prompt engineering is the practice of structuring inputs to language models to get consistently useful, accurate, and specific outputs. For data analysts specifically, it is one of the highest-leverage skills available right now. The analysts who understand how language models process instructions, what context they need to perform well, and which techniques reliably improve output quality are getting dramatically more value from AI tools than those who treat prompting as a black box.

This guide covers the prompt engineering techniques that matter most for data analysis work. Not abstract theory but concrete, applicable techniques with real examples drawn from the kinds of tasks analysts actually do. SQL generation, statistical interpretation, data cleaning logic, exploratory analysis, and communicating findings to non-technical stakeholders.

Why Prompting Matters More Than Most Analysts Realize

Before getting into specific techniques it is worth understanding why prompts affect output quality so significantly.

Language models are trained to predict the most likely continuation of a given text. What they generate is heavily influenced by what they have been primed to produce. A vague prompt primes the model to generate a generic, average response because vague prompts in the training data were typically followed by generic responses. A specific, well-structured prompt primes the model to produce a specific, well-structured response because that is what good prompts in the training data were followed by.

This means that the structure, specificity, and framing of your prompt are not superficial concerns. They shape what the model treats as the appropriate output. A prompt that says “write a SQL query for sales analysis” is asking the model to guess at everything: the database schema, the specific metric, the time period, the aggregation logic, the output format, and the audience. The model fills those gaps with the most statistically average assumptions, which rarely match your specific situation.

A prompt that provides the schema, specifies the exact metric, defines the time period, describes the edge cases that matter, and specifies the output format leaves the model with one job: applying its reasoning to your clearly defined problem. The output is correspondingly more precise and useful.

The other important mental model is that language models have no memory between prompts and no access to context you have not provided. Everything the model needs to know to do a task well must be in the prompt. Experienced prompt engineers treat this as a constraint that forces clarity: if you cannot articulate the context and requirements precisely enough to include them in a prompt, the task probably is not well-defined enough to delegate to anyone, human or AI.

Technique 1: Be Specific About Context and Constraints

The single most impactful prompt engineering technique requires no special structure or syntax. It is simply providing enough context for the model to understand your specific situation rather than a generic version of your problem.

For data analysts, context means the database schema, the business domain, the specific metric definition, the known data quality issues, and the constraints the output must satisfy. Every piece of context you omit forces the model to make assumptions. Those assumptions are often wrong in ways that are not immediately obvious.

Without context:

Write a SQL query to find the best customers.

With context:

I am working with a PostgreSQL database with the following schema:

customers (customer_id BIGINT, email VARCHAR, created_at TIMESTAMP, country VARCHAR)
orders (order_id BIGINT, customer_id BIGINT, status VARCHAR, total_amount DECIMAL, created_at TIMESTAMP)
order_items (order_item_id BIGINT, order_id BIGINT, product_id BIGINT, quantity INTEGER, unit_price DECIMAL)

"Best customers" for this analysis means highest lifetime value, defined as the
sum of total_amount across all orders with status = 'completed'.

Write a query that returns the top 100 customers by lifetime value, including
their customer_id, email, country, total lifetime value, number of completed orders,
and date of their most recent completed order.

Exclude customers created in the last 30 days since they have not had time to
establish a full purchase history.

Order results by lifetime value descending.

The second prompt eliminates every assumption the model would otherwise make. Schema is explicit. Metric definition is explicit. Filters and exclusions are explicit. Output columns are explicit. Sort order is explicit. The model’s entire cognitive effort goes into writing the correct query rather than guessing at what you mean.

A practical template for SQL generation prompts:

Database system: [PostgreSQL / BigQuery / Snowflake / etc]

Relevant tables and schemas:
[Table name] ([column_name DATA_TYPE, column_name DATA_TYPE, ...])
[Repeat for each relevant table]

Known data quality issues:
[Any nulls, duplicates, or anomalies the query should handle]

What I need:
[Specific metric or output description]

Filters and constraints:
[Any WHERE conditions, date ranges, or exclusions]

Output format:
[Column names, sort order, row limit if applicable]

Technique 2: Role Prompting

Role prompting assigns the model a specific identity or expertise before asking it to perform a task. Done well, it activates the model’s knowledge and reasoning patterns associated with that role and produces outputs that reflect the perspective and vocabulary of that role.

For data analysts, role prompting is useful in two directions. You can assign the model a technical role to get more rigorous technical output. Or you can ask the model to respond as a specific non-technical audience to evaluate whether your analysis is clear and complete from their perspective.

Assigning a technical role:

You are a senior data engineer with deep expertise in query optimization
on Snowflake. You have specific knowledge of how Snowflake's query planner
handles large table joins, partition pruning, and clustering keys.

Review the following query and identify any performance issues.
For each issue, explain why it is a problem on Snowflake specifically
and provide the optimized version with an explanation of why it performs better.

[Query here]

Assigning a domain expert role:

You are an experienced e-commerce analyst who has worked with online retail
conversion data for ten years. You understand the typical drivers of conversion
rate changes, common seasonal patterns in e-commerce, and how to distinguish
signal from noise in weekly metrics.

I am seeing a 12% drop in checkout conversion rate this week compared to
the prior four-week average. The drop is consistent across desktop and mobile.
Cart abandonment increased while add-to-cart rate stayed flat.

What are the most likely explanations for this pattern? What additional data
would you look at first to diagnose the root cause?

Assigning the audience role to evaluate communication:

You are a non-technical VP of Marketing who is comfortable with business
metrics but has no background in statistics or data science. You care about
actionable insights and business implications, not methodology.

Read the following analysis summary and tell me:
1. What questions would you have after reading this?
2. What parts are unclear or feel like unexplained jargon?
3. What seems to be missing that you would need to make a decision?

[Analysis summary here]

The audience evaluation technique is particularly powerful for data analysts who regularly present to non-technical stakeholders. Having the model simulate your audience before the actual presentation reveals gaps in your explanation that are easy to miss when you are close to the analysis.

Technique 3: Chain of Thought Prompting

Chain of thought prompting asks the model to reason through a problem step by step before producing its final answer. This technique dramatically improves accuracy on complex analytical tasks where the correct answer depends on a sequence of intermediate reasoning steps.

The reason chain of thought works is that forcing the model to make its reasoning explicit before generating an answer reduces the chance that it skips a step or makes an implicit assumption that leads to a wrong conclusion. The model is less likely to arrive at a wrong answer if it has to show the work that gets there.

Without chain of thought:

A cohort of 1,000 users signed up in January. By the end of February 600 were
still active. By the end of March 420 were still active. What is the month 2
retention rate and month 3 retention rate, and what does the retention curve
suggest about user behavior?

With chain of thought:

A cohort of 1,000 users signed up in January. By the end of February 600 were
still active. By the end of March 420 were still active.

Work through this step by step:
1. Calculate the month 1 to month 2 retention rate (users active at end of
   February divided by original cohort size)
2. Calculate the month 2 to month 3 retention rate (users active at end of
   March divided by users active at end of February)
3. Calculate the cumulative month 3 retention rate (users active at end of
   March divided by original cohort size)
4. Interpret what the relationship between these rates suggests about
   where in the user journey churn is concentrated
5. State what additional cohort data you would want to draw a stronger conclusion

The step-by-step instruction forces the model to compute each retention rate separately before interpretation, which avoids the common error of conflating the period-over-period rate with the cumulative rate. It also forces the model to distinguish between what the data shows and what additional data would be needed, which produces a more epistemically honest output.

A general chain of thought template for analytical tasks:

[Problem description and data]

Work through this systematically:
1. [First reasoning step]
2. [Second reasoning step]
3. [Third reasoning step]
4. [Interpretation step]
5. [Limitations or caveats step]

Based on your step-by-step analysis, provide your final answer.

The specific steps should match the structure of the problem. For statistical interpretation, steps might include identifying the test used, checking whether assumptions are met, interpreting the test statistic, interpreting the p-value in context, and stating what the result does and does not imply. For query debugging, steps might include reading the query structure, identifying what each clause does, checking for logical errors, and testing edge cases mentally.

Technique 4: Few-Shot Prompting

Few-shot prompting provides the model with examples of the input-output pattern you want before asking it to perform the actual task. The examples function as a demonstration that shows the model exactly what format, style, level of detail, and kind of reasoning you expect.

Few-shot prompting is particularly useful when you need a specific output format, when you want a consistent style across many outputs, or when the task involves a non-obvious convention that the model might not infer from a description alone.

Zero-shot (no examples):

Write a metric definition for "monthly active users" suitable for
inclusion in our data dictionary.

Few-shot (with examples):

Write metric definitions suitable for inclusion in our data dictionary.
Each definition should follow this format exactly, as shown in these examples:

---
METRIC NAME: Daily Active Users (DAU)
DEFINITION: Count of distinct users who performed at least one qualifying event
within a calendar day. A qualifying event is any session start, page view, or
feature interaction recorded in the events table.
GRAIN: One row per day
DATA SOURCE: events table, filtered to event_type IN ('session_start', 'page_view', 'feature_interaction')
OWNER: Product Analytics
KNOWN LIMITATIONS: Bot traffic is excluded via the is_bot flag. Users who only
triggered background sync events are not counted.
LAST UPDATED: 2024-01-15
---

METRIC NAME: 7-Day Rolling Retention
DEFINITION: Percentage of users from a given acquisition cohort who performed
at least one qualifying event in the 7-day window starting 30 days after
their first session. Measures whether users return after their initial
exploration period.
GRAIN: One row per acquisition cohort date
DATA SOURCE: Derived from events table joined to users table on user_id
OWNER: Growth Analytics
KNOWN LIMITATIONS: Requires at least 37 days of user history to calculate.
Users acquired in the last 37 days are excluded from the denominator.
LAST UPDATED: 2024-01-15
---

Now write a definition for: Monthly Active Users (MAU)

The few-shot examples communicate more precisely than any description could. The model understands the exact fields, the level of specificity expected in each field, the tone, and the convention for documenting limitations, all from seeing two examples. The output for MAU will match the established format closely.

Few-shot prompting for SQL style consistency:

I am going to give you SQL queries to write. Follow the style shown
in these examples exactly:

Example 1 input: Total revenue by month for completed orders in 2023
Example 1 output:
with completed_orders as (
    select
        order_id,
        total_amount,
        date_trunc('month', created_at) as order_month
    from orders
    where status = 'completed'
      and created_at >= '2023-01-01'
      and created_at < '2024-01-01'
)

select
    order_month,
    count(order_id)       as order_count,
    sum(total_amount)     as total_revenue,
    avg(total_amount)     as avg_order_value
from completed_orders
group by order_month
order by order_month

Example 2 input: Count of new customers by country for Q1 2024
Example 2 output:
with q1_customers as (
    select
        customer_id,
        country,
        created_at
    from customers
    where created_at >= '2024-01-01'
      and created_at < '2024-04-01'
)

select
    country,
    count(customer_id) as new_customers
from q1_customers
group by country
order by new_customers desc

Now write: Revenue per customer by acquisition channel for the last 90 days

The examples establish conventions: CTEs for intermediate logic, snake_case aliases, consistent indentation, column alignment, and no semicolons. The model follows these conventions because the examples make them unambiguous.

Technique 5: Constraint Specification

Explicitly stating what the output should not do is as important as stating what it should do. Language models have default behaviors that are sometimes useful and sometimes not. Constraint specification overrides defaults that do not fit your use case.

For data analysts, common defaults worth overriding include:

Suppress explanation when you want only code:

Write a BigQuery SQL query that calculates 28-day rolling average revenue
for each day in 2023.

Return only the SQL query. Do not include any explanation, markdown
formatting, or code fences. The output should be raw SQL that I can
paste directly into BigQuery.

Require explanation when you want understanding:

Write a Python function that calculates the Benjamini-Hochberg correction
for multiple hypothesis testing.

After the code, explain in plain language:
- What problem this correction solves
- Why a data analyst running many A/B tests simultaneously would need it
- What the q-value threshold of 0.05 means in practical terms

Do not assume statistical background in the explanation.

Prevent hallucination of specific values:

I am going to share a dataset description. Based on this description,
suggest five hypotheses worth investigating.

Important constraints:
- Do not invent specific numbers or statistics not present in the description
- When you are uncertain, say so explicitly rather than speculating confidently
- Mark each hypothesis as high, medium, or low confidence based on how
  directly the description supports it

Dataset description:
[Description here]

Enforce output format:

Analyze the following A/B test results and provide your interpretation.

Format your response as follows and use no other format:

RESULT: [one sentence stating whether the test achieved significance]
EFFECT SIZE: [the observed difference and its practical magnitude]
RECOMMENDATION: [one sentence on what to do based on these results]
CAVEATS: [bullet points listing any concerns about the test validity]
NEXT STEPS: [bullet points listing what to investigate or test next]

Constraint specification is especially important when prompts will be used repeatedly in a workflow, such as in a script that calls an LLM API to generate analysis summaries automatically. Inconsistent output format from a model following its defaults causes downstream parsing problems. Explicit format constraints produce consistent outputs that are easier to process programmatically.

Technique 6: Prompt Chaining

Prompt chaining breaks a complex analytical task into a sequence of simpler prompts where each prompt builds on the output of the previous one. This is more effective than attempting to get a complex end-to-end result from a single prompt for several reasons.

Complex single prompts force the model to do too many things simultaneously. Quality tends to degrade when a model is asked to load data, clean it, analyze it, interpret the results, and format the output all in one step. Breaking the task into stages allows each stage to be done well before the next builds on it.

Chaining also creates inspection points where you can review intermediate outputs before they propagate. If the data cleaning logic in step two produces a wrong result, you catch it before step four builds analysis on top of it.

Example: chained analysis of customer churn

Prompt 1: Define the problem precisely

I want to analyze customer churn in our subscription business.

Help me define the analysis precisely by answering:
1. What are the two or three most common definitions of churn for
   subscription businesses and what are the tradeoffs of each?
2. Given that our business has monthly and annual subscriptions,
   which definition is most appropriate and why?
3. What is the minimum data I need to calculate this metric?

Do not write any code yet. Focus only on getting the definition right.

Prompt 2: Design the query logic

Based on our agreed churn definition from our discussion (a customer is
churned if they have not renewed within 30 days of their subscription
end date), help me design the query logic.

Our schema:
subscriptions (subscription_id, customer_id, plan_type, start_date, end_date, renewed BOOLEAN)
customers (customer_id, acquired_date, acquisition_channel, country)

Walk through the logic of the query step by step before writing any SQL.
Identify any edge cases that the logic needs to handle, such as customers
with multiple subscriptions, customers who upgraded or downgraded, and
customers who cancelled mid-period.

Prompt 3: Write and validate the query

Based on the logic we designed, write the PostgreSQL query.

After the query, write five specific test cases I should run to
validate that it handles the edge cases correctly. For each test case,
describe the input data pattern and what the query should return.

Prompt 4: Interpret the results

I ran the churn analysis query and here are the results:

[Paste actual results here]

Provide a structured interpretation:
1. What does the overall churn rate suggest about business health
   relative to typical SaaS benchmarks?
2. What patterns in the data are most worth investigating further?
3. What hypotheses do these results suggest about the drivers of churn?
4. What additional analysis would you recommend as immediate next steps?

Each prompt in the chain does one thing well. The analytical quality of each stage is higher than it would be if all four stages were requested simultaneously.

Technique 7: Self-Critique and Revision Prompting

Asking the model to critique its own output before you accept it produces better results than accepting the first response. Language models can identify weaknesses in their outputs when explicitly asked to do so, even when those weaknesses were not apparent in the initial generation.

Basic self-critique:

[Previous prompt and model response]

Now critique your own response:
1. What assumptions did you make that might not hold in my specific situation?
2. What are the weaknesses or limitations of the approach you suggested?
3. What important considerations did you omit?
4. Revise your response to address the most significant of these issues.

Domain-specific self-critique for statistical analysis:

You just provided an interpretation of an A/B test result.

Before I accept this interpretation, critique it from the perspective
of a statistician who is skeptical of overconfident conclusions:

1. Did the interpretation correctly distinguish between statistical
   significance and practical significance?
2. Were any of the assumptions of the test violated based on the
   data I provided?
3. Did the interpretation overstate the strength of the evidence?
4. What alternative explanations for the result were not considered?

Revise the interpretation to address any issues you identify.

Self-critique for SQL correctness:

You just wrote a SQL query for me.

Review the query as if you were a code reviewer looking for bugs:
1. Are there any join conditions that could produce duplicate rows?
2. Are there any filter conditions that might incorrectly exclude
   valid records or include invalid ones?
3. Does the aggregation logic handle NULL values correctly?
4. Are there any edge cases in the data (empty tables, all-NULL columns,
   customers with no orders) that would cause incorrect results?

Fix any issues you identify and explain what you changed and why.

Self-critique is most valuable for high-stakes outputs like production queries, statistical interpretations going into reports, and analyses that will inform significant decisions. The extra step of prompting for critique and revision consistently produces more reliable outputs.

Technique 8: Structured Output Specification

Specifying the exact structure of the output you want, including field names, data types, and nesting, produces outputs that are easier to use programmatically and more consistent across repeated calls. This is especially important when building workflows that use LLM outputs as inputs to other systems.

For analysis summaries going into reports:

Analyze the following sales performance data and return your analysis
in exactly this JSON structure:

{
  "executive_summary": "2-3 sentence summary for non-technical audience",
  "key_findings": [
    {
      "finding": "one-sentence description of finding",
      "supporting_data": "specific numbers from the data that support it",
      "confidence": "high | medium | low",
      "confidence_rationale": "why this confidence level"
    }
  ],
  "anomalies": [
    {
      "description": "what the anomaly is",
      "possible_explanations": ["explanation 1", "explanation 2"],
      "recommended_investigation": "what to look at next"
    }
  ],
  "recommended_actions": [
    {
      "action": "specific recommended action",
      "priority": "high | medium | low",
      "expected_impact": "what outcome this action is expected to produce"
    }
  ],
  "data_quality_concerns": ["concern 1", "concern 2"]
}

Return only the JSON. No preamble, no explanation, no markdown.

Data: [paste data here]

For generating dbt model documentation:

Generate documentation for the following dbt model in exactly this YAML format:

models:
  - name: [model_name]
    description: |
      [2-3 sentence description of what this model contains,
       who uses it, and what decisions it supports]
    columns:
      - name: [column_name]
        description: [one sentence description]
        tests:
          - [relevant tests: unique, not_null, accepted_values, relationships]

Model SQL:
[paste SQL here]

Return only the YAML. No explanation or surrounding text.

Structured output specification is the foundation of any workflow that uses LLM outputs programmatically. When the output format is consistent and predictable, it can be parsed, stored, and passed to downstream processes reliably.

Technique 9: Iterative Refinement

Prompting is rarely a one-shot process for complex tasks. Iterative refinement, treating a prompt as the first draft of a conversation rather than a complete specification, produces better results than trying to write a perfect prompt before seeing any output.

The practical workflow is to write a reasonable first prompt, review the output critically, identify the specific ways it falls short, and add targeted instructions that address those gaps.

First iteration:

I have a dataset of 50,000 customer support tickets with the following columns:
ticket_id, created_at, resolved_at, category, priority, customer_id, agent_id,
satisfaction_score (1-5), resolution_text.

Suggest an exploratory analysis plan for understanding support team performance.

After reviewing the first response, add refinements:

Your suggestions are a good starting point. Refine the analysis plan with
these additional constraints:

1. We care most about agent performance differences, not aggregate team metrics.
   Prioritize analyses that surface individual agent strengths and weaknesses.

2. Satisfaction score has 40% missing values. Include how to handle this
   in each analysis that uses it.

3. We have a known issue where tickets created on weekends are systematically
   reassigned Monday morning, inflating Monday resolution times. Flag which
   analyses this affects and how to account for it.

4. Add specific SQL for the three analyses you consider highest priority.
   Use PostgreSQL syntax with our exact column names.

After reviewing the second response:

The agent performance analysis looks good. For the satisfaction score analysis,
the approach you suggested will be biased because higher-priority tickets have
lower satisfaction scores on average and some agents handle more high-priority
tickets than others.

Revise the agent satisfaction analysis to control for ticket priority.
Show me the SQL that would calculate satisfaction scores adjusted for
the priority mix each agent handles.

Each iteration sharpens the output by targeting the specific gaps or errors in the previous response. The third prompt in this chain would have been impossible to write initially because it addresses a specific issue that only became visible after seeing the second response.

Practical Prompt Templates for Data Analysts

SQL generation:

Database: [system]
Schema: [table definitions]
Task: [specific query description]
Edge cases to handle: [nulls, duplicates, date ranges, etc]
Output: [columns, sort order, format]
Style: [any conventions to follow]

Statistical interpretation:

I ran [test name] to compare [what was compared].
Results: [test statistic, p-value, confidence interval, effect size]
Sample sizes: [n for each group]
Context: [what decision this analysis supports]

Interpret these results for a [technical/non-technical] audience.
Cover: statistical significance, practical significance, limitations,
and what conclusion is and is not supported by this data.

Data quality investigation:

I am investigating a data quality issue:
Symptom: [what you observed]
Table: [table name and relevant columns]
Expected behavior: [what should be happening]
Actual behavior: [what is actually happening]

Suggest: diagnostic queries to identify root cause, likely explanations
ranked by probability, and remediation approach for each explanation.

Exploratory analysis planning:

Dataset: [description of table, columns, row count, time period]
Business context: [what this data represents]
Goal: [what question or decision this analysis should inform]
Constraints: [time, known data quality issues, required format]

Produce: a prioritized analysis plan with specific metrics to calculate,
hypotheses to test, and visualizations to create. For the top three
priorities, include the SQL.

Finding communication:

Audience: [role and technical level]
Finding: [what the data shows]
Context: [why this matters to the business]
Confidence: [how certain you are and why]

Write: a [email/slide/memo] communicating this finding.
Requirements: [length, tone, whether to include methodology]

Prompt Engineering Cheat Sheet for Data Analysts

Technique	When to Use	Key Pattern
Context specification	Always, for any domain-specific task	Provide schema, definitions, constraints explicitly
Role prompting	Technical depth or audience simulation	“You are a [role] with expertise in [domain]”
Chain of thought	Multi-step reasoning, calculations	“Work through this step by step: 1… 2… 3…”
Few-shot prompting	Consistent format or style needed	Provide 2-3 input/output examples before the task
Constraint specification	Control defaults, enforce format	“Return only X. Do not include Y.”
Prompt chaining	Complex multi-stage tasks	Break into sequential single-purpose prompts
Self-critique	High-stakes outputs, SQL, statistics	“Now critique your response for [specific issues]”
Structured output	Programmatic use, reports, pipelines	Specify exact JSON or YAML schema
Iterative refinement	When first output is close but not right	Target specific gaps with focused follow-up prompts

Common Prompting Mistakes Data Analysts Make

Asking vague questions and accepting vague answers. “Analyze this data” produces a generic analysis. “Calculate the 30-day retention rate for each acquisition channel, identify which channels show statistically different retention, and explain what might drive the differences” produces a specific, useful analysis. Vagueness in gets vagueness out.

Not providing the schema. Writing SQL prompts without the actual table and column names forces the model to invent schema that almost certainly differs from yours. Always paste the relevant CREATE TABLE statements or column lists. The model writes dramatically better SQL when it knows the exact schema.

Accepting the first response without review. Language models are confident even when wrong. A SQL query that looks correct can have subtle logical errors, incorrect join conditions, or edge case failures that only appear when you run it against real data. Self-critique prompting and careful review before execution are non-negotiable for production queries.

Using the same prompt for different audiences. A statistical interpretation written for a data scientist audience is not appropriate for a VP of Sales. Always specify the audience and their technical level explicitly. The same analysis requires completely different framing, vocabulary, and emphasis depending on who will read it.

Treating a bad output as evidence the model cannot do the task. When a model produces a poor output, the problem is almost always the prompt. Before concluding the model cannot do something, try adding context, adding examples, breaking the task into smaller steps, or asking for step-by-step reasoning. Most apparent capability failures are prompt failures.

Not saving prompts that work. When you write a prompt that produces consistently good outputs for a recurring task, save it as a template. The investment in crafting a good prompt is wasted if you start from scratch every time you have the same task.

FAQs

What is prompt engineering and why should data analysts learn it?

Prompt engineering is the practice of structuring inputs to language models to get consistently useful, accurate, and specific outputs. Data analysts should learn it because it is one of the highest-leverage skills for getting value from AI tools. The difference between a well-engineered prompt and a vague one is often the difference between a production-ready SQL query and a generic template that needs significant rework, or between a nuanced statistical interpretation and a surface-level summary. As LLM tools become standard in analyst workflows, prompting skill increasingly determines how much analysts can accomplish with them.

What is the difference between zero-shot and few-shot prompting?

Zero-shot prompting asks the model to perform a task without any examples. Few-shot prompting provides two to five examples of the input-output pattern before asking the model to perform the actual task. Few-shot prompting is more effective when you need a specific output format, a consistent style, or adherence to a non-obvious convention that would be difficult to describe precisely but easy to demonstrate. For data analysts, few-shot prompting is particularly useful for generating documentation in a specific format, writing SQL in a consistent style, and producing analysis summaries with a consistent structure.

How do I write better SQL prompts?

The most impactful improvement is always providing the full schema for relevant tables rather than describing the data abstractly. Include the exact table names, column names, and data types. Specify the database system because SQL dialects differ significantly. Define any business metrics precisely rather than using shorthand. List edge cases the query should handle. Specify the exact output columns and sort order. Ask the model to critique the query for logical errors after writing it. These steps together eliminate most of the assumption-making that causes SQL prompts to produce wrong or unusable queries.

What is chain of thought prompting and when should I use it?

Chain of thought prompting asks the model to reason through a problem step by step before producing its final answer. It is most useful for tasks where the correct answer depends on a sequence of intermediate reasoning steps, such as statistical calculations, multi-step data transformations, metric derivations, or analytical interpretations that require considering multiple factors. Explicitly listing the steps you want the model to work through before answering reduces errors that occur when models skip steps or make implicit assumptions that go unexamined.

How do I get consistent output format from an LLM for a recurring analysis task?

Use a combination of explicit format specification, few-shot examples, and constraint instructions. Specify the exact format you want, whether JSON, YAML, or a structured text template, and provide an example of a correctly formatted output. Include explicit constraints like “return only the JSON, no preamble or explanation.” If the task runs repeatedly in a workflow, test the prompt on multiple inputs to verify format consistency before relying on it in production. Saving the prompt as a reusable template ensures consistent results across multiple uses.