What is an index, and how does a B-tree index work under the hood?

An index is a separate, ordered data structure that lets the database find rows without scanning the whole table. A **B-tree** (balanced tree) index stores keys in sorted order across a shallow, wide tree of pages: internal nodes hold routing keys, and leaf nodes hold the indexed value plus a pointer to the actual row (or the row itself, for a clustered index). Lookups, range scans, and sorted retrieval are all O(log n) against the tree's height, which stays small (typically 3-4 levels) even for tables with millions of rows.

What's the difference between a clustered and a non-clustered index?

A **clustered index** determines the actual physical storage order of table rows — the leaf level of the index *is* the data itself, so a table can have at most one. A **non-clustered (secondary) index** is a separate structure whose leaves store a pointer back to the row's location (or, in engines like InnoDB, the clustering key), rather than the row's data itself — a table can have many.

What is a covering index, and how does it avoid a key lookup?

A covering index is one that includes every column a query needs — both the filter/join columns and the selected output columns — so the engine can satisfy the entire query from the index alone, without a second trip to the base table (a "key lookup"). This is done by including extra columns in the index, either as regular indexed key columns or via an `INCLUDE`/`STORING` clause that stores them at the leaf level without making them part of the sort key.

Why does column order matter in a composite (multi-column) index?

A composite index is sorted by its first column, then its second within each value of the first, and so on — like a phone book sorted by last name, then first name. A query can only use the index efficiently for a **left-to-right prefix** of the indexed columns: an index on `(a, b, c)` helps queries filtering on `a`, or on `a AND b`, or on `a AND b AND c`, but does not efficiently help a query filtering on `b` alone or `c` alone.

What is index selectivity/cardinality, and why does the optimizer care?

**Cardinality** is the number of distinct values in a column; **selectivity** is the ratio of distinct values to total rows (higher selectivity = fewer rows match a typical value). The query optimizer uses selectivity estimates (from stored statistics) to decide whether using an index is actually cheaper than a full table scan — a low-selectivity index (e.g., a boolean `is_active` column) often gets ignored because matching "half the table" via random index lookups is slower than just scanning sequentially.

How do you read and interpret a query execution plan (EXPLAIN / EXPLAIN ANALYZE)?

`EXPLAIN` shows the plan the optimizer *would* use — join order, access methods (scan vs. index), and estimated cost/row counts — without running the query. `EXPLAIN ANALYZE` actually executes the query and adds real timing and actual row counts alongside the estimates, which is essential for spotting cases where the optimizer's estimate was wrong (a major cause of bad plans). Read plans from the innermost/deepest operations outward, and look for scan types, large estimate-vs-actual row-count gaps, and expensive operations like sorts or nested loops over large row counts.

What are the downsides of adding too many indexes to a table?

Every index must be updated on every `INSERT`, `UPDATE` (of an indexed column), and `DELETE` — so more indexes mean slower writes and more lock contention. Indexes also consume disk space and memory (competing with data pages for cache), and can actively mislead the optimizer into a worse plan when there are many overlapping, redundant options to choose between. Index maintenance (rebuilding, statistics updates) also adds operational overhead.

What's the difference between a hash index and a B-tree index?

A **hash index** maps a hashed key directly to a bucket location, giving O(1) average-case equality lookups, but has no concept of order — it can't support range queries, sorting, or prefix matching. A **B-tree index** maintains sorted order, giving O(log n) lookups but also supporting ranges, sorting, and prefix matches. Most databases default to B-tree indexes because they cover a much broader set of query patterns; hash indexes are a narrower optimization for pure equality-lookup workloads.

How do functions or expressions on a column in WHERE affect index usage (sargability)?

A predicate is **sargable** ("Search ARGument ABLE") if the optimizer can use an index to evaluate it directly. Wrapping an indexed column in a function or expression (`WHERE UPPER(email) = 'X'`, `WHERE price * 1.1 > 100`) usually makes the predicate non-sargable, because the index stores the *raw* column values, not the transformed result — forcing a full scan that computes the function on every row. The fix is either to rewrite the predicate so the raw column is compared directly, or to create a functional/expression index matching the transformation.

What's the difference between database statistics and indexes in query optimization?

An **index** is a physical data structure that speeds up finding specific rows. **Statistics** are metadata the optimizer uses to *decide* whether and how to use an index in the first place — value distributions, distinct counts, histograms, and table/index sizes. An index with stale or missing statistics can be effectively invisible to a good query plan, even though the index itself is perfectly healthy, because the optimizer is reasoning from an outdated or wrong picture of the data.

How would you diagnose and fix a slow query in production?

Get the actual execution plan (`EXPLAIN ANALYZE`) rather than guessing, compare estimated vs. actual row counts to spot stale statistics, check for missing/unused indexes and non-sargable predicates, and look for join-order or row-explosion issues. Fix the highest-leverage cause first — usually a missing index or a rewritten predicate — verify with the plan again, and only then consider heavier changes like denormalization, caching, or schema changes.

Indexing and Query Performance

How indexes work internally, reading execution plans, and diagnosing slow queries.

Questions

11 total

11 questions in this section

Difficulty

Open as page

Without an index, finding a row requires a full table scan — reading every page of the table to check each row against the condition, which is O(n).

What an index actually is

An index is a redundant, auxiliary copy of one or more columns' worth of data, stored in a structure optimized for searching — most commonly a B-tree (or its variant, the B+tree, which most real databases actually use). Maintaining it costs extra storage and slows down writes slightly (every INSERT/UPDATE/DELETE must also update every affected index), in exchange for much faster reads on the indexed column(s).

B-tree structure

                    [ 50 ]
                 /          \
          [ 20, 35 ]      [ 70, 90 ]
          /   |   \        /   |   \
      leaf  leaf  leaf  leaf  leaf  leaf   <- sorted, linked leaf pages
       (actual key values + row pointers)

Internal (non-leaf) nodes hold routing keys used purely to decide which child to descend into — they don't necessarily correspond to real rows.
Leaf nodes hold the actual indexed key values in sorted order, each paired with a pointer to the row's physical location (a "row ID"/"ctid"/"RID," depending on engine) — or, for a clustered index, the leaf is the row itself.
Leaf pages are typically linked together in a doubly-linked list, so once you've found your starting point, a range scan (BETWEEN, >, <, ORDER BY) can walk forward/backward through sorted leaves without re-descending the tree each time.

Why B-trees specifically (not a plain binary tree)

A B-tree is wide and shallow rather than narrow and deep — each node holds many keys (often hundreds, sized to match a disk page), so even a huge table needs only 3-4 levels of tree traversal to reach any leaf. This matters because each level touched is potentially a disk I/O (or at least a cache-line/page fetch); minimizing tree height directly minimizes I/O for a lookup. A plain binary tree, by contrast, would need log₂(n) levels — vastly more for large n — because each node holds only one key.

What a B-tree index is good at, and not good at

Excellent for: equality lookups (WHERE id = 5), range queries (WHERE price BETWEEN 10 AND 50), sorted retrieval (ORDER BY indexed_col), and prefix matching (WHERE name LIKE 'Smith%').
Not useful for: conditions that don't preserve the sorted-order relationship, like WHERE name LIKE '%Smith' (leading wildcard) or applying a function to the indexed column (WHERE UPPER(email) = ...) unless a matching functional/expression index exists — see the sargability question.

Alternatives worth knowing exist

Hash indexes (equality-only, no range support, no sort order — see the hash-vs-B-tree question), GiST/GIN indexes (full-text search, geometric data, JSONB containment in PostgreSQL), and bitmap indexes (low-cardinality columns in analytical/OLAP systems) all trade the B-tree's general-purpose versatility for better performance on a narrower class of queries.

Related Resources

Use the Index, Luke: Anatomy of an Index

Open as page

Clustered index — defines physical row order

The table's rows are physically stored sorted by the clustered index's key. There can be only one per table, because rows can only be physically ordered one way.

-- SQL Server: explicit clustered index, often on the primary key by default
CREATE CLUSTERED INDEX ix_orders_id ON orders(id);

When you look up by the clustered key, the engine finds the leaf page and that page is the full row — no second lookup needed.

MySQL/InnoDB specifics: every InnoDB table always has a clustered index — if you declare a PRIMARY KEY, that becomes the clustered index; if you don't, InnoDB creates a hidden 6-byte row ID and clusters on that internally. This is unlike SQL Server, where a table can be a "heap" with no clustered index at all.

PostgreSQL specifics: PostgreSQL doesn't maintain clustering automatically — CLUSTER table USING index physically reorders the table once, but subsequent inserts don't preserve that order unless you re-run CLUSTER. So PostgreSQL tables are effectively heap-organized by default, and "clustered index" in the SQL Server/MySQL sense doesn't map directly.

Non-clustered (secondary) index — a separate lookup structure

CREATE INDEX ix_orders_customer_id ON orders(customer_id);

This index's leaf nodes store customer_id values in sorted order, each paired with a pointer to the actual row — in SQL Server, a row locator (physical page/slot); in InnoDB, the value of the clustered key (id), since InnoDB's secondary indexes always store the primary key rather than a physical address (so that the clustered index can be reorganized without invalidating every secondary index).

The "bookmark lookup" / "key lookup" cost

Querying by a non-clustered index column, but selecting other columns not in that index, requires two steps: (1) traverse the secondary index to find the pointer/key, then (2) go to the clustered index (or heap) to fetch the full row. This second step — a key lookup or bookmark lookup — is where a covering index (see next question) helps by avoiding it entirely.

-- Needs a key lookup: ix_orders_customer_id doesn't contain 'total'
SELECT total FROM orders WHERE customer_id = 42;

Choose the clustered index key (often the primary key) based on the most common range-scan/sequential access pattern for that table — e.g., an auto-incrementing id or a time-ordered column, since sequential inserts into a clustered index avoid the page-splitting overhead that inserting into the middle of a random-order clustered key causes. Add non-clustered indexes for other frequently-filtered/joined columns.

Related Resources

SQL Server: Clustered and Nonclustered Indexes

Open as page

The problem a covering index solves

CREATE INDEX ix_orders_customer_id ON orders(customer_id);

SELECT id, customer_id, total FROM orders WHERE customer_id = 42;

The index on customer_id quickly finds matching rows, but total isn't in that index — so for every matching row, the engine must do an extra key/bookmark lookup back to the full table (or clustered index) just to fetch total. On a query returning many rows, that's many extra random I/Os.

Making the index cover the query

-- Option A: add 'total' as a regular index column
CREATE INDEX ix_orders_customer_covering ON orders(customer_id, total);

-- Option B (SQL Server): INCLUDE non-key columns at the leaf level only
CREATE INDEX ix_orders_customer_covering ON orders(customer_id) INCLUDE (total);

-- Option B (PostgreSQL): the equivalent is INCLUDE in CREATE INDEX (v11+)
CREATE INDEX ix_orders_customer_covering ON orders(customer_id) INCLUDE (total);

Now the same query can be answered entirely from the index's leaf pages — an index-only scan — with no trip to the base table at all, because every column the query needs (customer_id to filter, total to return; id is implicitly available via the clustering key) is present in the index.

Key columns vs INCLUDE(d)/STORING columns

Putting total directly in the index key (Option A) makes it part of the sort order too, which enlarges the key, affects ORDER BY/range-scan usefulness, and duplicates it into every level of the B-tree. An INCLUDE/STORING clause (Option B) stores the extra column only at the leaf level, not in internal nodes, and doesn't affect sort order — generally the more efficient choice when the extra column is only needed for the SELECT list, not for filtering or sorting.

Caveats

Covering indexes trade write cost and storage for read speed — every additional column stored means more data duplicated and updated on every write. Don't cover every possible query; reserve this for genuinely hot, high-value queries.
SELECT * defeats covering indexes almost by definition — the engine can't predict which columns you'll need in the future, and a covering index for "all columns" is just... the whole table. Covering indexes work best paired with narrow, deliberate SELECT lists.
PostgreSQL's index-only scans additionally require the visibility map to confirm a page's rows are all visible to the current transaction — under heavy write/vacuum churn, PostgreSQL can silently fall back to a regular index scan (with key lookups) even when the index technically covers the query.

Related Resources

Use the Index, Luke: Covering Indexes

Open as page

CREATE INDEX ix_orders_customer_status_date
ON orders(customer_id, status, order_date);

The phone book analogy

Think of this index like a phone book sorted by (last name, first name). You can efficiently find "everyone named Smith," or "everyone named Smith, John" — but you cannot efficiently find "everyone whose first name is John" without scanning the whole book, because first names aren't sorted independent of last name.

Which queries this index helps

-- Efficient: uses the full 3-column prefix
WHERE customer_id = 42 AND status = 'shipped' AND order_date > '2024-01-01'

-- Efficient: uses the leading 2-column prefix (order_date unconstrained is fine)
WHERE customer_id = 42 AND status = 'shipped'

-- Efficient: uses just the leading column
WHERE customer_id = 42

-- NOT efficiently helped: doesn't start with customer_id
WHERE status = 'shipped'
WHERE order_date > '2024-01-01'

-- Partially helped: customer_id can use the index, but status is skipped
-- (this is a "range then unordered" scenario -- order_date range breaks the
-- ability to also use status as a further seek predicate in most engines)
WHERE customer_id = 42 AND order_date > '2024-01-01'

Equality columns before range columns

A useful rule of thumb when deciding column order: put columns used with equality (=) before columns used with a range (>, <, BETWEEN, LIKE 'prefix%'). Once the index encounters a range condition, it can still narrow down using that range, but everything after it in the index can no longer be used to further narrow the search within that range efficiently — the equality columns should exhaust their filtering power first.

-- Good: status (equality) before order_date (range)
CREATE INDEX ix_orders_status_date ON orders(status, order_date);
WHERE status = 'shipped' AND order_date > '2024-01-01'   -- both columns pull weight

-- Worse: order_date (range) before status (equality)
CREATE INDEX ix_orders_date_status ON orders(order_date, status);
WHERE status = 'shipped' AND order_date > '2024-01-01'   -- status can't narrow further after the range scan begins

Design composite indexes around your actual query patterns, leading with the column(s) most consistently filtered by equality across your hottest queries. It's common (and fine) to need multiple composite indexes with different column orderings if your application has several distinct hot query shapes against the same table — but each additional index has a write-cost tradeoff, so don't create one per query without checking for meaningful overlap first.

Related Resources

Use the Index, Luke: The Order of Columns in an Index

Open as page

Definitions

Cardinality: the count of distinct values in a column. gender (2-3 values) has low cardinality; email (nearly all unique) has high cardinality.
Selectivity: distinct_values / total_rows. A selectivity close to 1 means most values are unique (highly selective — a typical WHERE col = x matches very few rows); a selectivity close to 0 means values repeat heavily (poorly selective — a typical WHERE col = x matches a large fraction of the table).

Why the optimizer cares

An index lookup for a value that matches, say, 1% of rows is a huge win over a full scan — jump straight to the relevant rows. But an index lookup for a value that matches 50% of rows can be worse than a full scan: each matched row potentially requires a separate random-access page fetch (via a key lookup, unless it's a covering or clustered index), whereas a sequential full scan reads pages in order, which is much friendlier to disk/OS-level read-ahead and caching.

-- is_active: 95% of rows are TRUE, 5% are FALSE -- very low selectivity for TRUE
CREATE INDEX ix_users_active ON users(is_active);

SELECT * FROM users WHERE is_active = true;
-- Optimizer likely IGNORES the index and does a full table scan --
-- fetching 95% of the table via random-access index lookups would be slower.

SELECT * FROM users WHERE is_active = false;
-- Optimizer likely USES the index here -- only 5% of rows match.

This is why the same index, on the same column, can be used for one query and ignored for another — the optimizer's decision depends on the specific value's estimated selectivity, not just whether an index technically exists.

How the optimizer knows selectivity

Databases maintain statistics — histograms and distinct-value counts per column, refreshed periodically (ANALYZE in PostgreSQL, auto-updated stats in SQL Server/MySQL, or manually triggered). Stale statistics (e.g., after a bulk load that drastically changes the data distribution) are a common real-world cause of the optimizer picking a bad plan — it's reasoning from an outdated picture of the data. Running ANALYZE (PostgreSQL/MySQL) or UPDATE STATISTICS (SQL Server) after major data changes is a standard troubleshooting step when a previously-fast query suddenly gets a bad plan.

Low-cardinality columns (booleans, small enums) are usually poor standalone index candidates — an index rarely helps if a typical query still matches a large fraction of the table. They can still be useful as a secondary column in a composite index (e.g., (customer_id, is_active)) where the leading column already narrows things down enough that the low-cardinality column just adds a bit more precision within an already-small result set.

Related Resources

PostgreSQL: Row Estimation Examples

Open as page

EXPLAIN ANALYZE
SELECT c.name, o.total
FROM customers c
JOIN orders o ON o.customer_id = c.id
WHERE c.region = 'EU'
ORDER BY o.total DESC
LIMIT 10;

Example PostgreSQL output (abridged):

Limit  (cost=1520.44..1520.46 rows=10) (actual time=45.2..45.3 rows=10 loops=1)
  ->  Sort  (cost=1520.44..1545.10 rows=9865) (actual time=45.2..45.2 rows=10 loops=1)
        Sort Key: o.total DESC
        ->  Hash Join  (cost=245.00..1290.32 rows=9865) (actual time=3.1..38.7 rows=9910 loops=1)
              Hash Cond: (o.customer_id = c.id)
              ->  Seq Scan on orders o  (cost=0.00..820.00 rows=50000) (actual rows=50000 loops=1)
              ->  Hash  (cost=200.00..200.00 rows=3600) (actual rows=3550 loops=1)
                    ->  Seq Scan on customers c  (cost=0.00..200.00 rows=3600)
                          Filter: (region = 'EU')

How to read it

Indentation = nesting. Innermost/deepest operations execute first; their output feeds the operation above them. Read from the bottom/innermost outward.
cost=startup..total — the optimizer's estimated cost (an arbitrary unit, not milliseconds) to produce the first row and all rows, respectively. Only meaningful for comparing plans against each other on the same engine/config, not as an absolute number.
actual time=... rows=... loops=... — only present with ANALYZE: real measured time, real row counts, and how many times this node executed (relevant inside a nested loop, where an inner node runs once per outer row).

What to look for

Scan type on each table — Seq Scan (full table scan) on a large table you expected to hit an index is the first thing to investigate. Not always wrong (see the selectivity question — sometimes a scan genuinely is cheaper), but worth questioning.
Estimated vs. actual row counts — a huge gap (e.g., estimated 10 rows, actual 50,000) means the optimizer's statistics are stale or a predicate is hard to estimate (like a correlated condition across columns), and it likely picked a suboptimal plan as a result. This is one of the most valuable things ANALYZE gives you that plain EXPLAIN can't.
Nested loops with a high loops count on an expensive inner operation — if the inner side of a nested loop isn't using an index and it runs once per outer row, cost multiplies fast.
Sorts on large row counts — an expensive Sort node (as in the example above) sometimes disappears entirely if an index already provides the required order, avoiding the sort altogether.
The overall top-level total cost/time relative to what you expect for the query's importance — but always validate against the actual time under ANALYZE, not just the estimate.

Engine differences

Every major engine has some form of this: PostgreSQL/MySQL use EXPLAIN/EXPLAIN ANALYZE; SQL Server has both a text/XML plan and the graphical "Actual Execution Plan" in SSMS; Oracle has EXPLAIN PLAN FOR plus DBMS_XPLAN. The concepts (scan types, join algorithms, cost estimates, actual vs. estimated rows) transfer across engines even though the exact syntax and terminology differ.

Caution: EXPLAIN ANALYZE actually executes the query, including any INSERT/UPDATE/DELETE — never run it carelessly against a write statement in production without wrapping it in a transaction you intend to roll back, unless you specifically want the write to happen.

Related Resources

PostgreSQL: Using EXPLAIN

Use the Index, Luke: Execution Plans

Open as page

Indexes are not free — they're a deliberate tradeoff of write cost and storage for read speed, and over-indexing is a genuine, common production problem.

Write amplification

Every INSERT must add an entry to every index on that table. Every UPDATE that touches an indexed column must remove the old index entry and insert a new one (even if the underlying row didn't physically move). Every DELETE must remove entries from every index. A table with 10 indexes turns one logical INSERT into up to 11 physical write operations (1 for the row + 10 for the indexes).

-- If orders has 8 indexes, this single INSERT triggers 9 total index/data writes
INSERT INTO orders (customer_id, total, status, ...) VALUES (...);

Storage and cache pressure

Each index is a full, separate data structure — a table with several large composite indexes can easily have more total on-disk size in its indexes than in the actual table data. This also means the database's memory cache (buffer pool) has to compete between caching hot table data and hot index pages; excess unused indexes waste cache space that could otherwise hold frequently-accessed data.

Confusing the optimizer

More indexes mean more candidate plans for the optimizer to evaluate, and query planning time itself grows (usually negligible, but non-zero on very complex queries). More meaningfully, several overlapping/redundant indexes (e.g., (a), (a, b), and (a, b, c) all existing simultaneously when only (a, b, c) is needed, since it already covers queries that only need a or a, b) waste maintenance cost without adding real query benefit, since a composite index's leading-column prefix already serves those narrower queries.

Lock contention

In engines with more index-level locking overhead, concurrent writers touching the same index page (e.g., inserting into the "hot end" of a sequential index) can serialize on that page, and more indexes multiply the surfaces where this kind of contention can occur.

Regularly audit indexes for ones that are never used by the optimizer (pg_stat_user_indexes in PostgreSQL, sys.dm_db_index_usage_stats in SQL Server) and drop them.
Consolidate overlapping composite indexes into the widest one that covers all the narrower use cases, when column order allows it.
Add indexes deliberately, backed by an actual slow query and its execution plan — not speculatively "just in case."
Remember that a UNIQUE constraint and a FOREIGN KEY each typically create an index implicitly — don't double-count these when reasoning about how many indexes a table "really" has.

Related Resources

Use the Index, Luke: Indexing Tradeoffs

Open as page

-- PostgreSQL: explicit hash index
CREATE INDEX ix_sessions_token_hash ON sessions USING HASH (token);

-- Default (B-tree) index
CREATE INDEX ix_sessions_token_btree ON sessions (token);

Hash index

Applies a hash function to the key and stores the entry in the corresponding bucket. Looking up WHERE token = 'abc123' computes the hash once and jumps directly to the bucket — average O(1), independent of table size.

Limitations:

Equality only. WHERE token = 'x' works; WHERE token > 'x', BETWEEN, ORDER BY token, or LIKE 'x%' cannot use a hash index at all, since hashing destroys any relationship between similar keys' storage locations.
No multi-column ordering benefit — a composite hash index doesn't support the same leading-prefix behavior a composite B-tree index does.
Historically (pre-PostgreSQL 10), hash indexes weren't even crash-safe/WAL-logged, which discouraged their use; this has since been fixed, but B-tree remains the overwhelmingly more common default across the industry regardless.

B-tree index

Maintains keys in sorted order (see the B-tree internals question), supporting equality, range, prefix, and sorted-retrieval queries — a strict superset of what a hash index can do, at the cost of O(log n) instead of O(1) average-case lookup.

Why B-tree wins by default almost everywhere

The performance difference between O(1) and O(log n) is negligible in practice — even a billion-row table has a B-tree height of only about 5-6 levels — while the functionality difference is large: almost every real query workload eventually needs a range scan, a sort, or a prefix match somewhere, which a hash index simply cannot provide. This is why MySQL's default index type (B-tree) and PostgreSQL's default (btree) both make B-tree the assumed choice unless you explicitly ask for something else, and why hash indexes are a niche optimization reserved for confirmed pure-equality workloads (e.g., a session-token lookup table where you genuinely never range-query or sort by the token).

When a hash index can still be worth it

If a specific column is queried only via exact-match equality, is large (long strings), and is under heavy lookup load, a hash index can offer a modest, measurable win — but this should be validated with benchmarking on your actual workload rather than assumed, since the B-tree's O(log n) is already extremely fast in absolute terms.

Related Resources

PostgreSQL: Index Types

Open as page

The problem

CREATE INDEX ix_users_email ON users(email);

-- Non-sargable: the index stores raw 'email' values, not UPPER(email),
-- so the engine can't use the index to jump to matching rows -- it must
-- compute UPPER(email) for every row and compare, i.e. a full scan.
SELECT * FROM users WHERE UPPER(email) = 'ALICE@EXAMPLE.COM';

Common non-sargable patterns:

WHERE UPPER(last_name) = 'SMITH'          -- function wraps the column
WHERE price * 1.1 > 100                    -- arithmetic on the column
WHERE YEAR(order_date) = 2024              -- function wraps the column
WHERE SUBSTRING(phone, 1, 3) = '555'       -- function wraps the column
WHERE '%' || search_term || '%' LIKE name  -- leading wildcard, effectively

Making them sargable

Rewrite to isolate the column — move the transformation to the constant side of the comparison instead:

-- Sargable: 'order_date' itself is compared directly, function only applies to constants
WHERE order_date >= '2024-01-01' AND order_date < '2025-01-01'

-- Instead of price * 1.1 > 100, isolate price:
WHERE price > 100 / 1.1

Or create a matching expression/functional index, if the transformed predicate is unavoidable (e.g., you genuinely need case-insensitive lookups everywhere):

-- PostgreSQL: expression index matches the exact expression used in the query
CREATE INDEX ix_users_email_upper ON users (UPPER(email));

-- Now this query CAN use the index, because the index itself stores UPPER(email)
SELECT * FROM users WHERE UPPER(email) = 'ALICE@EXAMPLE.COM';

(SQL Server achieves the same effect with a computed column plus an index on that column; MySQL supports functional key parts directly since 8.0.13.)

Or normalize the data at write time instead of transforming at read time — e.g., store emails already lowercased in a dedicated column, and compare against that directly, avoiding the need for any function at query time.

Why this trips people up

The predicate looks like it's filtering on an indexed column, and many developers assume "there's an index on email, so this must be fast" without noticing the function wrapped around it defeats that index entirely. Always check EXPLAIN when a seemingly-indexed query is slow — a Seq Scan/full scan despite an apparently relevant index is the classic symptom of a non-sargable predicate.

Related Resources

Use the Index, Luke: Functions and Sargability

Open as page

The relationship

An index is the structure the optimizer could use; statistics are the evidence the optimizer uses to decide whether using it is actually a good idea, and to estimate how many rows each step of a plan will produce. The optimizer is a cost-based planner: it needs quantitative estimates (row counts, selectivity) to compare candidate plans, and those estimates come entirely from statistics, not from the index structure itself.

What statistics typically capture

Total row count and table size.
Number of distinct values per column (used to estimate selectivity — see that question).
A histogram of value frequency buckets, so the optimizer can estimate WHERE price BETWEEN 10 AND 50 more accurately than assuming a uniform distribution across the whole range.
Correlation between a column's order and physical row order (helps estimate range-scan I/O cost).

How they get out of date

-- Bulk load that drastically shifts the data distribution
INSERT INTO orders SELECT * FROM legacy_orders;  -- adds 10 million rows

-- If statistics aren't refreshed, the optimizer still thinks the table
-- is small and the old value distribution still holds -- and may pick
-- a plan (e.g., a nested loop join) that was fine for the old size but
-- is disastrous at the new size.

Most engines auto-update statistics based on a percentage-of-rows-changed threshold (PostgreSQL's autovacuum/autoanalyze, SQL Server's auto-update statistics), but large, fast bulk operations can outrun that threshold's responsiveness, or auto-stats can be disabled in performance-sensitive environments and only run on a schedule.

Manually refreshing statistics

-- PostgreSQL
ANALYZE orders;

-- SQL Server
UPDATE STATISTICS orders;

-- MySQL
ANALYZE TABLE orders;

This is a standard, safe, low-risk first troubleshooting step when a previously-fine query suddenly gets slow after a large data change — often cheaper and faster to try than rewriting the query or adding new indexes, and frequently the actual root cause.

A candidate who says "just add an index" without mentioning statistics is missing half the picture — a perfectly good index can be ignored by the optimizer if the statistics backing its cost estimate are stale, and conversely, refreshing statistics can sometimes fix a bad plan with zero schema changes at all.

Related Resources

PostgreSQL: ANALYZE

Open as page

A structured approach, roughly in order of "cheapest to check, highest signal first":

1. Confirm it's actually the query, not something else

Rule out lock contention (is the query waiting on another transaction's lock, not actually doing slow work?), connection pool exhaustion, or network latency before assuming the query plan itself is the problem. Check pg_stat_activity (PostgreSQL) or equivalent to see if the query is active and burning CPU/IO, or idle in transaction/blocked waiting on a lock.

2. Get the real execution plan

EXPLAIN ANALYZE
SELECT ...;   -- the actual slow query, with representative parameter values

Never guess from just reading the SQL — the optimizer's actual chosen plan (scan types, join order, join algorithm) is often surprising.

3. Compare estimated vs. actual row counts

A large estimate/actual gap at any node is the single most common root cause of a bad plan — it means the optimizer is working from wrong information. This usually points to stale statistics (ANALYZE/UPDATE STATISTICS) or a predicate the optimizer inherently can't estimate well (e.g., a correlated condition across two columns it assumes are independent).

4. Look for the classic culprits, in the plan

Full table scan on a large table where you expected an index seek — check for a missing index, or a non-sargable predicate (function wrapping the column — see that question) defeating an existing one.
Nested loop with a high iteration count on an inner side that isn't indexed — O(n×m) behavior hiding inside what looks like a normal join.
Unnecessary sort — often removable if an index already provides the required order for ORDER BY.
Row-count explosion mid-plan — a sign of an unintended one-to-many join fan-out (see the join explosion question) before aggregation.

5. Apply the smallest fix that addresses the actual bottleneck

Missing index → add the specific index the plan is missing, verify with EXPLAIN ANALYZE again that it's actually chosen and helps.
Non-sargable predicate → rewrite the predicate, or add a matching expression index.
Stale statistics → refresh them; sometimes this alone fixes the plan.
Genuine algorithmic/schema issue (deep pagination via large OFFSET, unnecessary joins, a report needing full aggregation from raw rows) → consider keyset pagination, denormalization, a materialized view, or caching — but only after confirming a simpler index/predicate fix isn't sufficient.

6. Re-measure, don't assume

Always re-run EXPLAIN ANALYZE (and ideally a real load test) after the fix to confirm the plan actually changed and performance actually improved — it's common to "fix" a query in a way that helps the specific parameter values tested but doesn't generalize, or to add an index that the optimizer still doesn't choose to use.

What interviewers are listening for

A methodical, plan-driven process (measure → hypothesize → verify) rather than jumping straight to "add an index" or "add a cache" without first confirming what's actually happening in the plan. Mentioning statistics staleness and sargability specifically signals real production experience, not just textbook knowledge.

Related Resources

PostgreSQL: Using EXPLAIN