How do you approach reducing a bloated Docker image and its build time for a legacy application?

Detailed Answer

This is a practical, judgment-oriented question testing whether a candidate approaches optimization methodically (measure, prioritize, validate) rather than applying every known trick indiscriminately and hoping nothing breaks.

Step 1: measure before changing anything

docker history myapp:legacy --no-trunc
docker images myapp:legacy
time docker build -t myapp:legacy .    # establish a baseline build time

docker history reveals exactly which layers are contributing the most to the image's total size. This often surfaces a surprise (an accidentally-included large dependency cache, a full package manager's index left behind, an unnecessarily broad COPY) that's a much bigger win to fix first than anything more subtle. Never start optimizing based on assumption alone; the actual biggest contributor is often not what you'd guess.

Step 2: apply changes roughly in order of expected impact, validating each independently

Multi-stage build, if the application has any build/compile step at all — often the single biggest win, since it can eliminate entire build-toolchain layers from the final image (see that question).
Base image swap (full → slim, or → Alpine if compatible) — a real, meaningful size reduction, but requires actually testing the application still works correctly afterward (see the caveat below).
Dockerfile instruction reordering for cache efficiency (see that question) doesn't shrink the final image itself, but dramatically speeds up iterative rebuilds. This matters enormously for day-to-day developer and CI velocity, even if the shipped image size is unchanged.
.dockerignore tightening — often a quick, low-risk win, especially for legacy projects that have never had one properly maintained and are likely copying in unnecessary files (old build artifacts, version control history, documentation).

The specific risk with legacy applications: hidden runtime assumptions

FROM node:20-alpine    # switching to alpine...

Error: Cannot find module 'some-native-dependency'
# (a native/compiled dependency built against glibc, incompatible with Alpine's musl libc)

A legacy application, especially one that's been running unchanged for a long time, sometimes has accumulated implicit dependencies on specifics of its original base image that aren't obvious from reading the Dockerfile alone. Examples include a native compiled dependency assuming glibc (breaking under Alpine's musl — see the base-image question), a script assuming a specific shell or utility's exact behavior that differs between distributions, or file paths/permissions the application implicitly relies on. This is exactly why each change should be validated independently with the application's actual test suite (and ideally a staging/canary deployment) before moving to the next optimization. Bundling several changes together, and only then discovering something broke, makes it much harder to identify which specific change caused the regression.

Communicating the plan and tradeoffs to a team

A strong approach also includes framing this work for stakeholders: "here's the current baseline (image size, build time), here's the expected improvement from each specific change, here's the validation plan for each, and here's the rollback plan if a change introduces a regression." This treats image/build optimization as a deliberate, measured engineering effort with a clear before/after story, not just "I made the Dockerfile better" with no quantified evidence.

Reporting the outcome with real numbers

"We reduced the image from 1.2GB to 180MB (mostly via a multi-stage build eliminating the build toolchain), and cut typical incremental rebuild time from 90 seconds to 4 seconds by reordering dependency installation ahead of application code copying. This was validated against the full regression suite and a week of staging traffic before rolling out to production." Concrete, specific numbers make this kind of answer far more convincing than a vague "we made it smaller and faster." They also demonstrate the same measure-first discipline that matters most, precisely because legacy applications hide more unstated assumptions than a project built with today's best practices from scratch.