What are the security risks of `pickle`, `eval`, and `exec`, and how do you avoid them?

`pickle.loads()` on untrusted data can execute **arbitrary code** during deserialization (a pickled object's `__reduce__` can call any function), making it as dangerous as `eval()` on untrusted input — never unpickle data from a source you don't fully trust. `eval`/`exec` run arbitrary Python source directly and should essentially never be used on user-supplied input; use `json`/`ast.literal_eval` for safe parsing, and safer serialization formats (JSON, `msgpack`, protobuf) for data exchange across a trust boundary.

What's the difference between WSGI and ASGI, and why does it matter?

**WSGI** (Web Server Gateway Interface) is the traditional synchronous standard interface between a Python web application and a web server — one request handled per worker thread/process at a time, no native async or WebSocket support. **ASGI** (Asynchronous Server Gateway Interface) is its async-capable successor, supporting `async`/`await`, WebSockets, and long-lived connections, letting a single worker handle many concurrent connections cooperatively via an event loop.

How do Django, Flask, and FastAPI compare, and when would you choose each?

**Django** is a full-featured, "batteries-included" framework (ORM, admin panel, auth, forms) best for content-heavy, database-driven applications built quickly with conventions already decided. **Flask** is a minimal, unopinionated WSGI microframework — you assemble your own stack of extensions, giving maximum flexibility for small services or unconventional architectures. **FastAPI** is a modern, async-first (ASGI) framework built around type hints, automatically generating request validation and OpenAPI docs — the default choice for new, high-performance JSON APIs.

How do you connect to and query databases from Python?

Low-level access goes through the **DB-API 2.0** standard (PEP 249): every major database driver (`psycopg2`/`psycopg` for Postgres, `sqlite3` built-in, `pymysql`) exposes the same `connect()`/`cursor()`/`execute()`/`fetchall()` interface. Most applications instead use an **ORM** (SQLAlchemy, Django ORM) to work with Python objects instead of raw SQL, at the cost of an abstraction layer. Whichever layer you use, always use **parameterized queries**, never string-formatted SQL, to avoid SQL injection.

What are the major differences between Python 2 and Python 3 that still matter today?

The most consequential change: **strings are Unicode by default in Python 3** (`str` is text, `bytes` is binary, with no more implicit, error-prone mixing of the two as in Python 2's `str`/`unicode` split). Other lasting changes: `print` became a function, `/` performs true division by default (`//` for floor division), and iteration-heavy built-ins (`range`, `dict.keys()`, `map`, `filter`) return lazy objects (a lazy sequence, a view, and iterators, respectively) instead of lists. Python 2 reached end-of-life in January 2020.

How do you handle dependency versioning and reproducible builds?

Declare **loose, compatible version ranges** in your project's dependency list (`requests>=2.28,<3.0`) to allow reasonable updates, but **pin exact resolved versions** (including transitive dependencies) in a **lock file** (`poetry.lock`, `Pipfile.lock`, or a `pip-compile`-generated `requirements.txt`) for actual deployments — guaranteeing every environment (dev, CI, production) installs the identical set of package versions, not just versions that happen to satisfy the range at install time.

What are common Python anti-patterns to avoid in production code?

Common ones: mutable default arguments (state shared across calls), bare `except:` clauses (which swallow everything, including `SystemExit`/`KeyboardInterrupt`), using `pickle`/`eval` on untrusted data, wildcard imports (`from module import *`, which pollute the namespace and hide where names come from), catching exceptions just to `pass` silently, and using a mutable class attribute where an instance attribute was intended.

How do you package and deploy a Python application?

For a library, build a **wheel** (`python -m build`) and publish it to PyPI (or a private index) so it can be installed via `pip`. For a deployable application/service, the standard modern approach is a **Docker container** with dependencies pinned via a lock file, giving a reproducible runtime environment independent of the host machine's Python version or installed packages — orchestrated via Kubernetes, a PaaS (Heroku, Fly.io), or a serverless platform depending on the workload.

Python in Production, Security & Ecosystem

Q: How should you manage secrets and configuration in a Python application?

Never hardcode secrets (API keys, database passwords, tokens) in source code or commit them to version control — load them from **environment variables** (via `os.environ`, `python-dotenv` for local development) or a dedicated **secrets manager** (AWS Secrets Manager, HashiCorp Vault, environment injection from your deployment platform) at runtime, and keep non-secret configuration separate (a `.env`/config file that's safe to commit, or environment-specific settings modules) from actual credentials.

Security pitfalls, WSGI/ASGI, web framework choices, database access, dependency management, and deployment.

Why unpickling untrusted data is a full remote-code-execution risk

import pickle
import os

class Exploit:
    def __reduce__(self):
        return (os.system, ("echo pwned; rm -rf /tmp/demo",))

payload = pickle.dumps(Exploit())

# Anywhere this runs on untrusted input:
pickle.loads(payload)   # actually executes os.system(...) during unpickling!

__reduce__ is a legitimate protocol pickle uses to know how to reconstruct an object — but it can name any callable, and unpickling calls it. There is no way to "sandbox" pickle.loads() against a maliciously crafted payload; the official docs state plainly: never unpickle data received from an untrusted or unauthenticated source. This is the reason cache backends, message queues, or APIs that use pickle for convenience are a known attack surface if any external input can reach them.
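
A harmless way to observe this is to swap the shell command for a benign callable. The sketch below uses a hypothetical `Demo` class (not part of the example above) whose `__reduce__` names the built-in `eval` with a fixed arithmetic string; it is only meant to show that `pickle.loads()` calls whatever the payload names.

```python
import pickle


class Demo:
    """Hypothetical class whose __reduce__ names a benign callable."""

    def __reduce__(self):
        # pickle records "call eval('6 * 7')" instead of the object's state.
        return (eval, ("6 * 7",))


payload = pickle.dumps(Demo())

# Unpickling calls the named function and returns ITS result, not a Demo.
result = pickle.loads(payload)
print(type(result).__name__, result)
```

Note that the value coming back is not even a `Demo` instance: the payload, not the receiving code, decided what ran.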

`eval()`/`exec()`: running arbitrary source directly

user_input = "__import__('os').system('rm -rf /')"
eval(user_input)   # executes it -- catastrophic if user_input is attacker-controlled

eval (expressions) and exec (statements) execute Python source text directly, with the full power of the language — passing any externally-influenced string to either is effectively giving that input author full code execution in your process.

The safe alternatives

import json
import ast

# Safe: parsing structured data
user_json_string = '{"user_id": 1, "action": "login"}'   # stand-in for untrusted input
data = json.loads(user_json_string)              # only produces JSON-compatible values

# Safe: parsing a Python LITERAL (not arbitrary code)
value = ast.literal_eval("[1, 2, {'a': True}]")    # only literals -- no function calls, no imports
ast.literal_eval("__import__('os').system('x')")     # raises ValueError -- not a literal, rejected

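json.loads only ever produces plain data (dicts, lists, strings, numbers, booleans, None); it cannot execute anything. ast.literal_eval is a restricted alternative to eval that evaluates only Python literals (numbers, strings, bytes, tuples, lists, dicts, sets, booleans, None) and rejects anything resembling a function call or attribute access. It never runs code, but the Python docs warn that a sufficiently large or deeply nested input can still exhaust memory or the interpreter's stack, so cap the input size when the source is untrusted.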
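
One way to package this for untrusted input is a small wrapper. This is a minimal sketch: `parse_literal` and `MAX_LITERAL_LENGTH` are hypothetical names, and the length cap is an arbitrary value chosen for illustration.

```python
import ast

MAX_LITERAL_LENGTH = 10_000  # arbitrary cap chosen for this sketch


def parse_literal(text):
    """Parse a Python literal from untrusted text, or raise ValueError."""
    if len(text) > MAX_LITERAL_LENGTH:
        raise ValueError("input too long")
    try:
        return ast.literal_eval(text)
    except (ValueError, TypeError, SyntaxError, MemoryError, RecursionError) as exc:
        # Normalize every documented failure mode into one exception type.
        raise ValueError(f"not a valid literal: {exc}") from exc


print(parse_literal("[1, 2, {'a': True}]"))

try:
    parse_literal("__import__('os').system('x')")
except ValueError as exc:
    print("rejected:", exc)
```

Collapsing the failure modes into a single `ValueError` keeps callers from having to know which exception a given malformed input triggers.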

For serialization across a trust boundary, avoid pickle entirely

# Instead of pickling to send data between services / store in a shared cache:
import json
data = json.dumps({"user_id": 1, "action": "login"})

# For richer/faster binary serialization with the same "no code execution" safety:
# msgpack, protobuf, or a schema-validated format (pydantic models -> JSON)

pickle is appropriate only for trusted, same-process or same-organization data you fully control (e.g., caching your own computed Python objects to local disk) — never for data crossing a trust boundary (received from a network request, a third-party queue, user uploads, or any source you don't fully control end to end).
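
For the trusted, same-organization case, the pickle documentation suggests signing the data with `hmac` so tampering can be detected before unpickling. The following is a rough sketch under that assumption; `dump_signed`, `load_signed`, and `SECRET_KEY` are hypothetical names, and the key shown is a placeholder that would really come from a secrets store.

```python
import hashlib
import hmac
import pickle

SECRET_KEY = b"placeholder-key-for-this-sketch"  # load from a secrets store in practice
TAG_SIZE = hashlib.sha256().digest_size          # 32 bytes for SHA-256


def dump_signed(obj):
    """Pickle obj and prepend an HMAC-SHA256 tag over the pickled bytes."""
    data = pickle.dumps(obj)
    tag = hmac.new(SECRET_KEY, data, hashlib.sha256).digest()
    return tag + data


def load_signed(blob):
    """Verify the tag first; unpickle only if it matches."""
    tag, data = blob[:TAG_SIZE], blob[TAG_SIZE:]
    expected = hmac.new(SECRET_KEY, data, hashlib.sha256).digest()
    if not hmac.compare_digest(tag, expected):
        raise ValueError("signature mismatch: refusing to unpickle")
    return pickle.loads(data)


blob = dump_signed({"user_id": 1, "action": "login"})
print(load_signed(blob))

tampered = blob[:-1] + bytes([blob[-1] ^ 0x01])  # flip one bit of the payload
try:
    load_signed(tampered)
except ValueError as exc:
    print(exc)
```

Signing only proves the bytes came from a holder of the key. It does not make pickle safe for data from parties who should not be trusted with code execution.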

Interview-ready summary: pickle.loads() on untrusted data is equivalent to arbitrary code execution, because a crafted payload's __reduce__ can invoke any callable during deserialization; never unpickle data you don't fully trust. eval/exec on any externally-influenced string is the same class of risk. Use json/ast.literal_eval for safe parsing, and JSON/msgpack/protobuf instead of pickle for any data that crosses a trust boundary.

Related Resources

pickle — Python docs (security warning)

ast.literal_eval — Python docs

The mistake: hardcoded secrets in source

# DON'T -- committed to git, visible in history forever, even if later "removed"
DATABASE_PASSWORD = "hunter2"
API_KEY = "sk-live-abc123..."

Once a secret is committed to version control, it's in the repository's history permanently (removing it from the latest commit doesn't remove it from history) — anyone with read access to the repo, now or in the future, can find it. This is one of the most common real-world causes of security incidents.

Loading from environment variables

import os

DATABASE_PASSWORD = os.environ["DATABASE_PASSWORD"]   # raises KeyError if missing -- fail loudly
API_KEY = os.environ.get("API_KEY")                     # returns None if unset; pass a second argument for a fallback

Environment variables keep secrets out of the codebase entirely — they're injected at deploy/runtime by the hosting platform, CI secrets store, or orchestration system (Kubernetes secrets, systemd environment files), and never touch the repository.
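
A small helper can make the "fail loudly" behaviour more readable than a bare `KeyError`. This is a sketch only: `require_env` and `env_int` are hypothetical helpers, and the `DEMO_*` variable names exist purely for the demonstration.

```python
import os


def require_env(name):
    """Return the variable's value, or fail with a message naming it."""
    try:
        return os.environ[name]
    except KeyError:
        raise RuntimeError(f"missing required environment variable: {name}") from None


def env_int(name, default):
    """Read an optional integer setting, falling back to a default."""
    raw = os.environ.get(name)
    return default if raw is None else int(raw)


# Simulate what the deployment platform would inject (demo values only).
os.environ["DEMO_DATABASE_PASSWORD"] = "not-a-real-password"
os.environ.pop("DEMO_POOL_SIZE", None)

print(require_env("DEMO_DATABASE_PASSWORD"))
print(env_int("DEMO_POOL_SIZE", 5))
```

Calling `require_env` for every mandatory value at startup surfaces a misconfigured deployment immediately rather than on the first request that needs the secret.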

Local development: `.env` files (never committed)

# .env  (in .gitignore -- never committed!)
DATABASE_PASSWORD=local-dev-password
API_KEY=sk-test-...

from dotenv import load_dotenv
load_dotenv()   # reads .env into os.environ, for local dev convenience

import os
password = os.environ["DATABASE_PASSWORD"]

python-dotenv loads a local .env file into the process environment, giving the same os.environ access pattern locally as in production — critically, .env must be listed in .gitignore, and a .env.example (with placeholder, non-real values) is committed instead to document what variables are needed.

Dedicated secrets managers for production

import boto3

client = boto3.client("secretsmanager")
secret = client.get_secret_value(SecretId="prod/db-password")["SecretString"]

For production systems, a dedicated secrets manager (AWS Secrets Manager, HashiCorp Vault, Google Secret Manager) adds capabilities plain environment variables don't offer: access auditing (who fetched which secret, when), automatic rotation, and fine-grained access control per service — worth the added complexity for anything beyond small applications.

Separating secrets from non-secret configuration

# settings.py -- safe to commit; no actual secrets here
import os

DEBUG = os.environ.get("DEBUG", "false").lower() == "true"
DATABASE_HOST = os.environ.get("DATABASE_HOST", "localhost")
DATABASE_PASSWORD = os.environ["DATABASE_PASSWORD"]   # the actual secret, injected at runtime

Non-sensitive configuration (feature flags, hostnames, timeouts) can reasonably live in a committed settings file with sensible defaults; only the genuinely sensitive values need to come exclusively from the environment/secrets manager with no committed default at all.
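
The same split can be expressed as a typed settings object. The sketch below assumes a hypothetical `Settings` dataclass with a `from_env` constructor that accepts any mapping (such as `os.environ`), so it can be exercised without touching the real environment.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Settings:
    debug: bool
    database_host: str
    # repr=False keeps the secret out of logs that print the object.
    database_password: str = field(repr=False)

    @classmethod
    def from_env(cls, env):
        """Build settings from a mapping such as os.environ."""
        return cls(
            debug=env.get("DEBUG", "false").lower() == "true",
            database_host=env.get("DATABASE_HOST", "localhost"),
            database_password=env["DATABASE_PASSWORD"],  # no default on purpose
        )


settings = Settings.from_env({"DATABASE_PASSWORD": "demo-only"})
print(settings)
```

Non-secret fields carry defaults, the secret has none, and printing the object does not reveal the password.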

A useful checklist

Add .env, *.pem, credentials.json, etc. to .gitignore from day one.
Use pre-commit secret-scanning hooks (detect-secrets, gitleaks) to catch accidental commits before they happen.
Rotate any secret that was ever accidentally committed — removing it from the latest commit is not sufficient; treat it as compromised.

Interview-ready summary: Secrets belong in environment variables or a dedicated secrets manager, injected at runtime — never hardcoded in source or committed to version control, since git history is effectively permanent. Use .env files (gitignored) for local development convenience, and treat any secret that was ever committed as compromised and due for rotation.

Related Resources

The Twelve-Factor App — Config

python-dotenv

WSGI: the synchronous standard

def application(environ, start_response):
    status = "200 OK"
    headers = [("Content-Type", "text/plain")]
    start_response(status, headers)
    return [b"Hello, World!"]

A WSGI application is literally a callable matching this signature — environ describes the incoming request, start_response sends back the status/headers, and the return value is the response body. Every production WSGI setup (Flask, Django's traditional mode, running under Gunicorn/uWSGI) is built on this one synchronous, blocking-call contract: one request occupies one worker (thread or process) until it's fully handled.
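
Because the contract is just a function call, the application can be exercised without any server. This sketch plays the server's role by hand, using the standard library's `wsgiref.util.setup_testing_defaults` to build a minimal environ; the `captured` dict and the recording `start_response` are illustrative stand-ins.

```python
from wsgiref.util import setup_testing_defaults


def application(environ, start_response):
    start_response("200 OK", [("Content-Type", "text/plain")])
    return [b"Hello, World!"]


captured = {}


def start_response(status, headers, exc_info=None):
    # A real server would write these to the socket; here they are recorded.
    captured["status"] = status
    captured["headers"] = headers


environ = {}
setup_testing_defaults(environ)  # fills in a minimal, valid WSGI environ

body = b"".join(application(environ, start_response))
print(captured["status"], body)
```

The call returns only once the whole response is ready, which is the blocking behaviour the rest of this section builds on.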

Why WSGI's synchronous model limits concurrency

import time
from django.http import HttpResponse

# A slow (Django-style) view blocks the entire worker handling it
def slow_view(request):
    time.sleep(5)         # this worker can't serve ANY other request meanwhile
    return HttpResponse("done")

Scaling a WSGI application to handle more concurrent slow requests means adding more worker processes/threads (each with real memory overhead) — there's no way for a single WSGI worker to cooperatively juggle many in-flight requests the way an event loop can.

ASGI: the async-capable successor

async def application(scope, receive, send):
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [(b"content-type", b"text/plain")],
    })
    await send({"type": "http.response.body", "body": b"Hello, World!"})

ASGI applications are async callables built around ASGI's own scope/receive/send message-passing interface: scope describes the connection, receive yields incoming events, and send emits outgoing ones. A single worker process running an event loop can hold thousands of concurrent connections open (including long-lived ones like WebSockets or Server-Sent Events, which WSGI has no first-class way to represent at all) as long as each one spends most of its time await-ing rather than blocking.
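
To make the message flow concrete, the application can be driven by hand. In this sketch the hypothetical `call_once` helper stands in for a server such as Uvicorn, and the scope dict is trimmed to a few keys rather than being a complete ASGI scope.

```python
import asyncio


async def application(scope, receive, send):
    await send({
        "type": "http.response.start",
        "status": 200,
        "headers": [(b"content-type", b"text/plain")],
    })
    await send({"type": "http.response.body", "body": b"Hello, World!"})


async def call_once(app):
    """Play the role of the server for a single fake HTTP request."""
    sent = []

    async def receive():
        return {"type": "http.request", "body": b"", "more_body": False}

    async def send(message):
        sent.append(message)

    scope = {"type": "http", "method": "GET", "path": "/", "headers": []}
    await app(scope, receive, send)
    return sent


messages = asyncio.run(call_once(application))
print([m["type"] for m in messages])
```

Each `await` is a point where the event loop may switch to another connection, which is what lets one worker interleave many requests.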

Framework alignment

Framework	Interface	Notes
Flask (classic)	WSGI	Synchronous by design; can run under Gunicorn
Django (traditional views)	WSGI	Async views supported since Django 3.1, running under ASGI
FastAPI	ASGI	Built async-first, typically served by Uvicorn/Hypercorn
Starlette	ASGI	The lightweight ASGI toolkit FastAPI itself is built on

Why this distinction matters practically

Choosing WSGI vs ASGI isn't just a framework preference — it determines whether the application can efficiently support WebSockets, long-polling, or very high connection counts with modest resource usage. A traditional synchronous CRUD app with modest concurrency needs is often perfectly well served by WSGI (simpler mental model, mature tooling); an app needing real-time features or very high concurrent connection counts benefits substantially from ASGI's async model.
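
The resource argument can be illustrated with plain asyncio. This sketch uses a hypothetical `handle_request` coroutine where `asyncio.sleep` stands in for waiting on a database or upstream service. It involves no real web server and the timing is indicative only.

```python
import asyncio
import time


async def handle_request(delay):
    # Stands in for awaiting a database or upstream HTTP call.
    await asyncio.sleep(delay)
    return "done"


async def serve_many(count, delay):
    return await asyncio.gather(*(handle_request(delay) for _ in range(count)))


start = time.perf_counter()
results = asyncio.run(serve_many(count=100, delay=0.05))
elapsed = time.perf_counter() - start

# 100 blocking handlers run back to back on one synchronous worker
# would need about 100 * 0.05 = 5 seconds.
print(f"{len(results)} requests in {elapsed:.2f}s")
```

The waits overlap, so the total stays close to a single delay. This only holds while handlers await rather than block; one `time.sleep` inside a handler would stall every other request on the loop.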

Interview-ready summary: WSGI is the synchronous, one-request-per-worker standard interface web servers and Python apps have used for decades; ASGI is its async successor, enabling a single worker to cooperatively handle many concurrent (including long-lived, WebSocket) connections via async/await. The choice determines whether the application's concurrency model can scale via an event loop or only via adding more OS-level workers.

Related Resources

PEP 3333 – WSGI

ASGI documentation

Django: batteries included

# models.py
from django.db import models

class Article(models.Model):
    title = models.CharField(max_length=200)
    body = models.TextField()
    published_at = models.DateTimeField(auto_now_add=True)

Django ships an ORM, a migration system, an admin panel generated automatically from your models, authentication/authorization, a templating engine, and form handling — all designed to work together out of the box, following the "convention over configuration" philosophy. Best fit: content-driven sites, internal tools, and applications where you want a mature, opinionated full-stack framework so you're not assembling and gluing together a dozen separate pieces yourself.

Flask: minimal and unopinionated

from flask import Flask, jsonify

app = Flask(__name__)

@app.route("/users/<int:user_id>")
def get_user(user_id):
    return jsonify({"id": user_id, "name": "Ada"})

Flask provides routing and request/response handling and essentially nothing else by default — you choose your own ORM (SQLAlchemy is common), your own auth solution, your own validation library. This flexibility is the point: Flask fits well for small services, APIs with unconventional requirements, or teams that want full control over which pieces go into their stack rather than accepting Django's defaults.

FastAPI: async-first, type-hint-driven

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class User(BaseModel):
    name: str
    age: int

@app.post("/users")
async def create_user(user: User):   # request body validated automatically from the type hint
    return {"id": 1, **user.model_dump()}

FastAPI uses Python type hints (via Pydantic) to automatically validate incoming request data, serialize responses, and generate interactive OpenAPI/Swagger documentation — all with minimal boilerplate compared to manually validating input in Flask/Django views. Being ASGI-native, it handles high-concurrency async workloads (calling other services, databases) efficiently without extra configuration.

Comparison at a glance

	Django	Flask	FastAPI
Philosophy	batteries-included	minimal, unopinionated	modern, type-hint-driven
Interface	WSGI (ASGI for async views since 3.1)	WSGI	ASGI (async-first)
ORM	built-in	bring your own (commonly SQLAlchemy)	bring your own
Auto validation/docs	forms + DRF (for APIs)	manual / extensions	built-in (Pydantic + OpenAPI)
Best for	full-stack apps, admin-heavy tools	small services, custom stacks	high-performance JSON APIs
Learning curve	steeper upfront, faster after	gentle, scales with complexity	gentle, strong typing payoff

Practical decision guide

Building a content-heavy site or an internal admin-driven tool quickly, backed by a database, and you want conventions already decided → Django.
Building a small service or need full control over exactly which libraries make up the stack → Flask.
Building a new JSON API, especially one needing high concurrency, automatic validation, and generated docs → FastAPI.

Interview-ready summary: Django trades flexibility for productivity via a complete, opinionated stack (ORM, admin, auth) best suited to full-stack, database-driven apps. Flask trades built-in features for flexibility, suiting small or custom-architected services. FastAPI is the modern default for high-performance async JSON APIs, using type hints for automatic validation and documentation generation.

Related Resources

Django documentation

FastAPI documentation

Flask documentation

The DB-API 2.0 standard: a common low-level interface

import sqlite3

conn = sqlite3.connect("app.db")
cursor = conn.cursor()
cursor.execute("SELECT id, name FROM users WHERE age > ?", (18,))
rows = cursor.fetchall()
conn.close()

Every DB-API-compliant driver (sqlite3 built-in, psycopg2/psycopg for PostgreSQL, pymysql/mysqlclient for MySQL) exposes the same shape: connect() returns a connection, .cursor() gets a cursor, .execute(sql, params) runs a query, and .fetchall()/.fetchone() retrieve results — learning this pattern once transfers across database backends.
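
A self-contained variant of the same flow, using an in-memory SQLite database so it runs anywhere. The table contents are made up for illustration, and `conn.execute`/`conn.executemany` are sqlite3 convenience shortcuts rather than part of PEP 249 itself.

```python
import sqlite3
from contextlib import closing

# closing() guarantees conn.close() even if a query raises.
with closing(sqlite3.connect(":memory:")) as conn:
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, name TEXT, age INTEGER)")
    conn.executemany(
        "INSERT INTO users (name, age) VALUES (?, ?)",
        [("Ada", 36), ("Linus", 17), ("Grace", 45)],
    )
    conn.commit()

    cursor = conn.cursor()
    cursor.execute("SELECT name FROM users WHERE age > ? ORDER BY name", (18,))
    adults = [row[0] for row in cursor.fetchall()]

print(adults)
```

Swapping in another DB-API driver would mostly mean changing the `connect()` call and the placeholder style (`?` versus `%s`).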

SQL injection: the critical vulnerability to avoid

# NEVER do this -- string formatting builds SQL from untrusted input
name = "'; DROP TABLE users; --"
cursor.execute(f"SELECT * FROM users WHERE name = '{name}'")   # SQL INJECTION!

# ALWAYS use parameterized queries -- the driver handles escaping safely
cursor.execute("SELECT * FROM users WHERE name = ?", (name,))   # safe, regardless of content

String-interpolating user input directly into SQL lets an attacker inject arbitrary SQL (as the classic "Bobby Tables" example shows). Parameterized queries (? or %s placeholders, driver-dependent syntax) pass the values separately from the SQL text, so the driver or database treats them strictly as data and never parses them as SQL, regardless of what they contain. This is not optional hardening; it's the baseline requirement for any code that builds a query using data from outside the program.
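
The difference is easy to reproduce against an in-memory SQLite database. The table, rows, and the `malicious` string below are invented for the demonstration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, is_admin INTEGER)")
conn.executemany("INSERT INTO users VALUES (?, ?)", [("ada", 1), ("bob", 0)])

malicious = "nobody' OR '1'='1"

# Vulnerable: the quote inside the input ends the string literal early,
# so the OR clause becomes part of the SQL and matches every row.
leaked = conn.execute(
    f"SELECT name FROM users WHERE name = '{malicious}'"
).fetchall()

# Safe: the whole input is bound as one value and compared literally.
safe = conn.execute(
    "SELECT name FROM users WHERE name = ?", (malicious,)
).fetchall()

conn.close()
print(len(leaked), len(safe))
```

The formatted query returns every row because the input rewrote the WHERE clause; the parameterized query returns none because no user has that literal name.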

ORMs: working with objects instead of raw SQL

from sqlalchemy import create_engine, select
from sqlalchemy.orm import Session, DeclarativeBase, Mapped, mapped_column

class Base(DeclarativeBase):
    pass

class User(Base):
    __tablename__ = "users"
    id: Mapped[int] = mapped_column(primary_key=True)
    name: Mapped[str]
    age: Mapped[int]

engine = create_engine("postgresql://localhost/app")
with Session(engine) as session:
    users = session.scalars(select(User).where(User.age > 18)).all()

SQLAlchemy (and Django's built-in ORM) let you query and manipulate data as Python objects/classes instead of writing raw SQL strings, and they bind values as parameters automatically, so queries built through the ORM's own API are protected from SQL injection (raw SQL fragments you format yourself and pass through the ORM are not). The tradeoff: an abstraction layer that can generate inefficient queries if misused (the classic N+1 query problem: fetching a list, then separately querying related data for each item in a loop) and a learning curve of its own.
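
The N+1 pattern can be made visible by counting statements. To stay dependency-free, this sketch swaps the ORM for the standard library's sqlite3 and its `set_trace_callback` hook; the `authors`/`books` schema is invented, and an ORM would emit comparable queries when related objects are lazily loaded inside a loop.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE books (id INTEGER PRIMARY KEY, author_id INTEGER, title TEXT);
    INSERT INTO authors VALUES (1, 'Ada'), (2, 'Grace'), (3, 'Linus');
    INSERT INTO books VALUES (1, 1, 'Notes'), (2, 2, 'COBOL'), (3, 3, 'Git');
""")

selects = []


def record(sql):
    # sqlite3 calls this hook with the text of each statement it runs.
    if sql.lstrip().upper().startswith("SELECT"):
        selects.append(sql)


conn.set_trace_callback(record)

# N+1 pattern: one query for the list, then one more per row.
for author_id, _name in conn.execute("SELECT id, name FROM authors").fetchall():
    conn.execute("SELECT title FROM books WHERE author_id = ?", (author_id,)).fetchall()
n_plus_one_count = len(selects)

selects.clear()
# Single JOIN: the same data in one round trip.
conn.execute(
    "SELECT a.name, b.title FROM authors a JOIN books b ON b.author_id = a.id"
).fetchall()
join_count = len(selects)

conn.close()
print(n_plus_one_count, join_count)
```

With three authors the loop issues one query for the list plus one per author, and the count grows with the table. ORMs offer eager-loading options (for example `select_related`/`prefetch_related` in Django) to collapse this back into one or two queries.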

Connection pooling for production

engine = create_engine("postgresql://localhost/app", pool_size=10, max_overflow=5)

Opening a new database connection per request is expensive; production applications use a connection pool (built into SQLAlchemy's engine, or a standalone pooler like PgBouncer for Postgres) that reuses a fixed set of open connections across requests, dramatically reducing per-request connection overhead.
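
The core idea of a pool fits in a few lines. The `SimplePool` class below is a toy illustration only, not how SQLAlchemy or PgBouncer are implemented: real pools also handle stale connections, overflow, and resetting transaction state on return.

```python
import queue
import sqlite3
from contextlib import contextmanager


class SimplePool:
    """Toy pool: opens `size` connections up front and hands them out."""

    def __init__(self, size):
        self._idle = queue.Queue()
        for _ in range(size):
            self._idle.put(sqlite3.connect(":memory:", check_same_thread=False))

    @contextmanager
    def acquire(self, timeout=5.0):
        conn = self._idle.get(timeout=timeout)  # blocks while all are in use
        try:
            yield conn
        finally:
            self._idle.put(conn)  # hand it back instead of closing it


pool = SimplePool(size=1)

with pool.acquire() as first:
    first_id = id(first)
with pool.acquire() as second:
    second_id = id(second)

print(first_id == second_id)
```

Both blocks receive the same connection object, so the cost of opening it was paid once rather than per request.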

Async database access

import asyncpg

async def get_users(pool):
    async with pool.acquire() as conn:
        return await conn.fetch("SELECT * FROM users WHERE age > $1", 18)

Standard DB-API drivers are synchronous/blocking, which would stall an asyncio event loop — async applications (FastAPI, etc.) use async-native drivers (asyncpg, aiomysql) or an async-compatible ORM layer (SQLAlchemy's async engine) instead.

Interview-ready summary: DB-API 2.0 gives a consistent low-level interface across database drivers; most applications build on an ORM (SQLAlchemy, Django ORM) for productivity, understanding the tradeoff of an abstraction layer that can hide inefficient query patterns like N+1. Regardless of layer, always use parameterized queries — never string-format untrusted input into SQL — to avoid SQL injection.

Related Resources

PEP 249 – Python Database API Specification

SQLAlchemy documentation

The biggest change: text vs binary data

# Python 2 (historical) -- 'str' was actually a byte string; 'unicode' was text
s = "hello"        # bytes, in Python 2
u = u"hello"        # explicit unicode, in Python 2
s + u                 # implicit ASCII decode of s -- fine for ASCII, UnicodeDecodeError on non-ASCII bytes

# Python 3 -- unambiguous
s = "hello"    # str -- ALWAYS Unicode text
b = b"hello"    # bytes -- ALWAYS binary data
s + b            # TypeError: can only concatenate str (not "bytes") to str -- caught immediately, not silently wrong

Python 2's implicit str/unicode mixing was a constant, subtle source of UnicodeDecodeError crashes in production whenever non-ASCII data appeared somewhere unexpected. Python 3 makes the distinction explicit and enforced at the type level: you must deliberately .encode()/.decode() to cross between text and bytes, which surfaces the issue immediately during development instead of as an intermittent production bug.
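
A short round trip shows what crossing the boundary looks like in Python 3. The sample word is arbitrary, and UTF-8 is assumed as the encoding.

```python
text = "caf\u00e9"                  # "café", written with an escape for clarity

encoded = text.encode("utf-8")      # str -> bytes
decoded = encoded.decode("utf-8")   # bytes -> str

# 4 characters but 5 bytes: the accented letter takes two bytes in UTF-8.
print(len(text), len(encoded))
print(encoded)

# Decoding with the wrong codec fails loudly instead of guessing.
try:
    encoded.decode("ascii")
except UnicodeDecodeError as exc:
    print("decode failed:", exc.reason)
```

Naming the encoding explicitly at every I/O boundary (files, sockets, subprocess output) avoids depending on platform defaults.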

`print` as a function, not a statement

# Python 2
print "hello"

# Python 3
print("hello")

Making print a real function (rather than special statement syntax) allows it to accept keyword arguments (sep, end, file, flush) and be passed around/reassigned like any other function — a small but representative example of Python 3's broader push toward consistency.
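
A brief illustration of those keyword arguments, writing into an in-memory buffer so the result can be inspected. The name `emit` is just an alias chosen for the example.

```python
import io

buffer = io.StringIO()

# sep and end replace the default space and newline; file redirects the output.
print("a", "b", "c", sep="-", end="!\n", file=buffer)

# Because print is an ordinary function, it can be aliased or passed around.
emit = print
emit("logged", file=buffer)

captured_text = buffer.getvalue()
print(repr(captured_text))
```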

True division by default

# Python 2
5 / 2    # 2  -- integer division by default (surprising for many)
5 / 2.0   # 2.5

# Python 3
5 / 2    # 2.5  -- true division by default
5 // 2    # 2    -- floor division, now an explicit, separate operator

Python 2's / silently performed integer division when both operands were ints — a frequent source of subtle bugs when a variable that was expected to be a float turned out to be an int. Python 3 splits this into two unambiguous operators.
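
One detail worth seeing in running code is how floor division treats negative numbers, since it differs from truncation. The variable names here are illustrative.

```python
true_div = 7 / 2                      # 3.5: "/" always returns a float
floor_div = 7 // 2                    # 3
floor_neg = -7 // 2                   # -4: floors toward negative infinity
truncated = int(-7 / 2)               # -3: truncates toward zero, for comparison
quotient, remainder = divmod(-7, 2)   # (-4, 1), consistent with //

print(true_div, floor_div, floor_neg, truncated, (quotient, remainder))
```

Code ported from languages where integer division truncates toward zero (C, Java) can differ by one on negative operands.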

Lazy iterators instead of eager lists

# Python 2
range(10)         # a full list: [0, 1, 2, ..., 9]  -- built immediately in memory
dict.keys()        # a list

# Python 3
range(10)          # a range object -- lazy, O(1) memory regardless of size
dict.keys()         # a view object -- reflects live changes to the dict, no list copy
map(f, items)        # a lazy iterator, not an eagerly built list

This shift (also affecting filter, zip) reflects Python 3's general preference for laziness by default, improving memory efficiency for large ranges/collections — code relying on range(...) behaving like an indexable, sliceable list still mostly works (since range supports indexing/slicing), but code relying on it being an actual list (e.g., isinstance(x, list)) breaks.
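
The practical consequences show up in a few lines. The values are arbitrary, and the single-use behaviour of `map` is the part that most often surprises.

```python
r = range(10)
indexed, sliced, is_list = r[2], list(r[2:5]), isinstance(r, list)
print(indexed, sliced, is_list)

squares = map(lambda n: n * n, [1, 2, 3])
first_pass = list(squares)
second_pass = list(squares)   # the iterator is already exhausted
print(first_pass, second_pass)

config = {"a": 1}
keys = config.keys()
config["b"] = 2               # the view reflects the change without a new call
live_keys = list(keys)
print(live_keys)
```

When a real list is needed (to iterate twice, or to snapshot keys before mutating the dict), wrapping the call in `list(...)` restores the eager behaviour.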

Why this still matters today

Python 2 reached official end-of-life on January 1, 2020: no more security patches, and most major libraries (NumPy, Django, and virtually the entire PyPI ecosystem) had dropped Python 2 support by then or did so shortly after. It matters in interviews mainly as historical/foundational knowledge: understanding why Python 3 made these changes (especially the Unicode/bytes split) demonstrates a solid grasp of Python's string and type model that's directly relevant even though nobody should be writing new Python 2 code today.

Interview-ready summary: The Unicode/bytes split (str is always text, bytes is always binary, no implicit mixing) is the change with the most lasting practical impact, eliminating a whole class of Python 2 encoding bugs. print() as a function, true division by default, and lazy iterators instead of eagerly built lists rounded out Python 3's broader push toward consistency and correctness — Python 2 itself is long past end-of-life (January 2020) and shouldn't appear in new code.

Related Resources

What's New In Python 3.0

The problem: unpinned dependencies drift over time

# requirements.txt -- loose, no pinning
requests
django

pip install -r requirements.txt   # today: gets requests 2.31, django 4.2
# ... three months later, on a fresh machine ...
pip install -r requirements.txt    # gets requests 2.32, django 5.0 -- different versions!

Without pinning, "the same" requirements.txt can resolve to entirely different package versions depending on when it's installed — a transitive dependency's new release could introduce a breaking change or a subtle behavior difference, and the bug only shows up on a fresh install (a new developer's machine, a rebuilt CI image, a redeployed production server), not in the environment where it was originally tested.
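
A rough way to spot this drift risk is to flag requirement lines that lack an exact `==` pin. The `unpinned_lines` helper below is hypothetical and deliberately simplified: it ignores hashes, URLs, and editable installs, and a real check would use a proper requirements parser.

```python
import re

# "name==version", optionally with extras such as name[extra]==version.
PINNED = re.compile(r"[A-Za-z0-9._-]+(\[[^\]]+\])?\s*==\s*[A-Za-z0-9.!+_-]+")


def unpinned_lines(requirements_text):
    """Return requirement lines that are not pinned to one exact version."""
    flagged = []
    for raw in requirements_text.splitlines():
        # Drop comments and environment markers, then surrounding whitespace.
        line = raw.split("#", 1)[0].split(";", 1)[0].strip()
        if not line or line.startswith("-"):   # skip blanks and pip options
            continue
        if not PINNED.fullmatch(line):
            flagged.append(line)
    return flagged


sample = """
requests
django>=4.2,<5.0
urllib3==2.0.7
"""
print(unpinned_lines(sample))
```

Run against a deployment's requirements file in CI, a check like this turns "someone forgot to pin" into a failed build instead of a surprise on the next fresh install.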

The fix: separate "what I depend on" from "what I actually installed"

# pyproject.toml -- loose ranges, expressing compatibility intent
[project]
dependencies = [
    "requests>=2.28,<3.0",
    "django>=4.2,<5.0",
]

# poetry.lock / Pipfile.lock (generated, committed to version control)
# exact, fully-resolved versions of EVERY dependency, including transitive ones:
requests==2.31.0
urllib3==2.0.7      <- a transitive dependency of requests, also pinned
django==4.2.7
sqlparse==0.4.4      <- a transitive dependency of django, also pinned

The pyproject.toml/Pipfile declares acceptable ranges (compatibility intent — "any 2.x of requests is fine"); the lock file records the exact versions that were actually resolved and tested, including every transitive dependency, down to a fully reproducible tree — everyone running poetry install/pipenv install from the same lock file gets byte-for-byte identical dependency versions.

Achieving the same with plain pip: `pip-tools`

# requirements.in -- loose, human-maintained
requests>=2.28,<3.0
django>=4.2,<5.0

pip-compile requirements.in     # generates requirements.txt with EVERY package pinned
pip install -r requirements.txt   # exact, reproducible install

pip-compile (from pip-tools) fills the same role as poetry.lock for projects using plain pip — a fully pinned, reproducible requirements.txt generated from a loose, human-edited input file.

Why loose ranges still matter, not just exact pins everywhere

dependencies = ["requests==2.31.0"]   # too strict for a LIBRARY's own dependency declaration

If a library (as opposed to a deployable application) pins exact versions for its own dependencies, it forces every consumer of that library into the exact same versions too — creating conflicts when two libraries in the same project pin incompatible exact versions of a shared dependency. Libraries should declare loose, compatible ranges; applications (the actual deployable unit) are what should carry a fully pinned lock file for their own reproducible deployment.

Security: keeping pinned dependencies from going stale

poetry update requests    # deliberately bump one pinned dependency, re-lock
pip-audit                  # scan installed/locked dependencies for known CVEs

Pinning solves reproducibility but introduces a new responsibility: dependencies need periodic, deliberate updates (not just "never touch it again") to pull in security patches — tools like pip-audit, Dependabot, or poetry show --outdated help surface when a pinned version has a known vulnerability.

Interview-ready summary: Declare loose, compatible version ranges for what a project depends on, but pin exact, fully-resolved versions (including transitive dependencies) in a committed lock file (poetry.lock, Pipfile.lock, or pip-compile's output) for actual deployment reproducibility — and revisit those pins periodically for security updates rather than freezing them permanently.

Related Resources

pip-tools documentation

1. Mutable default arguments

# BAD -- shared across every call that omits the argument
def add_item(item, bucket=[]):
    bucket.append(item)
    return bucket

# GOOD
def add_item(item, bucket=None):
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket

Covered in depth in the Fundamentals topic — worth repeating here as one of the most common real-world bugs traced back to a single anti-pattern.
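
Running the two versions side by side makes the leak visible. The name `add_item_shared` is used here only to keep both variants in one file.

```python
def add_item_shared(item, bucket=[]):
    # The default list is created once, when the function is defined.
    bucket.append(item)
    return bucket


def add_item(item, bucket=None):
    if bucket is None:
        bucket = []   # a fresh list on every call that omits the argument
    bucket.append(item)
    return bucket


first = list(add_item_shared("a"))
second = list(add_item_shared("b"))   # still contains "a" from the first call
print(first, second)

fixed_first, fixed_second = add_item("a"), add_item("b")
print(fixed_first, fixed_second)
```

The shared default lives on the function object itself (`add_item_shared.__defaults__`), which is why it survives between calls.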

2. Bare `except:` (or overly broad exception handling)

# BAD
try:
    risky_operation()
except:
    pass    # swallows EVERYTHING, including typos (NameError) and Ctrl+C

Silently swallowing all exceptions hides real bugs and makes production issues nearly impossible to diagnose — always catch specific exceptions, and if you must log-and-continue at a boundary, log the actual exception (logger.exception(...)), don't discard it.
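
A narrower version of the same pattern might look like the sketch below. `parse_port` is a hypothetical function, and the set of caught exceptions is limited to what `int()` is expected to raise.

```python
import logging

logger = logging.getLogger(__name__)


def parse_port(raw):
    """Return a port number, or None if the input is not a valid integer."""
    try:
        return int(raw)
    except (TypeError, ValueError):
        # Catch only the expected failures, and keep the traceback in the log.
        logger.exception("invalid port value: %r", raw)
        return None


good, bad = parse_port("8080"), parse_port("eighty")
print(good, bad)

# KeyboardInterrupt and SystemExit derive from BaseException, not Exception,
# so "except Exception" lets them through while a bare "except:" does not.
print(issubclass(KeyboardInterrupt, Exception), issubclass(SystemExit, Exception))
```

A typo inside the `try` body would raise `NameError`, which this handler does not catch, so the bug surfaces instead of being silently absorbed.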

3. Mutable class attributes intended as instance attributes

# BAD -- shared across every instance!
class ShoppingCart:
    items = []
    def add(self, item):
        self.items.append(item)

cart1 = ShoppingCart()
cart2 = ShoppingCart()
cart1.add("apple")
cart2.items   # ['apple'] -- BUG: cart2 sees cart1's item!

# GOOD -- instance attribute, set per-object in __init__
class ShoppingCart:
    def __init__(self):
        self.items = []

items = [] at class scope creates one list shared by every instance; each instance needs its own list created in __init__.
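If the class is written as a dataclass, the same per-instance behavior can be expressed with `field(default_factory=list)`. This is a sketch of an alternative, not a replacement for the `__init__` version above; the class name `Cart` is hypothetical. Note that `dataclasses` rejects a plain `items: list = []` default with a `ValueError`, precisely to prevent this bug.

```python
from dataclasses import dataclass, field

@dataclass
class Cart:
    # The factory is called once per instance, so each cart gets its own list.
    items: list = field(default_factory=list)

    def add(self, item):
        self.items.append(item)

cart1 = Cart()
cart2 = Cart()
cart1.add("apple")
```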

4. Wildcard imports

# BAD -- pollutes the namespace, unclear where names come from
from mymodule import *

value = some_function()   # which module defined this? impossible to tell by reading

# GOOD -- explicit imports, or a namespaced import
from mymodule import some_function
import mymodule
mymodule.some_function()

import * makes it impossible to tell, just by reading the code, which module a given name came from — it also risks silently shadowing existing names, and static analysis tools/IDEs can't reliably follow it.

5. Catching an exception just to silence it

# BAD -- hides real failures, produces confusing downstream behavior
try:
    result = fetch_data()
except Exception:
    result = None   # caller now has no idea WHY this is None

# GOOD -- handle it meaningfully, or let it propagate
import logging

logger = logging.getLogger(__name__)

try:
    result = fetch_data()
except ConnectionError:
    logger.warning("fetch_data failed, using cached value")
    result = get_cached_value()

Swallowing an exception into a generic fallback value without logging or distinguishing why it failed turns a diagnosable failure into a mysterious downstream symptom.
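When no sensible fallback exists, one option is to let the failure propagate as a domain-specific error while keeping the original cause attached. The sketch below assumes a hypothetical `DataUnavailableError` and a stand-in `fetch_data` that always fails.

```python
class DataUnavailableError(RuntimeError):
    """Hypothetical domain error raised when no data source is reachable."""

def fetch_data():
    # Stand-in for a real network call; it always fails in this sketch.
    raise ConnectionError("upstream timed out")

def load_report():
    try:
        return fetch_data()
    except ConnectionError as exc:
        # "from exc" keeps the original failure attached as __cause__,
        # so the traceback shows both the symptom and its root cause.
        raise DataUnavailableError("report data could not be fetched") from exc
```

The caller sees a clear, typed failure instead of a mysterious `None`, and the traceback still points at the connection problem.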

6. Using `type()` instead of `isinstance()` for type checks

# BAD -- breaks for subclasses
if type(obj) == list:
    ...

# GOOD -- respects inheritance/polymorphism
if isinstance(obj, list):
    ...

type(obj) == list fails for any subclass of list, defeating polymorphism; isinstance (which also accepts a tuple of types) is almost always what's actually intended.
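A short demonstration of the difference, using a hypothetical `TaggedList` subclass:

```python
class TaggedList(list):
    """Hypothetical list subclass used only for this illustration."""

tagged = TaggedList([1, 2, 3])

exact_match = type(tagged) == list                  # False: the type is TaggedList
instance_match = isinstance(tagged, list)           # True: TaggedList is a list
either_match = isinstance((1, 2), (list, tuple))    # True: a tuple of accepted types
```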

7. String concatenation in a loop instead of `str.join`

# BAD -- O(n^2): each += creates a new string, copying everything so far
result = ""
for item in items:
    result += str(item)

# GOOD -- O(n): join builds the final string in one pass
result = "".join(str(item) for item in items)

Since strings are immutable, repeated += in a loop recreates the entire string on every iteration — quadratic behavior that str.join avoids entirely.
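When the loop body is too involved for a single generator expression, a common variant is to collect the pieces in a list and join once at the end. The helper below is a sketch with a hypothetical name and a made-up rule (skip `None` values).

```python
def render_csv_row(values):
    """Join the values that are not None with commas."""
    parts = []
    for value in values:
        if value is None:
            continue               # skip missing fields
        parts.append(str(value))   # list append does not copy earlier pieces
    return ",".join(parts)         # the final string is built once

row = render_csv_row([1, None, "x", 2.5])
```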

Interview-ready summary: Most Python anti-patterns share a common thread — a subtle mismatch between what the code visually appears to do and what actually happens under the hood (mutable defaults/class attributes shared unexpectedly, bare except: hiding real failures, type() breaking polymorphism). Recognizing and avoiding this small, well-known set of patterns eliminates a large share of real-world Python bugs.


How do you package and deploy a Python application?

Packaging a library: wheels and PyPI

python -m build              # builds dist/mypackage-1.0.0-py3-none-any.whl + a source dist
python -m twine upload dist/*   # publishes to PyPI

A wheel (.whl) is a pre-built, ready-to-install package format — pip install mypackage fetches and installs it without needing to run any build step on the user's machine (unlike a source distribution, which may require compiling C extensions locally). This is the right target when the deliverable is a reusable library other projects will pip install.
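The wheel filename itself encodes what the file is compatible with. The sketch below splits a name like the one in the build comment above into its dash-separated fields. The helper and its field names are hypothetical; real tooling should rely on an established library rather than hand-rolled parsing.

```python
from typing import NamedTuple, Optional

class WheelName(NamedTuple):
    distribution: str
    version: str
    build: Optional[str]     # optional build tag, usually absent
    python_tag: str
    abi_tag: str
    platform_tag: str

def parse_wheel_filename(filename):
    """Split a wheel filename into its dash-separated components."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        name, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        name, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected wheel filename: {filename!r}")
    return WheelName(name, version, build, py, abi, plat)

info = parse_wheel_filename("mypackage-1.0.0-py3-none-any.whl")
# ABI "none" plus platform "any" marks a pure-Python wheel with no compiled code.
is_pure_python = info.abi_tag == "none" and info.platform_tag == "any"
```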

Packaging an application: containerization

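FROM python:3.12-slim

WORKDIR /app
# Install into the image's own Python so the CMD below can import the dependencies
ENV POETRY_VIRTUALENVS_CREATE=false
COPY pyproject.toml poetry.lock ./
# --only main skips dev dependencies (it replaces the older --no-dev flag)
RUN pip install poetry && poetry install --only main --no-root

COPY . .
CMD ["python", "-m", "myapp"]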

For a deployable service (as opposed to a library), a Docker image bundles the Python runtime, the exact pinned dependencies (via the lock file), and the application code into one artifact that runs the same way regardless of the host machine. Because the container is the runtime environment, this removes most "works on my machine" issues. Note that a tag like python:3.12-slim still floats across patch releases; pin a specific patch version or image digest if the interpreter itself must be exactly reproducible.

Why containers dominate for applications specifically

Reproducibility: the same image runs identically in CI, staging, and production — no drift from differing host Python versions or system libraries.
Isolation: no conflicts with other applications' dependencies on the same host.
Orchestration compatibility: container images are the standard unit Kubernetes, ECS, and most modern deployment platforms expect.

Deployment targets, by workload shape

| Workload | Common choice |
| --- | --- |
| Long-running web service, need fine control | Kubernetes / ECS running the container |
| Simpler apps, less ops overhead desired | PaaS (Heroku, Fly.io, Render), which often deploys straight from a `Procfile`/buildpack with no Dockerfile needed |
| Event-driven, sporadic/bursty invocations | Serverless (AWS Lambda, Google Cloud Functions), packaged as a zip/layer or container image and billed per invocation |
| CLI tool distributed to end users | A wheel published to PyPI, or a bundled executable (`pyinstaller`, `shiv`) for non-Python-savvy users |

Entry points and process management in production

gunicorn myapp.wsgi:application --workers 4     # WSGI, process-based concurrency
uvicorn myapp.asgi:application --workers 4        # ASGI, event-loop-based concurrency per worker

A production WSGI/ASGI application is served by a dedicated application server (Gunicorn, Uvicorn) rather than a development server (Django's runserver, Flask's built-in dev server) — the dev servers are explicitly not designed for production traffic (no proper worker management, no production-grade concurrency handling).
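For orientation, the objects those commands point at are just callables with a fixed signature. The following is a minimal sketch of each, not a production application. The names `wsgi_application` and `asgi_application` are chosen here so both fit in one block; in the commands above each would live in its own module under the name `application`.

```python
def wsgi_application(environ, start_response):
    """Minimal WSGI callable: one synchronous call per request."""
    body = b"hello from WSGI\n"
    start_response("200 OK", [("Content-Type", "text/plain"),
                              ("Content-Length", str(len(body)))])
    return [body]   # an iterable of byte strings

async def asgi_application(scope, receive, send):
    """Minimal ASGI callable: a coroutine driven by the server's event loop."""
    if scope["type"] != "http":
        return      # this sketch only handles plain HTTP requests
    body = b"hello from ASGI\n"
    await send({"type": "http.response.start", "status": 200,
                "headers": [(b"content-type", b"text/plain")]})
    await send({"type": "http.response.body", "body": body})
```

The WSGI function returns its whole response from a single blocking call, whereas the ASGI coroutine can `await` between messages, which is what lets one worker interleave many connections.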

Interview-ready summary: Libraries are packaged as wheels and published to PyPI for pip install; deployable applications are typically containerized with pinned dependencies for full environment reproducibility, then run via an orchestration platform or PaaS matched to the workload's shape (long-running service, serverless, or CLI tool). Production traffic is always served by a dedicated app server (Gunicorn/Uvicorn), never a framework's built-in development server.


