What does "everything is an object" mean in Python, and why does it matter?

In Python, **ints, functions, classes, and modules** are all objects with an identity, type, and attributes — there's no primitive/object split like in Java. This means you can pass functions as arguments, attach attributes to a function, inspect a class's `__dict__` at runtime, and treat `type` itself as an object (an instance of `type`). It's the foundation for decorators, introspection, and duck typing.

What's the difference between `is` and `==`?

`==` calls `__eq__` and checks **value equality** (do these objects represent the same value?). `is` checks **identity** — whether two names point to the *same object in memory* (equivalent to `id(a) == id(b)`). Use `is` for singletons like `None`, `True`, `False`, and `is not` for sentinel checks; use `==` for comparing values.

What's the mutable default argument trap, and how do mutable vs immutable types cause it?

Default argument values are evaluated **once**, when the `def` statement runs, not on every call. If the default is a mutable object (a list, dict, or set), every call that doesn't pass that argument **shares and mutates the same object**, causing state to leak across calls. Fix it by defaulting to `None` and creating the mutable object inside the function body.

How does Python's LEGB scoping rule work?

Python resolves a name by searching, in order: **L**ocal (current function) → **E**nclosing (any enclosing function's scope, for closures) → **G**lobal (module level) → **B**uilt-in (`builtins` module). Assignment inside a function makes a name local by default *for that entire function body*, unless declared `nonlocal` or `global`, which is why reading a name before the line that assigns it in the same function raises `UnboundLocalError`.

What are `*args` and `**kwargs`, and how do you use them together?

`*args` collects extra **positional** arguments into a `tuple`; `**kwargs` collects extra **keyword** arguments into a `dict`. They let a function accept a variable number of arguments and are commonly used for wrapper/proxy functions (e.g., decorators) that forward whatever they receive to another callable. Order matters in the signature: ordinary positional parameters, then `*args`, then keyword-only parameters, then `**kwargs`.

What is duck typing, and how does `typing.Protocol` formalize it?

Duck typing means Python checks **behavior, not declared type** — "if it walks like a duck and quacks like a duck, it's a duck." Any object with the right methods/attributes works, regardless of its class hierarchy. `typing.Protocol` (PEP 544) lets you describe that *structurally* for static type checkers — a class satisfies a `Protocol` by having matching methods, with no explicit inheritance required.

What's the difference between `__str__` and `__repr__`?

`__repr__` returns an unambiguous, developer-facing representation (ideally one that could recreate the object) and is used by the REPL, debuggers, and containers (`print([obj])` calls `repr`, not `str`). `__str__` returns a human-readable, user-facing string and is used by `print(obj)`/`str(obj)`. If `__str__` is not defined, Python falls back to `__repr__`.

How does Python's import system work (modules, packages, `sys.path`)?

`import x` searches `sys.modules` (already-imported cache) first, then searches the directories in `sys.path` (script dir, `PYTHONPATH`, installed site-packages) using **finders** and **loaders**. A directory becomes a regular package if it has an `__init__.py` (optional since Python 3.3, which introduced namespace packages). Each module is executed once and cached in `sys.modules`, so re-importing just returns the cached module object.

What are dunder (magic) methods, and how do they enable operator overloading?

Dunder methods (`__init__`, `__add__`, `__eq__`, `__len__`, `__getitem__`, etc.) are hooks that Python's syntax and built-in functions call implicitly — `a + b` calls `a.__add__(b)`, `len(x)` calls `x.__len__()`, `x[i]` calls `x.__getitem__(i)`. Implementing them lets user-defined classes participate in built-in syntax (arithmetic, comparisons, iteration, indexing, context managers) the same way built-in types do.

How does slicing work, and what's the difference between slicing a list, a string, and using `slice()`?

`seq[start:stop:step]` returns a **new** sequence of the same type containing elements from `start` (inclusive) to `stop` (exclusive), stepping by `step`; negative indices count from the end and a negative `step` reverses direction. Under the hood, `a[start:stop:step]` builds a `slice` object and calls `a.__getitem__(slice(start, stop, step))` — the same mechanism for lists, strings, tuples, and any custom class implementing `__getitem__`.

What are f-strings, and how do they compare to `%`-formatting and `.format()`?

f-strings (`f"{value}"`, PEP 498) embed expressions directly inside string literals and are evaluated **at runtime, each time the literal is executed**. They're usually the fastest and most readable option and support format specs (`{value:.2f}`) and debugging output (`{value=}`, Python 3.8+). `%`-formatting is the oldest, printf-style approach; `str.format()` is more flexible than `%` but more verbose than f-strings. Modern Python code should default to f-strings.

How does variable assignment actually work in Python (references, not boxes)?

A Python variable is a **name bound to an object**, not a labeled memory box holding a value. `x = [1, 2]` makes the name `x` point at a list object; `y = x` makes `y` point at the *same* object — no copy happens. Reassigning `x = something_else` just repoints the name; it never affects `y` or the original object. Mutating through one name (`x.append(3)`) is visible through every other name bound to that same object.

What's the walrus operator (`:=`), and when should you use it?

The walrus operator (`:=`, PEP 572, Python 3.8+) assigns a value to a name **as part of a larger expression**, letting you both compute and bind a value in a single line — most commonly inside a `while` condition, an `if` condition, or a comprehension, to avoid computing the same expression twice.

Python Fundamentals & the Data Model

Core language semantics: objects, references, scoping, unpacking, dunder methods, imports, and formatting.


Everything is an object

In Python, 1, "hello", def f(): pass, a class, a module, and even type itself are all objects: each has an identity (id(x)), a type (type(x)), and a set of attributes. There is no distinction between "primitive types" and "reference types" the way there is in Java or C#.

def greet():
    return "hi"

print(type(greet))          # <class 'function'>
print(greet.__name__)       # 'greet'
greet.calls = 0             # you can attach arbitrary attributes to a function
greet.calls += 1

print(type(int))            # <class 'type'>
print(type(type))           # <class 'type'>  -- type is an instance of itself

Why it matters

1. Functions are first-class values. You can store them in variables, put them in lists, pass them as arguments, and return them from other functions — this is what makes decorators, callbacks, and higher-order functions like map/sorted(key=...) work.

2. Classes and types are runtime objects. class Foo: ... executes a statement that creates a type object and binds it to Foo. That's why you can build classes dynamically with type(name, bases, namespace), and why metaclasses (which customize how type builds a class) are possible.

3. Introspection is cheap and universal. Because every object has a type and works with type(), dir(), and getattr() (most objects also expose __dict__, though instances of int or of classes that define __slots__ do not), generic tooling (debuggers, serializers, ORMs, pytest fixtures) can inspect any object the same way, regardless of whether it's a number, a function, or a user-defined class.

4. It underlies duck typing. Since behavior is just "does this object respond to this attribute/method," Python doesn't need a common base type to treat unrelated objects polymorphically — it just checks capabilities at the point of use.
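Points 1 and 2 above can be sketched in a few lines. The names `apply_twice`, `shout`, and `Dynamic` are made up for illustration only.

```python
def apply_twice(func, value):
    # func is an ordinary object, passed in like any other argument
    return func(func(value))

def shout(text):
    return text + "!"

result = apply_twice(shout, "hi")        # 'hi!!'

# Build a class at runtime with type(name, bases, namespace)
Dynamic = type("Dynamic", (object,), {"answer": 42})
instance = Dynamic()

print(result, type(Dynamic), instance.answer)
```

The `class` statement does essentially the same thing as the `type(...)` call, just with nicer syntax.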

Interview-ready summary: Python has a single, uniform object model — numbers, functions, classes, and modules are all first-class objects with identity, type, and attributes. That uniformity is why closures, decorators, metaclasses, and duck typing all work through the same mechanism: attribute access and the type system, not special-cased primitive rules.

Related Resources

Data model — Python docs


`==` vs `is`

== invokes __eq__ and answers "are these values equal?", whereas the is operator answers "are these literally the same object?" (identical id()).

a = [1, 2, 3]
b = [1, 2, 3]
c = a

a == b   # True  -- same contents
a is b   # False -- two distinct list objects
a is c   # True  -- c is a name bound to the same object as a

Why `a == b` can be True while `a is b` is False

Two separate lists (or dicts, or custom objects with a custom __eq__) can be equal in value without being the same object. The default __eq__ inherited from object actually falls back to identity, but built-in containers and most user classes override it to compare contents.
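A minimal sketch of that difference, using two hypothetical classes: `Plain` inherits the default identity-based `__eq__`, while `Valued` overrides it to compare contents.

```python
class Plain:
    def __init__(self, value):
        self.value = value

class Valued:
    def __init__(self, value):
        self.value = value

    def __eq__(self, other):
        if not isinstance(other, Valued):
            return NotImplemented
        return self.value == other.value

plain_equal = Plain(1) == Plain(1)       # False: object.__eq__ compares identity
valued_equal = Valued(1) == Valued(1)    # True: compares the stored values
valued_same = Valued(1) is Valued(1)     # False: still two distinct objects
```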

When to use `is`

None checks: always x is None, never x == None. None is a singleton, and is avoids accidentally invoking a custom __eq__ that might behave unexpectedly.
Singletons/sentinels: x is True, or a private sentinel object (_MISSING = object()) used to distinguish "not provided" from "explicitly None".
Identity-sensitive logic: e.g., checking whether a cache returned the exact cached instance rather than an equal copy.
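The sentinel case can be sketched as follows. `lookup` and `settings` are hypothetical names; the point is only that `is` lets the function tell "no default given" apart from "default is None".

```python
_MISSING = object()   # unique sentinel; no other object is identical to it

def lookup(settings, key, default=_MISSING):
    value = settings.get(key, _MISSING)
    if value is not _MISSING:
        return value              # the key exists, even if its value is None
    if default is _MISSING:
        raise KeyError(key)       # caller gave no default at all
    return default

settings = {"timeout": None}
stored_none = lookup(settings, "timeout")          # None (explicitly stored)
explicit_default = lookup(settings, "retries", 3)  # 3
```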

The small-integer/string trap

CPython caches small integers (-5 to 256) and some string literals, so is can appear to work for equality by coincidence:

x = 256
y = 256
x is y   # True (cached)

x = 257
y = 257
x is y   # often False at the REPL, but may be True within one script or function; no caching guarantee

This is a CPython implementation detail, not part of the language spec — relying on it for anything beyond None/True/False is a bug waiting to happen.

Interview-ready summary: == compares values via __eq__; is compares identity via id(). Always use is for None/singleton checks and == for everything else — never rely on integer/string caching as a substitute for ==.

Related Resources

Comparisons — Python docs


The trap

def add_item(item, bucket=[]):
    bucket.append(item)
    return bucket

add_item("a")   # ['a']
add_item("b")   # ['a', 'b']  -- surprise! same list as before

The default [] is created once, at function-definition time, and stored on the function object (add_item.__defaults__). Every call that omits bucket reuses that exact same list, so mutations accumulate across unrelated calls.
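One way to see the shared default directly is to inspect `__defaults__` before and after a couple of calls (a small sketch reusing the `add_item` function from above):

```python
def add_item(item, bucket=[]):
    bucket.append(item)
    return bucket

print(add_item.__defaults__)    # ([],)
first = add_item("a")
second = add_item("b")
print(add_item.__defaults__)    # (['a', 'b'],)
print(first is second)          # True: both calls returned the stored default
```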

Why immutable defaults don't have this problem

def greet(name, suffix="!"):
    return name + suffix

"!" is immutable — nothing inside greet can mutate the string object itself, so there's no shared, mutable state to leak. The bug is specific to mutable default values (list, dict, set, or any mutable custom object).

The fix

def add_item(item, bucket=None):
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket

Now a fresh list is created on every call that doesn't supply bucket, while callers who do want to accumulate into a shared list can still pass one explicitly.

The general lesson: mutable vs immutable

Immutable (int, float, str, tuple, frozenset, bytes): any "modification" creates a new object; the original is never changed. Safe to share across function calls, default arguments, and dict keys.
Mutable (list, dict, set, most custom classes): the object can be changed in place; sharing a reference means all holders see the mutation. Never use a mutable object as a default argument, and be careful when a mutable object is a class attribute (shared across all instances) instead of an instance attribute (set in __init__).

class Bad:
    items = []          # class attribute — shared by every instance!
    def __init__(self):
        pass

class Good:
    def __init__(self):
        self.items = []  # instance attribute — one per object
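A short check of the difference, repeating the two classes so the snippet runs on its own:

```python
class Bad:
    items = []               # one list, stored on the class

class Good:
    def __init__(self):
        self.items = []      # a new list for each instance

a, b = Bad(), Bad()
a.items.append("x")
bad_leak = b.items           # ['x']: b sees a's append

c, d = Good(), Good()
c.items.append("x")
good_isolated = d.items      # []: d has its own list
```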

Interview-ready summary: Default arguments are evaluated once at def-time and stored on the function object, so a mutable default is shared across every call that uses it. Always default mutable arguments to None and construct the real object inside the function body — and apply the same caution to mutable class attributes.

Related Resources

Default Argument Values — Python docs


The four scopes, in lookup order

x = "global"

def outer():
    x = "enclosing"
    def inner():
        x = "local"
        print(x)          # 'local'   -- found in Local scope
    inner()
    print(x)               # 'enclosing'

outer()
print(x)                    # 'global'
print(len)                  # built-in, found in Built-in scope

When Python looks up a bare name, it checks Local, then Enclosing function scopes (innermost to outermost), then Global (module), then Built-in — the first scope where the name is bound wins.

The gotcha: assignment makes a name local for the whole function

Python decides whether a name is local to a function at compile time, by scanning the function body for assignments — not by checking whether the assignment has "already happened" at runtime.

count = 0

def increment():
    print(count)     # UnboundLocalError!
    count = count + 1

Because count = ... appears anywhere in increment, Python treats count as local for the entire function body — including the print(count) line before the assignment. It never falls back to the global count.
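A runnable variant of the same failure, as a sketch: the error only appears when the function is called, and the compiler's decision that `count` is local is visible on the function's code object.

```python
count = 0

def increment():
    current = count        # read happens before the assignment below
    count = current + 1    # this assignment makes `count` local to the whole body
    return count

try:
    increment()
    outcome = "no error"
except UnboundLocalError:
    outcome = "UnboundLocalError"

# Names the compiler classified as local variables of increment()
local_names = increment.__code__.co_varnames
```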

Fixing it: `global` and `nonlocal`

count = 0

def increment():
    global count
    count += 1          # now refers to the module-level count

def make_counter():
    total = 0
    def add(n):
        nonlocal total   # refers to make_counter's `total`, not a new local
        total += n
        return total
    return add

global binds a name to the module-level scope.
nonlocal binds a name to the nearest enclosing function scope (not global) — this is what makes stateful closures possible.

Why this matters for closures

The "E" in LEGB is exactly what lets a nested function remember variables from its enclosing function after that function has returned — the classic closure pattern (make_counter above). Without nonlocal, a nested function can read an enclosing variable freely, but assigning to it creates a new local instead of updating the enclosing one.
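A brief usage sketch of `make_counter` shows that each call creates an independent enclosing scope, and that the remembered value lives in a closure cell attached to the inner function:

```python
def make_counter():
    total = 0
    def add(n):
        nonlocal total
        total += n
        return total
    return add

first = make_counter()
second = make_counter()

first(5)
first_total = first(2)       # 7
second_total = second(10)    # 10: a separate `total`

# The enclosing variable survives in a closure cell on the function object
remembered = first.__closure__[0].cell_contents   # 7
```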

Interview-ready summary: Name resolution follows Local → Enclosing → Global → Built-in, and whether a name is "local" is decided statically by scanning for assignments in the function body — which is why referencing a name before assigning it in the same function raises UnboundLocalError instead of falling back to an outer scope. global and nonlocal are the explicit escape hatches for writing to an outer scope.

Related Resources

Naming and binding — Python docs


Collecting variable arguments

def summarize(*args, **kwargs):
    print(args)     # tuple of positional args
    print(kwargs)   # dict of keyword args

summarize(1, 2, 3, name="Ada", active=True)
# (1, 2, 3)
# {'name': 'Ada', 'active': True}

*args gathers any positional arguments beyond the named parameters into a tuple; **kwargs gathers any keyword arguments not matched by name into a dict.

Forwarding arguments (the most common real use)

def logged(func):
    def wrapper(*args, **kwargs):
        print(f"calling {func.__name__}")
        return func(*args, **kwargs)   # forward everything, unchanged
    return wrapper

This is why almost every decorator's wrapper signature is (*args, **kwargs) — it makes the wrapper work for any wrapped function signature without needing to know it in advance.
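In practice the wrapper is often also decorated with `functools.wraps`, which is not part of the example above but is a standard-library helper that copies the wrapped function's name and docstring onto the wrapper. A sketch, with `add` as a hypothetical function to decorate:

```python
import functools

def logged(func):
    @functools.wraps(func)            # copy __name__, __doc__, etc. onto wrapper
    def wrapper(*args, **kwargs):
        print(f"calling {func.__name__}")
        return func(*args, **kwargs)  # forward everything, unchanged
    return wrapper

@logged
def add(a, b=0):
    """Add two numbers."""
    return a + b

total = add(2, b=3)    # prints 'calling add', returns 5
```

Without `functools.wraps`, `add.__name__` would report `'wrapper'`.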

Combining with named and keyword-only parameters

def request(url, *args, timeout=30, **kwargs):
    ...

Parameter order must be: positional params → *args → keyword-only params (anything after *args must be passed by name) → **kwargs. This lets you mix a required positional API with an "escape hatch" for extra options.
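To make the sorting of arguments visible, here is a sketch in which `request` is a stand-in that simply reports where each argument ended up (it performs no real request, and the `"/health"` path is arbitrary):

```python
def request(url, *args, timeout=30, **kwargs):
    # Hypothetical stand-in: report how the arguments were bound
    return {"url": url, "args": args, "timeout": timeout, "kwargs": kwargs}

by_name = request("/health", "a", "b", timeout=5, retries=2)
# {'url': '/health', 'args': ('a', 'b'), 'timeout': 5, 'kwargs': {'retries': 2}}

positional = request("/health", 60)
# 60 lands in args; timeout keeps its default because it is keyword-only
```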

Unpacking at the call site

The same */** syntax unpacks a sequence or mapping into a call:

values = (1, 2, 3)
options = {"name": "Ada"}
summarize(*values, **options)   # same as summarize(1, 2, 3, name="Ada")

Interview-ready summary: *args/**kwargs are Python's mechanism for variadic functions — *args as a tuple of extra positional arguments, **kwargs as a dict of extra keyword arguments. They're essential for writing generic wrappers (decorators, proxies) that forward calls without caring about the wrapped function's exact signature.

Related Resources

More on Defining Functions — Python docs


Duck typing in practice

class Duck:
    def quack(self):
        return "Quack!"

class Person:
    def quack(self):
        return "I'm quacking!"

def make_it_quack(thing):
    return thing.quack()   # no type check — just calls the method

make_it_quack(Duck())     # works
make_it_quack(Person())   # also works — Person isn't a Duck subclass

make_it_quack never checks isinstance(thing, Duck) — it just calls .quack() and trusts that the object supports it. This is why for x in anything: works on lists, files, generators, and custom classes alike: they all implement __iter__, regardless of ancestry.

The problem for static typing

Duck typing is dynamic — you find out at runtime whether an object has the right method. Static type checkers (mypy, pyright) need something they can verify ahead of time without requiring every duck-typed class to inherit from a common base (which would defeat the point).

`Protocol`: structural typing for static checkers

from typing import Protocol

class Quacks(Protocol):
    def quack(self) -> str: ...

def make_it_quack(thing: Quacks) -> str:
    return thing.quack()

make_it_quack(Duck())     # type-checks fine — Duck has a matching quack()
make_it_quack(Person())   # also fine — no inheritance from Quacks needed

Quacks is never inherited from; mypy checks whether Duck and Person structurally match its method signatures. This is "nominal vs structural" typing: ABC/inheritance is nominal (you must declare the relationship), Protocol is structural (the shape is enough).
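If a runtime check is also wanted, `typing.runtime_checkable` allows `isinstance()` against a protocol. This goes slightly beyond the example above, and the check only looks for the presence of the method, not its signature. `Rock` is a hypothetical class without `quack`.

```python
from typing import Protocol, runtime_checkable

@runtime_checkable
class Quacks(Protocol):
    def quack(self) -> str: ...

class Duck:
    def quack(self) -> str:
        return "Quack!"

class Rock:
    pass

duck_matches = isinstance(Duck(), Quacks)    # True: has a quack attribute
rock_matches = isinstance(Rock(), Quacks)    # False: no quack attribute
inherits = Quacks in Duck.__mro__            # False: no nominal relationship
```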

When to use which

Plain duck typing: quick scripts, internal code, when you don't need static verification.
Protocol: public APIs and larger codebases where you want mypy to catch "this object doesn't have the method you're calling" before runtime, without forcing every caller into a shared base class.

Interview-ready summary: Duck typing lets Python code work with any object that has the right methods, independent of its class hierarchy. Protocol keeps that runtime flexibility and adds static verification: a class satisfies a Protocol by shape, not by declared inheritance.

Related Resources

typing.Protocol — Python docs


Two different audiences

from datetime import date

d = date(2024, 1, 15)
str(d)    # '2024-01-15'                -- readable
repr(d)   # 'datetime.date(2024, 1, 15)' -- unambiguous, evaluable

__str__ targets end users / logs — readable output. __repr__ targets developers debugging — precise, ideally eval()-able output that makes it unambiguous exactly what the object is.

Defining both on a custom class

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y

    def __repr__(self):
        return f"Point(x={self.x}, y={self.y})"

    def __str__(self):
        return f"({self.x}, {self.y})"

p = Point(1, 2)
print(p)          # (1, 2)              -- calls __str__
print(repr(p))    # Point(x=1, y=2)     -- calls __repr__
print([p])        # [Point(x=1, y=2)]   -- containers always use repr()

The fallback rule

If you only define __repr__, str(obj) falls back to calling __repr__ (since object.__str__ calls self.__repr__() by default). So a common shortcut for simple classes is to define only __repr__ and skip __str__ entirely, unless the two representations genuinely need to differ.
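The fallback runs in one direction only, which a small sketch can confirm. `Temperature` and `OnlyStr` are illustrative classes, not part of the original example.

```python
class Temperature:
    def __init__(self, degrees):
        self.degrees = degrees

    def __repr__(self):
        return f"Temperature({self.degrees})"

class OnlyStr:
    def __str__(self):
        return "friendly"

t = Temperature(21)
as_str = str(t)                    # 'Temperature(21)': falls back to __repr__
in_fstring = f"{t}"                # 'Temperature(21)'
only_str_repr = repr(OnlyStr())    # default '<... OnlyStr object at 0x...>'
```

Defining only `__str__` leaves `repr()` with the default angle-bracket form, so containers and the REPL would still show the unhelpful version.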

Why containers always use `repr`

print([1, "a", p]) shows [1, 'a', Point(x=1, y=2)] — note the quotes around 'a' and the repr form of p. Lists, dicts, and other containers call repr() on their elements so that nested strings are visibly quoted and distinguishable from the surrounding structure; using str() would make ["a", "b"] print as [a, b], indistinguishable from bare identifiers.

Interview-ready summary: __repr__ is for developers/debugging (unambiguous, ideally eval-able); __str__ is for end users (readable). str() falls back to __repr__ if __str__ isn't defined, and containers always render their elements with repr, never str.

Related Resources

object.__repr__ — Python docs


The lookup sequence

When you run import foo, Python:

1. Checks sys.modules["foo"]: if already imported, returns the cached module object immediately (imports are idempotent and side-effect-free after the first time).
2. Otherwise, walks sys.path (a list of directories: the script's own directory, PYTHONPATH entries, and the standard library/site-packages paths) using finders, which locate the module and return a spec.
3. A loader then executes the module's code in a fresh namespace, which becomes the module object, and stores it in sys.modules.

import sys
print(sys.path)          # search directories, in order
print(sys.modules.keys()) # every module imported so far, cached
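The caching step can be observed with any standard-library module; `json` is used here purely as an example.

```python
import importlib
import sys

import json                                  # first import runs the module and caches it

cached = sys.modules["json"]                 # the cache entry
again = importlib.import_module("json")      # served from sys.modules, not re-executed

same_object = (json is cached) and (again is json)   # True
```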

Packages vs modules

A module is a single .py file.
A package is a directory containing modules (and possibly subpackages). Historically it required an __init__.py (even if empty) to mark the directory as a package; since PEP 420 (Python 3.3), directories without __init__.py can act as namespace packages and still be importable, though most real projects still use __init__.py for explicit control over what a package exports.

myapp/
    __init__.py
    models.py
    utils/
        __init__.py
        strings.py

from myapp.utils.strings import slugify

Absolute vs relative imports

# absolute — resolved from sys.path, preferred for clarity
from myapp.utils import strings

# relative — resolved from the current package's position
from .strings import slugify     # same package
from ..models import User        # parent package

Relative imports only work inside a package (a module run directly as a script has no package context), which is a common source of ImportError: attempted relative import with no known parent package.

Circular imports

If a.py imports b.py and b.py imports a.py, whichever module runs first will see a partially initialized version of the other (only the names defined before the circular import line exist). Common fixes: move the import inside the function that needs it (deferred import), restructure shared code into a third module, or import the module object itself (import a) instead of pulling names out of it at import time.

Interview-ready summary: Imports are cached in sys.modules and only execute a module's top-level code once; the search path is sys.path, walked by finders/loaders. Packages are directories of modules (optionally marked by __init__.py); relative imports resolve against the current package, and circular imports break when one module is only partially initialized by the time the other needs it.

Related Resources

The import system — Python docs


How operator syntax maps to method calls

class Money:
    def __init__(self, cents):
        self.cents = cents

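    def __add__(self, other):
        if not isinstance(other, Money):
            return NotImplemented   # lets Python try other.__radd__ or raise TypeError
        return Money(self.cents + other.cents)

    def __eq__(self, other):
        if not isinstance(other, Money):
            return NotImplemented   # comparing with a non-Money falls back to identity
        return self.cents == other.cents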

    def __repr__(self):
        return f"Money({self.cents})"

a = Money(150)
b = Money(50)
a + b       # calls a.__add__(b)  -> Money(200)
a == b      # calls a.__eq__(b)   -> False

a + b is syntax sugar that the interpreter desugars to type(a).__add__(a, b) (falling back to type(b).__radd__(b, a) if a doesn't know how to add b). Every piece of "special" syntax in Python has a dunder method behind it.
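The `__radd__` fallback can be sketched with a variant of `Money`. Treating a plain `int` as a number of cents is an assumption made only for this illustration.

```python
class Money:
    def __init__(self, cents):
        self.cents = cents

    def __add__(self, other):
        if isinstance(other, Money):
            return Money(self.cents + other.cents)
        if isinstance(other, int):         # assumption: ints mean cents
            return Money(self.cents + other)
        return NotImplemented              # let Python try the other operand

    def __radd__(self, other):
        # Used for `other + self` when other's __add__ returned NotImplemented
        return self.__add__(other)

    def __repr__(self):
        return f"Money({self.cents})"

left = Money(100) + 50                 # Money.__add__      -> Money(150)
right = 50 + Money(100)                # int fails, so Money.__radd__ -> Money(150)
total = sum([Money(1), Money(2)])      # sum() starts from 0, so __radd__ is needed
```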

Common categories

| Syntax | Dunder method(s) |
| --- | --- |
| `a + b`, `a - b`, `a * b` | `__add__`, `__sub__`, `__mul__` (+ `__radd__`, etc.) |
| `a == b`, `a < b` | `__eq__`, `__lt__`, ... |
| `len(x)` | `__len__` |
| `x[i]`, `x[i] = v` | `__getitem__`, `__setitem__` |
| `for i in x` | `__iter__` / `__next__` |
| `with x:` | `__enter__`, `__exit__` |
| `str(x)`, `repr(x)` | `__str__`, `__repr__` |
| `x()` | `__call__` |
| `hash(x)` | `__hash__` |
| `x in y` | `__contains__` |

The `eq`/`hash` contract

If you override __eq__, Python sets __hash__ to None unless you also define it, making instances unhashable (can't be used as dict keys or set members). Two objects that are equal must have the same hash, so if you define __eq__, define a consistent __hash__ too (or explicitly leave the class unhashable if that's intended).
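A small sketch of the contract, using two hypothetical classes that differ only in whether `__hash__` is defined:

```python
class EqOnly:
    def __init__(self, cents):
        self.cents = cents

    def __eq__(self, other):
        return isinstance(other, EqOnly) and self.cents == other.cents

class EqAndHash:
    def __init__(self, cents):
        self.cents = cents

    def __eq__(self, other):
        return isinstance(other, EqAndHash) and self.cents == other.cents

    def __hash__(self):
        return hash(self.cents)      # equal objects produce equal hashes

hash_removed = EqOnly.__hash__ is None      # True: set to None by the class machinery

try:
    {EqOnly(1)}
    set_result = "ok"
except TypeError:
    set_result = "unhashable"

unique_count = len({EqAndHash(1), EqAndHash(1)})   # 1: equal and same hash
```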

Why this is more than syntax sugar

Dunders are how Python achieves polymorphism without a common base class requirement — any object implementing __iter__/__next__ works in a for loop, any object implementing __enter__/__exit__ works in a with block. It's the mechanism duck typing runs on.

Interview-ready summary: Dunder methods are the hooks that back Python's operators and built-in functions — +, ==, len(), indexing, iteration, and context managers all desugar to method calls on the operand's type. Implementing them lets custom classes plug into the same syntax built-in types use, and overriding __eq__ without __hash__ makes instances unhashable.

Related Resources

Data model — Python docs


The basic mechanics

s = "Hello, World!"
s[0:5]      # 'Hello'
s[7:]       # 'World!'   -- omit stop -> to the end
s[:5]       # 'Hello'    -- omit start -> from the beginning
s[-6:]      # 'World!'   -- negative index counts from the end
s[::-1]     # '!dlroW ,olleH'  -- negative step reverses
s[::2]      # 'Hlo ol!'  -- every 2nd character

Slicing always returns a new object of the same type as the original (a slice of a str is a str, a slice of a list is a list) — it never mutates the original sequence, and out-of-range indices are clamped rather than raising an error (unlike single-index access, which raises IndexError).
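The clamping behaviour, contrasted with single-index access, in a few lines:

```python
s = "Hello"

tail = s[2:100]     # 'llo': stop is clamped to len(s)
empty = s[100:]     # '': start past the end gives an empty result

try:
    s[100]
    index_result = "no error"
except IndexError:
    index_result = "IndexError"
```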

`list` vs `str` slicing

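Both use identical read syntax and semantics. A list slice creates a shallow copy of the sliced elements (the list itself is new, but if elements are mutable objects, they're the same objects, not deep copies). Because lists are mutable, they also support slice assignment, which replaces a range of elements in place: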

nums = [1, 2, 3, 4, 5]
nums[1:3] = [20, 30]     # slice assignment: replaces elements 1:3
nums                      # [1, 20, 30, 4, 5]

nums[1:3] = [7, 8, 9, 10]  # can even change length!
nums                        # [1, 7, 8, 9, 10, 4, 5]

Strings are immutable, so s[1:3] = "x" raises TypeError — you can only read a slice of a string, never assign into it; "modifying" a string means building a new one (s = s[:1] + "X" + s[3:]).
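The shallow-copy behaviour of list slices can be checked with a nested list (a sketch; `rows` and `part` are arbitrary names):

```python
rows = [[1, 2], [3, 4], [5, 6]]
part = rows[0:2]

new_outer = part is not rows          # True: the slice is a new list
shared_inner = part[0] is rows[0]     # True: elements are the same objects

part[0].append(99)                    # mutates the shared inner list
part.append([7])                      # only changes the new outer list
```

After this runs, `rows[0]` is `[1, 2, 99]` while `rows` still has three elements.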

The `slice()` object

a[start:stop:step] is syntax sugar for a.__getitem__(slice(start, stop, step)):

sl = slice(1, 5, 2)
"Hello, World!"[sl]     # 'el'  -- same as "Hello, World!"[1:5:2]

sl.start, sl.stop, sl.step   # (1, 5, 2)
sl.indices(13)               # normalizes negative/None values for a length-13 sequence

Storing a slice object as a variable is useful when the same slicing pattern is reused in multiple places, or when a custom __getitem__ implementation needs to distinguish obj[i] (an int) from obj[i:j] (a slice instance) to support both.
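A sketch of a custom `__getitem__` that supports both forms. `Playlist` is a hypothetical class, and returning a new `Playlist` for slices is one reasonable design choice rather than a requirement.

```python
class Playlist:
    def __init__(self, tracks):
        self._tracks = list(tracks)

    def __len__(self):
        return len(self._tracks)

    def __getitem__(self, index):
        if isinstance(index, slice):
            # obj[i:j] -> wrap the sliced tracks in a new Playlist
            return Playlist(self._tracks[index])
        # obj[i] -> return a single track
        return self._tracks[index]

p = Playlist(["a", "b", "c", "d"])
single = p[1]                  # 'b'
window = p[1:3]                # Playlist holding ['b', 'c']

# indices() resolves negative/None values against a concrete length
normalized = slice(-3, None).indices(13)   # (10, 13, 1)
```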

Interview-ready summary: Slicing is syntax sugar over __getitem__ with a slice object, works identically across strings, lists, and tuples, always produces a new object of the source's type, and clamps out-of-range bounds instead of raising. Lists additionally support slice assignment (including changing length); strings, being immutable, support slicing only for reading.

Related Resources

Sequence Types — Python docs


The three formatting styles

name, score = "Ada", 97.456

# printf-style (%) -- oldest, C-inspired
"%s scored %.2f%%" % (name, score)

# str.format() -- more flexible, more verbose
"{} scored {:.2f}%".format(name, score)

# f-string (PEP 498) -- modern default
f"{name} scored {score:.2f}%"

str.format() and f-strings share the same format spec mini-language (:.2f, :>10, :,, :%), while %-formatting uses its own printf-style conversion codes (%s, %d, %.2f). f-strings embed the expression and the spec directly in the literal, which is both shorter and lets your editor or type checker see the actual expression being formatted.

Why f-strings are generally preferred

Any expression is allowed inline, not just a variable name: f"{price * 1.08:.2f}", f"{obj.method()}".
The = debug specifier prints both the expression and its value: f"{score=}" → "score=97.456" — handy for quick debugging without writing print(f"score: {score}") by hand.
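Performance: f-strings are parsed at compile time into bytecode that formats each expression and joins the pieces (in CPython, per-value format instructions followed by a single BUILD_STRING), so they are generally faster than % or .format(), which must parse the template at runtime on every call.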
Readability: the value appears exactly where it's used in the string, rather than being separated into an argument list you have to cross-reference by position.
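A few of these features side by side, with arbitrary sample values (the `=` specifier needs Python 3.8 or later):

```python
price = 19.99
count = 3

total = f"{price * count:.2f}"     # '59.97': expression plus format spec
debug = f"{count=}"                # 'count=3'
padded = f"{'id':>6}|"             # '    id|': right-aligned in 6 columns
thousands = f"{1234567:,}"         # '1,234,567'
percent = f"{0.256:.1%}"           # '25.6%'
```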

When you'd still see `%` or `.format()`

%-formatting is still common in logging calls (logging.info("x=%s", x)), because the logging module only formats the string if the log level is enabled — deferring the cost — whereas an f-string is evaluated immediately regardless of whether the log line is emitted.
.format() is useful when the template string itself is not known until runtime (e.g., loaded from a config file or translation catalog), since f-strings must be literal source-code strings, not read from data.
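Both exceptions can be sketched together. The template string stands in for one loaded from a config file, and `Expensive` is a hypothetical object that counts how often it gets converted to a string; the logger name `"demo"` is arbitrary.

```python
import logging

# A template only known at runtime: str.format() works, an f-string cannot
template = "{name} scored {score:.1f}%"
message = template.format(name="Ada", score=97.456)    # 'Ada scored 97.5%'

class Expensive:
    calls = 0
    def __str__(self):
        Expensive.calls += 1
        return "expensive"

logger = logging.getLogger("demo")
logger.setLevel(logging.WARNING)             # DEBUG messages are disabled

logger.debug("value=%s", Expensive())        # never formatted: __str__ is not called
calls_after_debug = Expensive.calls          # 0

eager = f"value={Expensive()}"               # formatted immediately
calls_after_fstring = Expensive.calls        # 1
```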

Interview-ready summary: f-strings are the modern default — inline expressions, a debug = specifier, and better performance than % or .format(). The main exception is logging calls, where lazy %-style formatting avoids the cost of building a string that might never be emitted.

Related Resources

f-strings — Python docs


Names are labels on objects, not boxes

x = [1, 2, 3]
y = x            # y now points at the SAME list object as x

y.append(4)
x                # [1, 2, 3, 4]  -- x sees the mutation too

y = [9, 9]       # rebinds y to a NEW list; x is untouched
x                # still [1, 2, 3, 4]

Think of x and y as sticky notes pointing at objects in memory, not as separate storage slots. y = x copies the pointer, not the object. Mutating the object through any name affects every name pointing at it; reassigning a name just moves that one sticky note elsewhere.
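When an independent object is actually wanted, the copy has to be requested explicitly. The following sketch contrasts an alias, a shallow copy, and a deep copy using the standard-library `copy` module:

```python
import copy

x = [[1, 2], [3]]
alias = x                   # same object, no copy
shallow = x.copy()          # new outer list, shared inner lists
deep = copy.deepcopy(x)     # new outer list and new inner lists

x.append([4])               # changes the outer list
x[0].append(99)             # changes a shared inner list

# alias   -> [[1, 2, 99], [3], [4]]
# shallow -> [[1, 2, 99], [3]]
# deep    -> [[1, 2], [3]]
```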

Function arguments follow the same rule ("pass by object reference")

def add_item(lst):
    lst.append("x")       # mutates the caller's list — visible outside

def replace(lst):
    lst = ["new"]         # rebinds the LOCAL name lst; caller's list unaffected

data = [1, 2]
add_item(data)
data          # [1, 2, 'x']

replace(data)
data          # [1, 2, 'x']  -- unchanged; replace() only rebound its own local name

Python is neither "pass by value" nor "pass by reference" in the C++ sense — it's pass by object reference (sometimes called "call by sharing"): the function gets its own local name bound to the same object the caller passed. Mutating that object is visible to the caller; rebinding the local name is not.

Why `id()` and `is` make this concrete

x = [1, 2]
y = x
id(x) == id(y)   # True -- literally the same object
x is y            # True -- same thing, expressed with the `is` operator

id() returns the object's memory address (in CPython); two names with the same id() are the same object, and mutating through one is always visible through the other.

Interview-ready summary: Assignment binds a name to an object; it never copies the object. Multiple names can reference the same object, so mutating through one name is visible through all of them, while reassigning a name only changes what that one name points to. Function calls pass objects by reference-sharing: mutation is visible to the caller, rebinding the parameter name is not.

Related Resources

Data model — Python docs


The problem it solves: avoiding duplicate computation

# Without walrus -- the call has to be written twice: once before the loop, once at the end of the body
data = get_next_chunk()
while data:
    process(data)
    data = get_next_chunk()

# With walrus -- compute once, inline in the condition
while (data := get_next_chunk()):
    process(data)
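
A runnable version of the same loop shape, offered as a sketch: io.StringIO stands in for a real file or socket, and the chunk size of 4 is arbitrary.

```python
import io

stream = io.StringIO("abcdefghij")   # stand-in for a file-like data source
chunks = []

# read(4) returns "" at end of stream; the empty string is falsy and ends the loop
while chunk := stream.read(4):
    chunks.append(chunk)

print(chunks)   # ['abcd', 'efgh', 'ij']
```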

Common use cases

In if conditions, to avoid computing something twice:

if (match := re.search(pattern, text)):
    print(match.group())

Without :=, you'd write match = re.search(...) on its own line, then if match: — walrus collapses that into one line while keeping match available in the following block.
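
A self-contained variant of the snippet above, with an illustrative pattern and sample text (both are assumptions made up for this sketch):

```python
import re

pattern = r"\d{4}-\d{2}-\d{2}"            # hypothetical pattern: an ISO-style date
text = "Build finished on 2024-01-31."    # hypothetical input

if (match := re.search(pattern, text)):
    found = match.group()
else:
    found = None

print(found)   # 2024-01-31
```

The name match is bound in the surrounding scope, so it remains available after the if block as well (it is None when nothing matched).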

In comprehensions, to avoid recomputing an expensive filter/transform:

results = [y for x in data if (y := expensive(x)) is not None]

Without walrus, expensive(x) would need to be called twice — once for the filter, once for the output expression — or the comprehension would need to be restructured into a loop.
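
The difference in call counts can be checked directly. In this sketch, expensive is a hypothetical stand-in that records each call in call_log and returns None for odd inputs.

```python
call_log = []

def expensive(x):
    """Hypothetical costly transform: returns None for odd inputs."""
    call_log.append(x)
    return x * 10 if x % 2 == 0 else None

data = [1, 2, 3, 4]

# Without walrus: expensive() runs again for every item that passes the filter
twice = [expensive(x) for x in data if expensive(x) is not None]
calls_without = len(call_log)

call_log.clear()

# With walrus: exactly one call per item
once = [y for x in data if (y := expensive(x)) is not None]
calls_with = len(call_log)

print(twice, calls_without)   # [20, 40] 6
print(once, calls_with)       # [20, 40] 4
```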

Syntax rules and gotchas

Parentheses are optional when := is the entire condition (while data := f(): is valid), but required as soon as it is part of a larger expression, because := binds less tightly than every other operator except the comma: write if (n := len(items)) > 10:, since if n := len(items) > 10: would bind n to the boolean result of the comparison.
It's an expression, not a statement: a bare x := 5 on its own line is a SyntaxError, while (x := 5) and print(x := 5) are fine.
It does not create a new scope: the target is bound in the current scope, exactly like a regular assignment. The one special case is a comprehension, where the walrus target is bound in the enclosing function or module scope rather than in the comprehension's own scope (a deliberate PEP 572 design choice), whereas the loop variable stays private to the comprehension.
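
Two of these rules are easy to verify in a few lines. This is a sketch with made-up names (values, n, last), covering the precedence point and the comprehension scoping behavior.

```python
values = [3, 8, 15]

# Parentheses matter here: without them, n would be bound to the result
# of `len(values) > 2` (a bool) instead of the length.
if (n := len(values)) > 2:
    print(n)             # 3

# A walrus target inside a comprehension is bound in the enclosing scope...
squares = [last := v * v for v in values]
print(squares, last)     # [9, 64, 225] 225

# ...while the loop variable v stays private to the comprehension.
try:
    v
except NameError:
    loop_var_visible = False
else:
    loop_var_visible = True
print(loop_var_visible)  # False
```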

Interview-ready summary: := assigns and evaluates in one expression, primarily useful in while/if conditions and comprehensions to avoid calling the same expensive expression twice. It's a readability/efficiency tool, not a new capability — anything written with := can be rewritten with an extra assignment statement.

Related Resources

PEP 572 – Assignment Expressions

Python Fundamentals & the Data Model

What does "everything is an object" mean in Python, and why does it matter?

Everything is an object

Why it matters

Related Resources

What's the difference between `is` and `==`?

== vs is

Why a == b can be True while a is b is False

When to use is

The small-integer/string trap

Related Resources

What's the mutable default argument trap, and how do mutable vs immutable types cause it?

The trap

Why immutable defaults don't have this problem

The fix

The general lesson: mutable vs immutable

Related Resources

How does Python's LEGB scoping rule work?

The four scopes, in lookup order

The gotcha: assignment makes a name local for the whole function

Fixing it: global and nonlocal

Why this matters for closures

Related Resources

What are `*args` and `**kwargs`, and how do you use them together?

Collecting variable arguments

Forwarding arguments (the most common real use)

Combining with named and keyword-only parameters

Unpacking at the call site

Related Resources

What is duck typing, and how does `typing.Protocol` formalize it?

Duck typing in practice

The problem for static typing

Protocol: structural typing for static checkers

When to use which

Related Resources

What's the difference between `__str__` and `__repr__`?

Two different audiences

Defining both on a custom class

The fallback rule

Why containers always use repr

Related Resources

How does Python's import system work (modules, packages, `sys.path`)?

The lookup sequence

Packages vs modules

Absolute vs relative imports

Circular imports

Related Resources

What are dunder (magic) methods, and how do they enable operator overloading?

How operator syntax maps to method calls

Common categories

The __eq__/__hash__ contract

Why this is more than syntax sugar

Related Resources

How does slicing work, and what's the difference between slicing a list, a string, and using `slice()`?

The basic mechanics

list vs str slicing

The slice() object

Related Resources

What are f-strings, and how do they compare to `%`-formatting and `.format()`?

The three formatting styles

Why f-strings are generally preferred

When you'd still see % or .format()

Related Resources

How does variable assignment actually work in Python (references, not boxes)?

Names are labels on objects, not boxes

Function arguments follow the same rule ("pass by object reference")

Why id() and is make this concrete

Related Resources

What's the walrus operator (`:=`), and when should you use it?

The problem it solves: avoiding duplicate computation

Common use cases

Syntax rules and gotchas

Related Resources
