How are Python lists implemented, and what's the time complexity of common operations?

A `list` is a dynamic array (contiguous, resizable array of object pointers), not a linked list. Indexing (`lst[i]`) and appending (`lst.append`) are **O(1) amortized**; inserting/removing at the front or middle (`lst.insert(0, x)`, `lst.pop(0)`) is **O(n)** because every following element must shift; membership testing (`x in lst`) is **O(n)** since it scans linearly.

How does a Python `dict` work internally, and does it guarantee insertion order?

CPython's `dict` is a **hash table**: each key's hash determines a slot, with open addressing to resolve collisions, giving average **O(1)** lookup/insert/delete. Since Python 3.7, dicts are guaranteed (as a language spec, not just a CPython detail) to **preserve insertion order** — achieved internally by keeping a compact array of entries in insertion order alongside a sparser hash-indices array.

What makes an object hashable, and how does that relate to `__eq__`?

An object is hashable if it implements `__hash__` (returning a stable integer for its lifetime) and, if it implements `__eq__`, guarantees that **equal objects have equal hashes**. All immutable built-ins (`int`, `str`, `tuple` of hashables, `frozenset`) are hashable; mutable built-ins (`list`, `dict`, `set`) are not, since mutating them would change their hash — silently breaking dict/set invariants if they were used as keys.

When should you use a list, tuple, set, or dict?

**List**: ordered, mutable, allows duplicates — general-purpose sequence. **Tuple**: ordered, immutable — fixed-size records, safe as a dict key/set element. **Set**: unordered, unique elements, O(1) membership testing — deduplication and fast "contains" checks. **Dict**: key→value mapping, O(1) lookup by key, insertion-ordered — the default choice whenever you need to look things up by a name/id rather than by position.

How do comprehensions work, and when do they hurt readability or performance?

A comprehension (`[expr for x in iterable if cond]`) is syntax sugar for a `for` loop that builds a new list/set/dict, and it's implemented as its own **hidden function scope**, which is usually slightly faster than the equivalent explicit loop due to bytecode-level optimizations. They hurt readability once nested more than one level deep or once the expression/condition is complex — at that point, an explicit loop with named intermediate variables is clearer, not a performance sacrifice.

What are `defaultdict`, `Counter`, `deque`, and `namedtuple` used for?

`collections.defaultdict(factory)` auto-creates a default value for missing keys, removing manual `if key not in d` checks. `Counter` is a dict subclass specialized for counting hashable items, with `.most_common()`. `deque` is a double-ended queue with O(1) append/pop from **both ends** (unlike `list`, which is O(n) at the front). `namedtuple` creates lightweight, immutable, attribute-accessible tuple subclasses for simple records.

What's the difference between a shallow copy and a deep copy?

A **shallow copy** (`list(x)`, `x.copy()`, `copy.copy(x)`) creates a new outer container but reuses references to the *same* nested/inner objects — mutating a nested object through the copy affects the original too. A **deep copy** (`copy.deepcopy(x)`) recursively copies every nested object as well, so the copy is fully independent of the original, at the cost of more time and memory.

What's the difference between `list.sort()` and `sorted()`, and how do custom sort keys work?

`list.sort()` sorts a list **in place** and returns `None`; `sorted(iterable)` returns a **new** sorted list and works on any iterable, not just lists. Both accept a `key=` function (applied once per element to compute a sort key, not a full comparator) and a `reverse=True` flag, and both use Timsort — a stable, O(n log n) hybrid merge/insertion sort.

How does string interning affect performance and `is` comparisons?

CPython automatically **interns** (caches and reuses) certain strings — identifier-like literals (e.g., `"hello"`, variable names) known at compile time, and short strings composed only of letters/digits/underscores — so multiple occurrences of the same literal can share one object in memory, speeding up dict lookups keyed by those strings (interned string comparison can short-circuit to an identity check). It's a CPython optimization detail, not a language guarantee, so code should never rely on `is` for string equality.

Collections & Data Structures

List/dict/set internals and complexity, the collections module, hashability, copying, and sorting.

Difficulty

Open as page

Lists are dynamic arrays, not linked lists

CPython's list stores a contiguous array of pointers to objects, with some extra pre-allocated capacity to amortize the cost of growth. This is why indexing is O(1): lst[i] is just pointer arithmetic into the underlying array, not a traversal.

lst = [1, 2, 3]
lst[1]          # O(1) -- direct array index
lst.append(4)   # O(1) amortized -- usually just writes into pre-allocated space

Complexity cheat sheet

Operation	Complexity	Why
`lst[i]`, `lst[i] = x`	O(1)	direct array index
`lst.append(x)`	O(1) amortized	writes to spare capacity; occasional O(n) resize, amortized away
`lst.pop()` (from end)	O(1)	no shifting needed
`lst.pop(0)`, `lst.insert(0, x)`	O(n)	every remaining element shifts by one slot
`x in lst`	O(n)	linear scan, no index structure
`len(lst)`	O(1)	length is cached, not recomputed
`lst.sort()`	O(n log n)	Timsort
slicing `lst[a:b]`	O(k)	k = length of the slice, since a new list is built

Why `append` is amortized O(1)

When the underlying array runs out of spare capacity, CPython allocates a larger array (roughly growing by ~1.125x plus a constant) and copies existing elements over — an O(n) operation, but one that happens increasingly rarely as the list grows, so the average cost per append across many appends works out to O(1).

Why front-insertion/removal is expensive

lst = [1, 2, 3, 4, 5]
lst.pop(0)      # removes 1, then shifts 2,3,4,5 each one slot left -- O(n)
lst.insert(0, 0) # shifts every element right by one slot -- O(n)

If your workload needs frequent insert/pop from both ends, use collections.deque instead — it's implemented as a doubly-linked list of fixed-size blocks and supports O(1) append/pop from either end, at the cost of O(n) random access (deque[i] is O(n) for large i, unlike list).

Interview-ready summary: Python lists are dynamic arrays: O(1) indexing and end-append/pop, but O(n) for inserting/removing anywhere except the end, and O(n) membership testing. When you need efficient operations at both ends, reach for collections.deque instead of list.

Related Resources

TimeComplexity — Python wiki

Open as page

The hash table basics

d = {"a": 1, "b": 2}
d["a"]        # O(1) average -- hash("a") locates the slot directly
d["c"] = 3    # O(1) average insert

Looking up d["a"] computes hash("a"), uses it to find a candidate slot in the internal table, and (after resolving any hash collisions) returns the value — no scanning of all keys, unlike a list.

Insertion order guarantee (Python 3.7+)

d = {}
d["z"] = 1
d["a"] = 2
d["m"] = 3
list(d)   # ['z', 'a', 'm']  -- insertion order, guaranteed since 3.7

This was a CPython implementation detail in 3.6 and became an official language guarantee in 3.7. Internally, CPython separates "which slot does this hash map to" (a sparse array of indices) from "the actual key/value/ hash entries" (a dense array kept in insertion order) — iterating a dict walks the dense array, which is naturally in insertion order, while lookups still use the sparse hash-indexed array for O(1) access.

Why keys must be hashable

d = {[1, 2]: "x"}   # TypeError: unhashable type: 'list'

A dict key's hash is computed once and used to place it in a slot; mutating the key afterward (which is only possible for mutable objects) would silently break the invariant that a key's slot matches its current hash. That's why list/dict/set (all mutable) can't be dict keys, but tuple, str, int, and frozenset (all immutable, and hashable if their contents are) can.

Collision resolution: open addressing

Unlike some languages' hash maps (which chain multiple entries per bucket, e.g. a linked list per slot), CPython dicts use open addressing: on a collision, a perturbation-based probing sequence finds the next candidate slot. This keeps memory more compact and cache-friendly than chaining, at the cost of needing careful resizing (the table is resized — and re-hashed — well before it gets too full, keeping average lookup close to O(1)).

Worst case

Pathological hash collisions can degrade lookups toward O(n) in theory, but CPython uses SipHash for string hashing with a randomized seed per process (PYTHONHASHSEED) specifically to make it infeasible for an attacker to engineer such collisions deliberately (a real denial-of-service vector in older hash table implementations).

Interview-ready summary: dict is a hash table giving average O(1) lookup/insert/delete; since Python 3.7 it's spec-guaranteed to preserve insertion order, achieved by keeping entries in a dense, insertion-ordered array separate from the sparse hash-index table used for O(1) lookups. Keys must be hashable (and therefore effectively immutable) since a key's slot is determined by its hash at insertion time.

Related Resources

dict — Python docs

Open as page

What "hashable" requires

hash(42)            # works -- int is hashable
hash("abc")          # works -- str is hashable
hash((1, 2, 3))       # works -- tuple of hashables is hashable
hash([1, 2, 3])       # TypeError: unhashable type: 'list'
hash({1, 2})          # TypeError: unhashable type: 'set'
hash(frozenset({1,2})) # works -- frozenset (immutable set) is hashable

Hashability requires: (1) a __hash__ method that returns the same integer every time for a given object's lifetime, and (2) if __eq__ is defined, a == b implies hash(a) == hash(b) — this is required for correct dict/set behavior (two "equal" keys must land findable in the same bucket).

Why mutable containers are unhashable

lst = [1, 2, 3]
d = {lst: "value"}   # TypeError -- if this worked...

lst.append(4)         # ...and this mutated the key after insertion,
d[lst]                 # the dict's internal slot (based on the OLD hash) would be wrong

If list were hashable and its hash were based on contents, mutating a list already used as a dict key would silently corrupt the dict's internal structure (the key's slot no longer matches its current hash). Python sidesteps this entirely by making mutable containers unhashable.

Custom classes: hashable by default (via identity)

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y

p = Point(1, 2)
hash(p)   # works! -- default __hash__ is based on id(), inherited from object

By default, custom classes inherit object.__hash__, based on identity (id()) — two distinct Point(1, 2) instances hash differently even though they'd naturally seem "equal." This is fine until you also define __eq__ for value-based equality:

class Point:
    def __init__(self, x, y):
        self.x, self.y = x, y
    def __eq__(self, other):
        return isinstance(other, Point) and (self.x, self.y) == (other.x, other.y)

hash(Point(1, 2))   # TypeError: unhashable type: 'Point'

Defining __eq__ makes Python set __hash__ = None automatically, because the identity-based default hash would now violate the "equal implies equal hash" contract. Fix by defining a consistent __hash__:

    def __hash__(self):
        return hash((self.x, self.y))

Interview-ready summary: Hashable means "has a stable __hash__, and if __eq__ is defined, equal objects hash equally." Built-in mutable containers are unhashable by design (mutation would corrupt any dict/set using them as keys); custom classes are hashable by identity by default, but overriding __eq__ disables that default hash until you provide a matching __hash__ explicitly.

Related Resources

object.__hash__ — Python docs

Open as page

Quick decision table

Need	Structure	Why
Ordered, mutable collection, duplicates OK	`list`	general-purpose sequence
Fixed-size, immutable record (`(x, y)`, `(name, age)`)	`tuple`	safe to hash, signals "this won't change"
Fast membership testing, deduplication	`set`	O(1) average `in`, automatic uniqueness
Lookup by key/name	`dict`	O(1) average lookup by key, not position

Concrete examples

# list -- ordered sequence of items, order and duplicates matter
scores = [85, 92, 85, 78]

# tuple -- an immutable, fixed-shape record
point = (3, 4)
person = ("Ada", 36)

# set -- membership and uniqueness, order doesn't matter
seen_ids = {101, 205, 310}
if user_id in seen_ids:   # O(1) average -- much faster than `in a_list` for large data
    ...
unique_tags = set(all_tags)  # dedupe in one line

# dict -- look things up by key
users_by_id = {101: "Ada", 205: "Grace"}
users_by_id[101]   # O(1) average

Why `x in set` beats `x in list` at scale

big_list = list(range(1_000_000))
big_set = set(big_list)

999_999 in big_list   # O(n) -- scans up to a million elements
999_999 in big_set     # O(1) average -- direct hash lookup

For any workload doing repeated membership checks against a large collection, converting to a set (or using a dict if you also need associated values) is one of the cheapest, highest-impact optimizations available.

Tuple vs list: signaling intent, not just performance

def get_coordinates():
    return (self.x, self.y)   # a tuple signals "this is a fixed 2-item record"

Beyond being hashable (usable as dict keys/set elements) and slightly more memory-efficient, using a tuple for a fixed-shape value communicates to readers that the shape — not just the values — is meant to be fixed: nobody should expect to .append() to it.

Interview-ready summary: Reach for list for ordered, mutable sequences; tuple for fixed-shape, immutable records (and anything you need to hash); set for uniqueness/fast membership testing; dict whenever you look things up by key rather than by position. The performance difference between O(n) list scans and O(1) set/dict lookups is often the single biggest algorithmic win available in everyday code.

Related Resources

Data structures — Python tutorial

Open as page

The three comprehension forms

squares = [x * x for x in range(10)]                    # list
evens = {x for x in range(10) if x % 2 == 0}             # set
lookup = {x: x * x for x in range(10)}                   # dict
gen = (x * x for x in range(10))                          # generator expression (lazy!)

Each desugars roughly to a loop that appends/adds/assigns into a new container (except the generator expression, which produces a lazy iterator instead of eagerly building a container — see the iterators/generators topic).

Why they're often faster than an explicit loop

# Comprehension
squares = [x * x for x in range(1000)]

# Equivalent explicit loop
squares = []
for x in range(1000):
    squares.append(x * x)

Both do the same logical work, but the comprehension compiles to specialized bytecode (LIST_APPEND inside an isolated function frame) that avoids repeated attribute lookups (squares.append) on every iteration — in practice this is a modest, not dramatic, speedup, so "use comprehensions for performance" is a secondary benefit, not the main reason to reach for them.

When they hurt readability

# Hard to read -- nested comprehension with a condition
result = [item.strip().lower() for sublist in data
          for item in sublist if item and not item.startswith("#")]

# Clearer as an explicit loop
result = []
for sublist in data:
    for item in sublist:
        if item and not item.startswith("#"):
            result.append(item.strip().lower())

A comprehension nested two or more levels deep, or one combining multiple if conditions and a non-trivial expression, usually reads worse than the equivalent loop — there's no room for named intermediate variables or comments explaining a non-obvious filter, and reviewers have to mentally un-nest the comprehension to understand execution order (which, notably, goes left-to-right the way the for/if clauses are written, not inside- out).

A good rule of thumb

If a comprehension needs more than one for clause or more than one condition to express the logic, or if the transformation expression itself needs a helper function to stay readable, switch to an explicit loop (or a generator function with clear intermediate steps).

Interview-ready summary: Comprehensions are syntax sugar for building a list/set/dict via a loop, and are typically a bit faster due to specialized bytecode — but that's secondary to their real value: concise, readable code for a simple transform-and-filter. Once nesting or conditions pile up, prefer an explicit loop over a comprehension that's technically correct but hard to read.

Related Resources

List comprehensions — Python tutorial

Open as page

`defaultdict`: no more `if key not in d`

from collections import defaultdict

groups = defaultdict(list)
for name in ["ada", "amy", "bob", "ben"]:
    groups[name[0]].append(name)
# groups = {'a': ['ada', 'amy'], 'b': ['bob', 'ben']}

Without defaultdict, every append needs a manual check: if name[0] not in groups: groups[name[0]] = []. defaultdict(list) calls list() automatically the first time a missing key is accessed, eliminating that boilerplate.

`Counter`: counting made trivial

from collections import Counter

words = "the quick brown fox the lazy dog the".split()
counts = Counter(words)
counts["the"]              # 3
counts.most_common(2)       # [('the', 3), ('quick', 1)] -- ties broken by insertion order
counts + Counter(["fox"])   # supports arithmetic between Counters

Counter is a dict subclass where missing keys default to 0 (instead of raising KeyError), plus convenience methods like .most_common() and multiset-style arithmetic (+, -, &, |).

`deque`: O(1) at both ends

from collections import deque

dq = deque([1, 2, 3])
dq.appendleft(0)    # O(1) -- list.insert(0, x) would be O(n)
dq.append(4)         # O(1)
dq.popleft()          # O(1) -- list.pop(0) would be O(n)
dq = deque(maxlen=3)  # bounded deque -- great for "last N items" buffers

deque is implemented as a doubly-linked list of fixed-size blocks, so both ends support O(1) operations — the natural choice for queues, sliding windows, and BFS traversal, where list.pop(0)/insert(0, x) would be a performance trap (O(n) each).

`namedtuple`: lightweight, immutable records

from collections import namedtuple

Point = namedtuple("Point", ["x", "y"])
p = Point(1, 2)
p.x, p.y      # 1, 2 -- attribute access
p[0], p[1]    # 1, 2 -- still a tuple, so positional access works too
p == Point(1, 2)   # True -- structural equality, generated automatically

namedtuple generates a tuple subclass with named fields — you get attribute access and tuple behavior (unpacking, indexing, hashability), at essentially zero extra memory over a plain tuple. It's the natural choice before reaching for a full @dataclass when the type is small, immutable, and truly tuple-like.

Interview-ready summary: defaultdict removes manual missing-key checks, Counter is a purpose-built counting dict, deque gives O(1) operations at both ends (unlike list's O(n) front operations), and namedtuple gives cheap, immutable, attribute-accessible records — each solves a specific, common gap left by the plain list/dict/tuple built-ins.

Related Resources

collections — Python docs

Open as page

Shallow copy: new outer container, shared inner objects

import copy

original = [[1, 2], [3, 4]]
shallow = copy.copy(original)          # or: original[:] / list(original)

shallow.append([5, 6])                  # doesn't affect original -- outer list is new
original                                  # [[1, 2], [3, 4]]

shallow[0].append(99)                    # mutates the SHARED inner list!
original                                  # [[1, 2, 99], [3, 4]]  -- original changed too!

shallow is a genuinely new list object, but its elements are the same inner list objects as original's — appending to the outer copy doesn't touch the original, but mutating a shared inner list does, since both original[0] and shallow[0] point at the identical object.

Deep copy: fully independent

deep = copy.deepcopy(original)
deep[0].append(100)
original   # unaffected -- deep copy recursively copied every nested list too

copy.deepcopy recursively copies every object reachable from the top, building an entirely independent structure — safe to mutate at any depth without affecting the original, at the cost of recursively copying (and therefore more time/memory, and needing to handle cycles, which deepcopy does via a memo dict to avoid infinite recursion).

Which one for which types

d = {"a": [1, 2]}
d.copy()          # shallow -- new dict, same inner list object
copy.deepcopy(d)   # deep -- new dict AND a new inner list

t = (1, [2, 3])
copy.copy(t)        # shallow -- new tuple, same inner list

Most built-in containers offer a .copy() method (or slicing [:]) that performs a shallow copy; there's no built-in shortcut for a deep copy — copy.deepcopy is always the tool for that.

When shallow is fine, and when it isn't

Shallow copy is fine (and cheaper) when the container only holds immutable elements (int, str, tuple of immutables) — there's no "shared inner object" risk because nothing can mutate them in place. It becomes a real bug source specifically when elements are themselves mutable (nested lists/dicts/objects) and you need the copy to be fully independent.

Interview-ready summary: Shallow copy duplicates only the top-level container; nested mutable objects are still shared with the original. Deep copy recursively duplicates everything reachable, giving full independence at higher cost. Reach for shallow copy (or plain slicing) when contents are immutable or sharing is intentional; reach for deepcopy when you need a completely independent structure.

Related Resources

copy — Python docs

Open as page

In-place vs new list

nums = [3, 1, 2]
nums.sort()          # sorts nums in place
nums                  # [1, 2, 3]
result = nums.sort()  # None -- sort() returns None on purpose

nums = [3, 1, 2]
new_list = sorted(nums)   # returns a new sorted list
nums                        # [3, 1, 2] -- unchanged
sorted("cba")                # ['a', 'b', 'c'] -- works on any iterable, not just lists

sort() returning None is deliberate (a Python convention: mutating methods return None so you can't accidentally chain them and think you're working with a new object), which is why x = lst.sort() is a common beginner bug — x ends up None, not the sorted list.

`key=`: computed sort key, not a comparator

people = [{"name": "Ada", "age": 36}, {"name": "Bob", "age": 25}]
sorted(people, key=lambda p: p["age"])
# sorted by age ascending

sorted(people, key=lambda p: p["name"].lower())   # case-insensitive sort

words = ["banana", "kiwi", "apple", "fig"]
sorted(words, key=len)          # sort by string length
sorted(words, key=len, reverse=True)   # longest first

key is called once per element to compute a value used for comparison — this is more efficient than an older-style comparator function (called O(n log n) times, once per comparison) and is the only sorting customization mechanism in modern Python (cmp= was removed in Python 3).

Multi-key sorting via tuples

people = [{"name": "Ada", "age": 36}, {"name": "Bob", "age": 36}, {"name": "Amy", "age": 25}]
sorted(people, key=lambda p: (p["age"], p["name"]))
# sorted by age first, then name as a tiebreaker

Returning a tuple from key sorts by the first element, breaking ties with the second, and so on — the standard idiom for multi-field sorting.

Stability matters for exactly this reason

Both sort() and sorted() use Timsort, which is stable: elements that compare equal keep their original relative order. This is what makes it safe to sort by one key, then sort again by another key, to achieve a multi-level sort without needing a single combined key tuple — though the tuple-key approach above is usually clearer for a fixed set of sort fields.

Interview-ready summary: sort() mutates in place and returns None; sorted() returns a new list and accepts any iterable. Both use the stable, O(n log n) Timsort algorithm and support key= (computed once per element) for custom ordering, including multi-field sorts via tuple keys.

Related Resources

Sorting HOW TO — Python docs

Open as page

Interning in action

a = "hello"
b = "hello"
a is b   # True on CPython -- both literals interned to the same object

c = "hello world!"
d = "hello world!"
c is d   # often False -- strings with spaces/punctuation aren't auto-interned

e = "hello" + " world!"   # built at runtime -- typically NOT interned
e is "hello world!"        # unreliable -- don't rely on this

CPython auto-interns string and code-object literals that look like identifiers (letters, digits, underscores) and are known at compile time — this includes most variable names, dict keys defined as literals, and short simple string constants. Strings built dynamically at runtime (concatenation, .format(), f-strings, user input) are generally not automatically interned.

Why this exists: speeding up dict/attribute lookups

Python internally uses dicts extensively (every object's __dict__, every module's namespace, every function's local variables in some representations). If two occurrences of the string "name" used as a dict key are the same interned object, a hash-table lookup can first try a fast identity check (is) before falling back to a full __eq__ comparison — since attribute names repeat constantly across a program, interning meaningfully speeds up this extremely common path.

`sys.intern()`: forcing it explicitly

import sys

a = sys.intern("some repeated string")
b = sys.intern("some repeated string")
a is b   # True -- explicitly interned

If your program builds many dynamic strings that are frequently repeated and compared/used as dict keys (e.g., parsing a large file with many repeated tokens), explicitly interning them can meaningfully reduce memory (many duplicate strings collapse to one object) and speed up comparisons.

The crucial caveat: never rely on `is` for string equality

def check(x):
    if x is "yes":   # BUG -- works by luck sometimes, breaks other times
        ...

def check(x):
    if x == "yes":    # correct -- always compares by value
        ...

Interning is a CPython implementation detail that can vary between Python versions, between CPython and other implementations (PyPy, etc.), and even between how a string was constructed. Modern CPython actually raises a SyntaxWarning for is used with string/int literals specifically because of this trap — always use == for value comparison.

Interview-ready summary: CPython interns many string literals to speed up dict/attribute lookups via cheap identity checks, but this is an implementation detail, not a language guarantee — always compare strings with ==, and reach for sys.intern() explicitly only when you've measured a real memory/comparison benefit from deduplicating many repeated dynamic strings.

Related Resources

sys.intern — Python docs

Collections & Data Structures

How are Python lists implemented, and what's the time complexity of common operations?

Lists are dynamic arrays, not linked lists

Complexity cheat sheet

Why append is amortized O(1)

Why front-insertion/removal is expensive

Related Resources

How does a Python `dict` work internally, and does it guarantee insertion order?

The hash table basics

Insertion order guarantee (Python 3.7+)

Why keys must be hashable

Collision resolution: open addressing

Worst case

Related Resources

What makes an object hashable, and how does that relate to `__eq__`?

What "hashable" requires

Why mutable containers are unhashable

Custom classes: hashable by default (via identity)

Related Resources

When should you use a list, tuple, set, or dict?

Quick decision table

Concrete examples

Why x in set beats x in list at scale

Tuple vs list: signaling intent, not just performance

Related Resources

How do comprehensions work, and when do they hurt readability or performance?

The three comprehension forms

Why they're often faster than an explicit loop

When they hurt readability

A good rule of thumb

Related Resources

What are `defaultdict`, `Counter`, `deque`, and `namedtuple` used for?

defaultdict: no more if key not in d

Counter: counting made trivial

deque: O(1) at both ends

namedtuple: lightweight, immutable records

Related Resources

What's the difference between a shallow copy and a deep copy?

Shallow copy: new outer container, shared inner objects

Deep copy: fully independent

Which one for which types

When shallow is fine, and when it isn't

Related Resources

What's the difference between `list.sort()` and `sorted()`, and how do custom sort keys work?

In-place vs new list

key=: computed sort key, not a comparator

Multi-key sorting via tuples

Stability matters for exactly this reason

Related Resources

How does string interning affect performance and `is` comparisons?

Interning in action

Why this exists: speeding up dict/attribute lookups

sys.intern(): forcing it explicitly

The crucial caveat: never rely on is for string equality

Related Resources

Why `append` is amortized O(1)

What makes an object hashable, and how does that relate to `eq`?

Why `x in set` beats `x in list` at scale

`defaultdict`: no more `if key not in d`

`Counter`: counting made trivial

`deque`: O(1) at both ends

`namedtuple`: lightweight, immutable records

`key=`: computed sort key, not a comparator

`sys.intern()`: forcing it explicitly

The crucial caveat: never rely on `is` for string equality