Question 1

What is Spring Data JPA, and how do repository interfaces like JpaRepository work without an implementation?

Accepted Answer

Spring Data JPA is a layer on top of JPA/Hibernate that eliminates most boilerplate data-access code: you declare a repository as a plain interface extending JpaRepository<Entity, IdType>, and Spring Data generates a working implementation at runtime via a dynamic proxy, backed by a SimpleJpaRepository instance that implements the common CRUD operations using the underlying JPA EntityManager. You never write an implementation class yourself — Spring Data creates one automatically when the ApplicationContext starts.

Question 2

How do derived query methods work (e.g., findByLastNameAndAge)?

Accepted Answer

Spring Data JPA parses a repository method's name at startup, splitting it into keywords (findBy, And, OrderBy, ...) and property names matching the entity's fields, then automatically generates the corresponding JPQL query — so a method named findByLastNameAndAge(String lastName, int age) is translated into a query filtering by both properties, with no query string or annotation required at all.

Question 3

What is the difference between @Query and derived query methods?

Accepted Answer

Derived query methods generate a query automatically from the method's name, which is concise for simple filters but becomes unreadable for complex conditions and can't express arbitrary joins/aggregations easily. @Query lets you write an explicit JPQL (or native SQL, via nativeQuery = true) query string directly on the repository method, giving full control over the exact query — necessary for anything beyond straightforward property-based filtering, like custom joins, aggregate functions, or database-specific SQL features.

Question 4

Explain @Transactional — propagation, isolation, and common pitfalls (like calling a @Transactional method from within the same class).

Accepted Answer

@Transactional wraps a method's execution in a database transaction — committing on normal completion, rolling back on an unchecked exception (checked exceptions don't trigger rollback by default). Propagation controls how a transactional method behaves when called from within an existing transaction (REQUIRED joins it, the default; REQUIRES_NEW suspends it and starts a fresh one); isolation controls how concurrent transactions see each other's uncommitted/committed changes. Because @Transactional is implemented via an AOP proxy, calling a @Transactional method on 'this' from another method in the same class bypasses the proxy entirely, silently skipping the transaction.

Question 5

What is the N+1 select problem, and how do you solve it in Spring Data JPA?

Accepted Answer

The N+1 problem occurs when fetching a list of N parent entities triggers one query for the parents, then N additional queries — one per parent — to lazily fetch each one's related child collection, instead of a single, efficient join. It's solved by fetching related data eagerly in the same query where needed: a JPQL JOIN FETCH clause, a repository method annotated with @EntityGraph, or (for simpler cases) switching the relationship's fetch type, combined with enabling Hibernate's SQL statement logging in development to actually notice the problem.

Question 6

What is the difference between lazy and eager fetching, and what is LazyInitializationException?

Accepted Answer

Eager fetching (FetchType.EAGER) loads an association immediately as part of the owning entity's own query, always available but at the cost of potentially fetching data that isn't needed. Lazy fetching (FetchType.LAZY) defers loading an association until it's actually accessed in code, which is more efficient by default but requires that access to happen while a persistence context (session) is still open — accessing a lazy association after the session has closed (e.g., after a @Transactional method has returned) throws LazyInitializationException.

Question 7

How does pagination and sorting work with Pageable and Page?

Accepted Answer

A repository method that accepts a Pageable parameter (typically built via PageRequest.of(pageNumber, pageSize, sort)) automatically applies the corresponding LIMIT/OFFSET and ORDER BY to its query, and returning a Page (rather than a plain List) additionally gives you the total element/page count, computed via an extra count query Spring Data generates automatically. Sort can be composed independently or combined into the Pageable itself, letting a single repository method support arbitrary client-driven paging and ordering without custom query code.

Question 8

What is entity auditing in Spring Data (@CreatedDate, @LastModifiedBy, etc.)?

Accepted Answer

Spring Data JPA auditing automatically populates fields like createdDate, lastModifiedDate, createdBy, and lastModifiedBy on an entity whenever it's persisted or updated, without any manual code in the service layer. It requires annotating the relevant fields (@CreatedDate, @LastModifiedDate, @CreatedBy, @LastModifiedBy), enabling auditing via @EnableJpaAuditing, adding @EntityListeners(AuditingEntityListener.class) to the entity, and — for the *By fields — providing an AuditorAware bean that supplies the current user identity.

Question 9

How does connection pooling work in Spring Boot (HikariCP), and why does it matter?

Accepted Answer

Opening a new database connection is relatively expensive (TCP handshake, authentication, session setup), so a connection pool keeps a set of already-open connections ready to reuse across requests instead of opening/closing one per query. Spring Boot uses HikariCP as its default connection pool (auto-configured whenever a DataSource is needed), tunable via spring.datasource.hikari.* properties like maximum-pool-size and connection-timeout — sizing the pool correctly (not too small, causing threads to wait for a connection; not needlessly large, overwhelming the database) is a common, genuinely impactful production tuning concern.

Question 10

What role do Flyway/Liquibase play in a Spring Boot application, and why prefer them over Hibernate's ddl-auto in production?

Accepted Answer

Flyway and Liquibase are database migration tools: you write versioned migration scripts (SQL for Flyway, SQL/XML/YAML/JSON for Liquibase), and the tool tracks which migrations have already been applied to a given database, running only the new ones in a controlled, ordered, repeatable way — integrated into Spring Boot so migrations run automatically on application startup. This is strongly preferred over Hibernate's spring.jpa.hibernate.ddl-auto=update in production because ddl-auto's schema inference is unpredictable, can silently make destructive or unintended changes, and provides no history, rollback path, or review step for schema changes the way an explicit, version-controlled migration script does.

Spring Data & Persistence

What is Spring Data JPA, and how do repository interfaces like JpaRepository work without an implementation?

Related Resources

How do derived query methods work (e.g., findByLastNameAndAge)?

What is the difference between @Query and derived query methods?

Explain @Transactional — propagation, isolation, and common pitfalls (like calling a @Transactional method from within the same class).

Related Resources

What is the N+1 select problem, and how do you solve it in Spring Data JPA?

What is the difference between lazy and eager fetching, and what is LazyInitializationException?

How does pagination and sorting work with Pageable and Page?

What is entity auditing in Spring Data (@CreatedDate, @LastModifiedBy, etc.)?

How does connection pooling work in Spring Boot (HikariCP), and why does it matter?

Related Resources

What role do Flyway/Liquibase play in a Spring Boot application, and why prefer them over Hibernate's ddl-auto in production?