Test-driven development (TDD)

Test-driven development (TDD) is a software development practice in which automated tests are written before the application or library code that needs to be tested. TDD flips the traditional development workflow and uses testing as a design tool as well as a validation tool.

TDD is attributed to Kent Beck, who describes it as a rediscovery rather than an invention. An ancient book on programming described a method of writing expected output before programming until the actual output matched. Beck formalized the practice while building the first xUnit framework in Smalltalk, then popularized it as a core discipline of eXtreme Programming (XP) in the late 1990s. His book Test Driven Development: By Example (2002) remains the canonical reference.

The red-green-refactor cycle

A TDD workflow typically follows these steps:

From a stable revision, with the build and all automated tests passing, write one or more new tests for a small piece of new functionality, or modify existing tests to reflect a change in requirements in existing behavior.
Run all the tests. The new or modified tests are expected to fail, because the system-under-test has not yet been implemented or modified to meet the new requirements. This is often referred to as the "red" phase of TDD.
Update the application logic and configuration, just enough so the tests pass. Don’t add any extra functionality at this stage.
Optionally, refactor to improve the internal quality of the program, while maintaining a green build (compilation and tests passing).
Commit the changes in a single atomic, stable commit.

Repeat with the next, small incremental change.

This loop is known as the red-green-refactor cycle. The colors correspond to typical test-runner output. Failing tests are shown in red, passing tests in green. In TDD, it is common to write tests that cover edge cases and error conditions in addition to the happy path. A useful variation is test-driven debugging, in which defects are resolved by first writing a failing test that reproduces the bug, then fixing the code until the test passes.

Test isolation

Unit tests in a TDD cycle are designed to run fast and in isolation from external dependencies such as databases, file systems, and network services. When the code under test has such dependencies, TDD practitioners use test doubles – stubs, fakes, and mocks – to replace them. This keeps the unit test suite independently executable without environmental setup, and ensures failures are localized to the unit being tested rather than a downstream dependency.

Because test doubles do not prove the connection to real external components, unit tests are supplemented with integration and end-to-end tests at appropriate levels. The convention is to maximize coverage at the unit level while keeping the slower, more expensive higher-level tests to a minimum.

Test structure

Well-structured unit tests follow a consistent pattern: setup, execution, validation, and cleanup. In the setup phase the system-under-test is initialized to a known state. The execution phase triggers the behavior being tested and captures outputs. The validation phase asserts that the outputs or state changes are correct. The cleanup phase, when necessary, restores the pre-test state so other tests can run independently. This pattern is also described by the mnemonic Arrange, Act, Assert (AAA).

Anti-patterns

Common test anti-patterns to avoid:

Tests that depend on the state left by previously executed tests. Tests should always start from a known, pre-configured state.
Interdependent tests that cause cascading failures. If one test fails, subsequent tests fail even if there is no actual defect in the code under test.
Testing implementation details rather than observable behavior, which makes tests fragile and expensive to maintain during refactoring.
Slow-running tests, which discourage frequent execution and undermine the fast-feedback principle of TDD.
"All-knowing oracles": assertions that inspect far more state than is necessary to validate the behavior under test.

Benefits

TDD is not synonymous with automated testing, but rather it is a specific approach to implementing and maintaining automated tests. TDD shifts left the testing phase to before the coding phase. Tests drive the development, rather than being an afterthought or a separate phase that occurs long after the code is written. It deliberately creates just enough friction to make the design, development, and testing process more thoughtful and deliberate.

Writing tests first requires programmers to think about how to implement changes in a testable way. Testable code tends to be inherently more modular and easier to maintain than code that is not designed for automated testing.

Advocates of TDD argue that it leads to:

Better design and higher code quality, due to enforcement of modularity and well-defined interfaces.
Fewer defects, because testing is shifted left, meaning errors are caught early in the development lifecycle.
Confident refactoring, as you end up with comprehensive test coverage that makes it much easier to make changes with minimized risk of breaking existing functionality.

Criticisms

However, TDD is not universally practiced or accepted. Criticisms of TDD include that it is not suitable for all types of development tasks, such as the rapid development of prototypes and proofs-of-concept, or other disposable code. It can also be challenging to retrofit tests into legacy systems that were not designed for automated testing.

Many times it is easier to experiment with code structure, data structures, and interfaces, perhaps using temporary branches in source control, rather than using tests to drive the design. It is not always necessary or desirable to write failing tests before writing or changing code.

TDD can also over-emphasize bottom-up design. It puts focus on small increments of isolated units of code, rather than on the high-level design and maintaining conceptual integrity of the system design as a whole. This can lead to situations where individual components are well-made but which do not integrate well into a cohesive whole.

In addition, TDD tends to produce lots of low-level unit tests, which can make large-scale refactoring harder, not easier. Refactoring is easier, generally, when you err on the side of integration and system/e2e tests.

Some argue that TDD can lead to poor design decisions. For example, TDD encourages the creation of interfaces that are easy to test, rather than interfaces that are easy to use by client program code. Real-world use cases ought to drive the design of interfaces, not automated tests.

There is no silver bullet in software development. TDD is a practice that can be useful in many situations, but programming style is a personal thing.

Acceptance test-driven development (ATDD)

Acceptance test-driven development (ATDD) scales the test-first discipline up to the requirements level. Where TDD uses unit tests to drive the implementation of small units of code, ATDD uses acceptance tests – written in business domain terms – to drive the development of whole features or user stories.

Acceptance tests are created collaboratively by the requirement requester (product owner, business analyst, or customer representative), the developer, and the tester before coding begins. This collaboration is sometimes called a specification workshop, or the three amigos meeting. The practice is closely related to behavior-driven development (BDD). Both ATDD and BDD emphasize shared, structured specifications. BDD focuses specifically on the behavioral language of the tests. Tests and requirements are tightly interrelated. A requirement without an acceptance test may not be implemented correctly, and a test without a corresponding requirement is unnecessary.

Acceptance tests are written using a structured format that mirrors the Given-When-Then pattern used in Gherkin and BDD tools:

Given: The initial state of the system.
When: The action or event that occurs.
Then: The expected resulting state or output.

The tests operate at the external, visible-behavior level of the system, independently of implementation. Failing acceptance tests provide rapid feedback that a requirement is not being met.

References

Test-driven development, Martin Fowler
Canon TDD, Kent Beck