
@jordanorelli
Created April 27, 2021 17:07
on integration testing

when people say "integration testing", the feeling I get is that most people mean "unit tests that happen to perform i/o". is that the definition most people are using?

there's another definition, which is the one I learned when I first learned about unit testing, and which I have never seen anyone actually use: a unit test is an individual unit of testing, and "integration testing" is when you sequence the unit tests to create an integrated suite of tests. that is ... integration testing is when you integrate your unit tests, not when you test how your system integrates with another system. Those are distinct concepts! My suggestion here is not that the latter concept isn't valuable; it is valuable, it's just distinct, and I rarely see the first concept executed well.

For example, let's say you were testing some CRUD API and you wanted to test two things: the create and the update. The strategy that I most commonly witness is as follows (sketched in code just after this list):

  • create a unit test for your create action. Start with a fixed, known state (let's call it c0), then run the create. The system is now in some new state c1. Check the response to the create routine, and check that c1 is the state that you expect.
  • independently, create a unit test for your update action. Start with some known-good state (let's call it u0): the state of an existing object in a database. Creating this platonic starting state is itself new work. Run your update against u0, producing some new state (u1). Check the response to your update action and check that u1 is the new state that you expect.
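
In Go-flavored pseudocode, the isolated version looks something like this (every name here is hypothetical; it's just to make the shape of the strategy concrete):

func TestCreate(t *testing.T) {
    c0 := emptyState()                      // fixed, known starting state (c0)
    resp, c1 := runCreate(c0, newBookInput) // hypothetical helpers throughout
    checkCreateResponse(t, resp)            // check the response to the create routine
    checkState(t, c1, expectedAfterCreate)  // check that c1 is the state we expect
}

func TestUpdate(t *testing.T) {
    // u0 is the "platonic" starting state: built by hand, never produced by create
    u0 := fixtureWithExistingBook()
    resp, u1 := runUpdate(u0, updateInput)
    checkUpdateResponse(t, resp)
    checkState(t, u1, expectedAfterUpdate)
}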

That's all well and good, but you've now created a handful of new problems:

  • how do you define the success criteria of the create action (that is, the verification p such that p(c1) indicates that the test for create passes) without defining it in terms of the read action, in order to guarantee isolation of the things under test? What value is provided by testing the create action alone? Doesn't this create a new hazard where the verification logic of the create test can diverge from the actual logic of the read action?
  • how do you define the initial state for the update test (in this example, u0)? Is that not simply the result of the create action? The update action is now being tested off of a platonic starting state. How do you know that this platonic starting state is reachable by your system? Is it not the case that c1, the output of the create test, and u0, the input of the update test, should always be equal, or else your tests are invalid? If that state is reachable now, how do you ensure that it continues to be reachable as your system changes?
  • if your update is tested off of a platonic starting state that is not the exact output of the create action, you now have two problems: your update is not testing the state reached by the create routine, and you've created a new, false requirement that the update action be usable against a state that is not reachable by your system. You had to go through all the trouble of creating this state, which is new work, when the create action ... literally does that work. The value provided by the isolation has to be significantly greater than the cost of creating that state, otherwise you're just generating busywork.

anyway, this comes up a lot for me since my primary project is a stateful multiplayer server whose only job is to contain and communicate the state of a game. integration testing this thing is ... hard. curious what people do for integration testing at a conceptual level, not at a tools/language/library level. do other people also face the problem I'm facing, or do people find testing against platonic states relatively unproblematic, in which case it sounds more like I'm doing it wrong?


jmoiron commented Apr 27, 2021

My POV on this topic comes from working on data storage systems.

Of your 3 problems, the 3rd is definitely the worst and I do not find tests of that nature to be useful. If the system changes and those states are no longer possible, what is the test telling you? These types of things are not durable to change, and detecting failures due to changes is the entire point of having these tests.

I look at a data storage system as a set of axiomatic actions, usually with some higher-level actions built on top of those axioms. Since I can't just declare that create works because it is axiomatic, I typically try to build high confidence in it to "axiomize" it, at which point I'm happy to then define the rest of the system behaviour naturally in terms of those actions.

Isolation is useful because it tells you what went wrong, but I see it more as a desired thing than a required thing. It's more important to catch bugs than to write tests that are well isolated but do not catch them.

Despite that, you can usually find some isolation in things like create by expanding your coverage of its side effects. What is observable about the null state? How much of that changes?

Typically, I will want a list verb that is not defined in terms of read, which returns all of the state in a trivial way. Systems will also typically expose a count verb that efficiently counts the number of records, either by tracking it as a sequence or just by not having to return all of the data. I also usually expose some internal counters for telemetry purposes, and these can be useful signals for establishing the correctness of core operations: e.g. a create from the null state might increase the number of pages allocated, and a count might increase the number of queries received.

With those signals, there's now a lot more you can test about create, and some of those tests should fail in isolation: if create fails to increment the number of records, for example, your problem is probably not in list or read. You can define your null state: count(null) == 0, list(null) == [], read(any) == null, then run a create and ensure all of those signals changed appropriately. The strongest signal is the create -> read round trip, but you can use list as a backup check on data integrity. If list worked but read failed, the problem is probably in read.
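
In Go, that null-state check might look something like this (a sketch; newStore and the verb methods are hypothetical stand-ins for whatever the real system exposes):

func TestCreateFromNull(t *testing.T) {
    s := newStore() // hypothetical: a store in its null state

    // pin down the null state across every observable signal
    if n := s.Count(); n != 0 {
        t.Fatalf("count(null) = %d, want 0", n)
    }
    if items := s.List(); len(items) != 0 {
        t.Fatalf("list(null) = %v, want []", items)
    }

    // run the create, then check that every signal moved the way it should
    if err := s.Create("k", "v"); err != nil {
        t.Fatal(err)
    }
    if n := s.Count(); n != 1 {
        t.Fatalf("count after create = %d, want 1", n)
    }
    // the strongest signal: the create -> read round trip
    if got, ok := s.Read("k"); !ok || got != "v" {
        t.Fatalf("read after create = %q, want %q", got, "v")
    }
}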

Without list or count, it's potentially difficult to verify that create did not modify additional state in error, on top of the expected state modification, so these are actually really good for checking all of the bounds around your core operations.

Now, create's behaviour from null state has a bunch of tests of various strength. Some may be weak signals, and some are worryingly coupled to the implementation, but the goal is to define create so thoroughly that subsequent tests can take its correctness for granted. Later, if those details about the initial null state transition change, so long as the behaviour still works, the rest of your test suite will succeed and the problem should be pretty obvious.

If list and count end up being too expensive for your system, or you don't want to expose them, you don't have to ship them in the public API, but you can still use them internally to verify the exported API.


jordanorelli commented Apr 28, 2021

oh man am I glad to hear from you.

Without list or count, it's potentially difficult to verify that create did not modify additional state in error

Ah! yeah, this is a good example. You test that you got what you did want, but forget to test that nothing else happened, and so you may have unintended side effects. I've definitely had this problem before.

If I'm understanding correctly, you're basically creating a sort of proxy value of the system state and testing against that: a neutral frame of reference that is more simply described than the entire state of the system. So long as things look correct from that frame of reference, we're ok; we just test the validity of the frame of reference separately. That makes sense to me. I think that's a pretty good and general solution to this category of problem. It's also probably data you already want anyway, because for a lot of projects that aggregate state data doubles as your observability metrics.

For my project, one of the big challenges is that we have domain situations where we might have a sequence of many stateful steps. So a challenging situation that I had to test for a multiplayer game was:

  • a game creates a room (gameplay session) with profanity filtering
  • the game poses a question to the connected players, but does not have to specify that it wants to use profanity filtering for this question, because that was already indicated when we created the room. This means that to turn on profanity filtering we only have to update the logic of where we create the play session, not the logic everywhere we pose a question.
  • a player sends in a profane answer to the question. they receive a reply rejecting their answer because it is profane.
  • ensure that the game and other players do not see the profane submission. This ensures that we don't have to update the logic everywhere that we receive a player submission; just as in the case of profanity filtering not existing, all submissions are treated as accepted since the server accepted them.
  • the player can send a new, clean answer, which is accepted and seen by the game and other players.

This was really hard to test because each of the steps relies on the state created prior, so going in full-isolation mode, the process of creating the starting state for the last step wound up being a lot of work, especially since you have to model out whether a client is receiving notification of another client's activity. Eventually, modeling the beginning states became complex enough that creating these initial state values and maintaining them over time meant people weren't writing tests. Also, everything is in-memory, so I can run hundreds (thousands?) of tests in under a second because it's Go; any perceived performance benefit of test isolation was irrelevant.

Isolation is useful because it tells you what went wrong, but I see it more as a desired thing than a required thing. It's more important to catch bugs than to write tests that are well isolated but do not catch them.

yeahhh. We definitely had that problem in the past: well-isolated tests that weren't catching bugs.

I wound up writing a testing library for writing dependent tests: tests that depend on the "output" of other tests: https://github.com/jordanorelli/tea/ (the incr example is probably the most straightforward to understand)

I know it looks abandoned but ... I'm actually using it in production now; I have a graph of ~600 tests that block CI for deploying our multiplayer server, and that suite of tests creates http servers and establishes websockets and tests that when one client does something, other clients do (or do not) see those things. So I'm starting and stopping hundreds of http servers and creating hundreds of websocket connections. I'm hitting ulimit problems now because many tests involve at least 3 connections (the game and two players) so I'm opening over a thousand websockets when I run go test.

It works by creating a tree of tests, where each test is a value defined by the following interface:

type Test interface {
    Run(t *testing.T)
}

so, like, if you had three test types: startAPIServer, which tests that the API server starts and is reachable; createBook, which uses some create endpoint to create a book record; and getBook, which uses the API endpoint to get the record back:

func TestAPI(t *testing.T) {
    // the root of the tree: every test below depends on the server being up
    root := tea.New(&startAPIServer{})

    created := root.Child(&createBook{title: "everybody poops"})

    // before the create the book shouldn't exist; after it, it should
    root.Child(&getBook{title: "everybody poops", expectError: ErrNotFound})
    created.Child(&getBook{title: "everybody poops"})

    tea.Run(t, root)
}

The tests use Go sub-tests, so you get tree output using just go test with no additional tooling. If a test fails, all of its subtests are still printed in the tree but marked as skipped, so that when a test fails you know how many dependent tests were skipped. It uses struct tags to pass state data from one test to the next in a given sequence. Running a given test means re-running every test back to the root of the tree, so you write tests that ergonomically look like dependent tests, but they actually execute as isolated tests.
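
Loosely, the struct-tag hand-off might look like this (the tag names here are invented purely for illustration; the tea README documents the real convention):

// hypothetical illustration only: a field the parent test saves gets copied
// into the matching field of each dependent test before that test runs
type createBook struct {
    title  string
    BookID int `tea:"save"` // hypothetical tag: set during Run, handed to children
}

type getBook struct {
    title  string
    BookID int `tea:"load"` // hypothetical tag: filled in from the nearest ancestor
}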

Writing this library dramatically increased our test coverage and test productivity. I write the test types, and any developer can create new test cases and new test sequences very easily by just composing the types that I've defined. But now I have a new problem: maintaining the test library is work, and its output is very bad and very confusing. I'm at a point where I have to work on the test library itself, but I'm not sure if this is just a very silly concept and I should throw it out, or if I should double down on it.

But it's interesting that you're not just like, throwing down a testing library that you think solves this problem elegantly and instead describing how to hand-roll this because the entire category is a bit dicey. Everyone I've talked to about this describes a different ad-hoc way of dealing with it.


jmoiron commented Apr 28, 2021

oh man am I glad to hear from you.

😄 👍

For my project, one of the big challenges is that we have domain situations where we might have a sequence of many stateful steps. So a challenging situation that I had to test for a multiplayer game was: [...]
This was really hard to test because each of the steps relies on the state created prior, so going in full-isolation mode, the process of creating the starting state for the last step wound up being a lot of work, especially since you have to model out whether a client is receiving notification of another client's activity

I think it's totally reasonable to write out a test with all of your bullet points happening in sequence.

These kind of remind me of transaction isolation tests and rollback tests, where some state transformations get rejected and shouldn't be seen by other users. For these, I write tests more like stories or scenarios; the bullet points you listed would probably be comments in the test, and it would be long, but I think it's useful to test these behaviours as a sequence of user actions rather than as a collection of isolated behaviours under different state conditions. These tests are very easy to read, but they only test a very specific permutation of actions, so they miss a lot of bugs.

For this reason I split tests into ones that cover the "intent" of a feature and ones that cover its "implications" in the wider system. Intention tests tell the story of what a feature is supposed to do; they're mostly intended to get the feature from zero to working, and for future developers to understand the intentions of the new feature. Coverage tests are there to test how that feature interacts with the wider system. Randomized testing, especially property testing (though I usually do this without a framework), is really good at the second, but it makes for bad reading, and those tests are harder to write.
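
A frameworkless property test along those lines might look roughly like this (a sketch, assuming the usual fmt/math/rand imports; newStore and its verbs are hypothetical, and the map is the trivially-correct reference model):

func TestRandomizedOps(t *testing.T) {
    rng := rand.New(rand.NewSource(1)) // fixed seed so failures are reproducible
    s := newStore()                    // hypothetical system under test
    model := map[string]string{}       // trivially-correct in-memory model

    for i := 0; i < 10000; i++ {
        k := fmt.Sprintf("k%d", rng.Intn(50))
        switch rng.Intn(3) {
        case 0: // create/update
            v := fmt.Sprintf("v%d", i)
            s.Put(k, v)
            model[k] = v
        case 1: // delete
            s.Delete(k)
            delete(model, k)
        case 2: // read: the system must always agree with the model
            if got := s.Get(k); got != model[k] {
                t.Fatalf("op %d: get(%q) = %q, model says %q", i, k, got, model[k])
            }
        }
    }
}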

I wound up writing a testing library for writing dependent tests: tests that depend on the "output" of other tests: https://github.com/jordanorelli/tea/ (the incr example is probably the most straightforward to understand)
The tests use Go sub-tests, so you get tree output using just go test with no additional tooling. If a test fails, all of its subtests are still printed in the tree but marked as skipped, so that when a test fails you know how many dependent tests were skipped. It uses struct tags to pass state data from one test to the next in a given sequence. Running a given test means re-running every test back to the root of the tree, so you write tests that ergonomically look like dependent tests, but they actually execute as isolated tests.

Cool! So you basically write tests as an n-ary tree, and each path to a leaf gets executed separately, but you've only had to define each node once for the tree instead of once per path. That could be really good, especially if you have big N's and a lot of depth; decoupling the tree structure from the lexical code structure is what lets you support much larger N and greater depth. I also love that failure at a node can short circuit the rest of the path so I don't see an enormous amount of failure output when I break something fundamental.

I can also see such a tree getting unwieldy to manage and hard to follow. It reads like "small functions" code because each step is broken up spatially and you have to follow its descendants much less naturally.

I've never been a cucumber/convey fan but one benefit to its approach is that the paths it builds through the call tree read a lot like the linear stories I like to tell in my tests through comments.

It looks good for enumerating a lot of permutations at each level, but while this is good for coverage, I've found it is kinda mixed in terms of ROI for "catching bugs". This might be the problem you're running up against... having loads of paths gives you great coverage but the coverage of the average test is very low.

For a randomized test for transaction isolation, to continue my example... if you have a huge class of writes going through the same latch internally, then you can settle on the write that is the easiest to verify for your property test, and then verify that the other writes hit the latch to ensure their "coverage" doesn't get invalidated. This saves a lot of time and a lot of code, and while it is technically a sacrifice in coverage, the quality of the test suite is still really high, and the class of bugs that could remain is really small.
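
A rough sketch of the latch-check half of that, with a hypothetical test-only counter exposed on the internal latch:

func TestWritesShareLatch(t *testing.T) {
    db := newTestDB()                // hypothetical constructor
    before := db.latchAcquisitions() // hypothetical counter on the internal latch

    db.Update("k", "v") // each write verb should route through the same latch...
    db.Delete("k")

    if got := db.latchAcquisitions() - before; got != 2 {
        // ...so the property test over one verb keeps covering the others
        t.Fatalf("want 2 latch acquisitions, got %d", got)
    }
}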

A lot of stateful systems have things like this in them, because their verbs are built on top of each other, and I'd say that the single property test + latch check gives me a higher degree of confidence in the system than a hand-built tree of all possible writes through whatever sequence of events that I imagined when writing the test, because the property test will run sequences that I didn't anticipate.

But it's interesting that you're not just like, throwing down a testing library that you think solves this problem elegantly and instead describing how to hand-roll this because the entire category is a bit dicey. Everyone I've talked to about this describes a different ad-hoc way of dealing with it.

Maybe there is one out there, but I don't know about it. I've not seen many testing libraries or testing frameworks that are about providing you with a tailored approach to testing a specific kind of system. Even things like Quickcheck and Hypothesis, which are probably the closest I know of, don't really tell you how to approach this. They give you a generalized approach towards producing inputs and checking outputs and are therefore still kind of "pure".

There's also this consistent push from people from various angles (infra, FP advocates, etc) for everything to be "stateless" which.. yeah, fine, stateless and immutable things are easier to reason about and preferable where possible, but also the world is stateful and everything in computing is built on top of state, so we need to be honest and admit that we're offloading those problems and they're still important to solve instead of just avoiding them.

There's probably a whole lot of cognitive bias that is going on.

  • People think their problems are special when they aren't.
  • Projects involving state storage & transition tend to be big and long-lasting, so the same person doesn't see a lot of them during their career to find the patterns in them.

And finally:

That being said, I think that, as engineers, we tend to discount the complexity we build ourselves vs. complexity we need to learn.


jordanorelli commented Apr 28, 2021

write tests as an n-ary tree, and each path to a leaf gets executed separately, but you've only had to define each node once for the tree instead of once per path.

yes, exactly. It's called "tea" because you're "reading the tea leaves" to tell your fortune.

I also love that failure at a node can short circuit the rest of the path so I don't see an enormous amount of failure output when I break something fundamental.

yeah, one of the shortcomings of using Go's sub-tests natively is that if your test fails and exits early, the sub-tests are never created. So a pass might be like "300 tests passed", and a failure might be like "150 tests passed, and 1 test failed", when what actually happened was "150 tests passed, 1 test failed, and 149 tests were skipped". Although Go's sub-tests allow you to mark tests as skipped explicitly, the ergonomics of doing so mean that it's very easy to mess that up. tea handles that for you automatically.
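
For example, with plain sub-tests (create and read are hypothetical helpers):

func TestSequence(t *testing.T) {
    t.Run("create", func(t *testing.T) {
        if err := create(); err != nil {
            // Fatal stops this test function here, so the "read" sub-test below
            // is never even registered: it shows up as neither failed nor skipped
            t.Fatal(err)
        }
        t.Run("read", func(t *testing.T) {
            if err := read(); err != nil {
                t.Fatal(err)
            }
        })
    })
}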

I can also see such a tree getting unwieldy to manage and hard to follow. It reads like "small functions" code because each step is broken up spatially and you have to follow its descendants much less naturally.

this is a massive problem we have now with tea. Writing tests is super easy, but looking back at the tree and adding tests to a large tree that already exists is nightmarishly confusing. I have to do some work to improve the ergonomics of larger test graphs.

I've never been a cucumber/convey fan

yeah so one of the problems that convey has is that because a test accesses the side-effects of its ancestors via closures, and stack frames always have a single parent, a given test can only appear along a single path. Since tea uses structs and struct fields to persist the runtime environment from test to test instead of stack frames and closures, tea has no such constraint. For example:

func TestCreateBook(t *testing.T) {
    // since you're creating a test -plan- and executing it in separate phases,
    // you can manipulate the plan arbitrarily -before- running it. This function takes
    // a node in a test plan as its input, and adds to it a bunch of children.
    addChildren := func(root *tea.Tree) {
        root.Child(&getBook{title: "the giving tree", expectError: ErrNotFound})

        withBook := root.Child(&createBook{title: "the giving tree"})
        withBook.Child(&getBook{title: "the giving tree"})
    }

    sql := tea.New(&startSQLServer{})
    mem := tea.New(&startMemServer{})

    addChildren(sql)
    addChildren(mem)

    tea.Run(t, sql)
    tea.Run(t, mem)
}

You can do that now and it works. The entire tree is a data structure that can be manipulated arbitrarily, so you can adopt tea easily into projects that use table-driven tests.
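
e.g. something like this should work today, reusing the types from earlier (a sketch, not a snippet from the repo):

func TestBooks(t *testing.T) {
    root := tea.New(&startAPIServer{})

    // build the plan from a table: one create -> get chain per title
    for _, title := range []string{"everybody poops", "the giving tree"} {
        root.Child(&createBook{title: title}).Child(&getBook{title: title})
    }

    tea.Run(t, root)
}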

Also any two equal test values are equivalent. I think this makes them "referentially transparent" but I've never done FP so I dunno. This:

a := tea.New(&A{})
a.Child(&B{X: 1}).Child(&C{Z: 10})
a.Child(&B{X: 2}).Child(&C{Z: 10})

Is exactly the same thing as this:

a := tea.New(&A{})
c := &C{Z: 10}
a.Child(&B{X: 1}).Child(c)
a.Child(&B{X: 2}).Child(c)

It doesn't matter that they're pointers to the same struct, because tea doesn't actually use that struct: the value is only treated as a template, and it's copied before it's ever used. The template value is never actually mutated; we create a new value to mutate in order to ensure isolation. This works presently and I rely on it.
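
The copying step could be as simple as a reflective shallow copy; a sketch of the idea (not tea's actual internals; needs import "reflect"):

// clone makes a fresh copy of a template test value so the user's original
// is never mutated (assumes every test is a pointer to a struct).
func clone(test Test) Test {
    v := reflect.ValueOf(test).Elem() // the template struct behind the pointer
    c := reflect.New(v.Type())        // a brand-new zero value of the same type
    c.Elem().Set(v)                   // shallow-copy the template's fields into it
    return c.Interface().(Test)
}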

I've been working on making it so that you can combine nodes in the plan to treat it as a DAG instead of a tree. So long as there are no cycles you can always break the DAG apart into its component paths. E.g., that prior example would hypothetically turn into this:

func TestCreateBook(t *testing.T) {
    sql := tea.New(&startSQLServer{})
    mem := tea.New(&startMemServer{})

    allDBs := sql.And(mem)
    allDBs.Child(&getBook{title: "the giving tree", expectError: ErrNotFound})
    withBook := allDBs.Child(&createBook{title: "the giving tree"})
    withBook.Child(&getBook{title: "the giving tree"})

    tea.Run(t, allDBs)
}

(that's not implemented yet though.)

things like Quickcheck and Hypothesis, which are probably the closest I know of, don't really tell you how to approach this.

oh cool these weren't on my radar, thanks for mentioning them, they'll be good prior art to look at.

There's also this consistent push from people from various angles (infra, FP advocates, etc) for everything to be "stateless" which.. yeah, fine, stateless and immutable things are easier to reason about and preferable where possible, but also the world is stateful and everything in computing is built on top of state, so we need to be honest and admit that we're offloading those problems and they're still important to solve instead of just avoiding them.

yeahhhh I encounter this -a lot-. My project serves only a single purpose: to handle the state management so that other systems don't have to. So much conventional wisdom is poorly suited to projects of this nature because usually people are just using a database, but what I'm making has many similarities to databases.

anyway thanks for the feedback, it sounds like making a library like this isn't raising all sorts of red flags to you.
