The first technique we’ll use is a special case of separation of concerns. Specifically, here we’ll strive to separate the pure code that contains the core logic of the system we are building from the impure (but necessary!) code that handles all the bits and pieces we need to actually use it, usually involving user interaction, database persistence and similar.

Let’s illustrate this with an example. We’ll write a program that lets the user input a number, calculates the factorial of that number and then shows a message with the result.

We’ll start by ignoring the whole user interaction and write the pure function that calculates the result:

function factorial(n){
  if(n < 0){
    return NaN;
  }
  let result = 1;
  while(n >= 1){
    result = result * n;
    n = n - 1;
  }
  return result;
}

And then, since it is a pure function we can test it very easily trying out some possible input values and see what results they produce:

factorial(0) == 1;
//=> true

factorial(1) == 1;
//=> true

factorial(2) == 2;
//=> true

factorial(3) == 6;
//=> true

factorial(4) == 24;
//=> true

factorial(5) == 120;
//=> true

factorial(-5) == 120;
//=> false

factorial(-5) == -120;
//=> false

isNaN(factorial(-5));
//=> true

Now we can write the user interface functions and have that impure code call the factorial function:

function main(){
  let input = prompt("Please enter a number");
  let n = parseInt(input);
  let result = factorial(n);
  alert("The result is " + result);
}

This code snippet will work in a browser based JavaScript shell, e.g. Chrome Console, since it needs the browser for its dialog message support. Also note that the result of prompt is an arbitrary string that just may contain a number. We must parse it before we pass it on to factorial.

When we run main we’ll be given an input dialog box to enter the number:

prompt

We enter the number, press OK and get the result:

alert

Ok, so we have the code that solves the problem by neatly separating the pure from the impure part. You may ask yourself if we needed the impurities for something so basic as communication with the user, then why did we even bother with this separation? What did we gain with it?

To answer those questions, I’ll show a counter-example - a completely impure code that does the same thing:

function factorialImpure(){
  let input = prompt("Please enter a number");
  let n = parseInt(input);
  let result;
  if(n < 0){
    result = NaN;
  }
  result = 1;
  while(n >= 1){
    result = result * n;
    n = n - 1;
  }
  alert("The result is " + result);
}

It’s really just the same example, but with all the code lumped together. We only need to run it to see that it’s doing the same thing. However, we lost one thing - we can’t write tests anymore the way we did with the pure version:

//we could do this
factorial(3) == 6;
//=> true

//but we can't do this:
factorialImpure(3) == 6;

Function factorialImpure does not depend on its parameters at all. It depends on the external state, specifically on whatever user enters the dialog box. It also depends on an external action - nothing will happen until user actually presses OK. Likewise, that function does not return any value (the result is undefined). Instead, it produces a side effect - the alert message. The only straightforward way to test it is to manually run it, type different inputs in the dialog and see what happens. Very tedious!

Not only are they more difficult to test, impure functions are also less reusable. Let’s say that we wanted to be able to run our code not in a browser, but in a node shell, where there is no prompt dialog to collect the user input and we want to pass it as a command line argument instead. With our pure and impure code separated, we can reuse factorial and write a new impure wrapper for the shell interface:

function factorial(n){
    if(n < 0){
        return NaN;
    }
    let result = 1;
    while(n >= 1){
        result = result * n;
        n = n - 1;
    }
    return result;
}

function main(){
    //the first arg is the executable, i.e. node,
    //the second one is the source file name
    //and the third (zero-based) is our custom input
    let input = process.argv[2]; 
    let n = parseInt(input);
    let result = factorial(n);
    console.log("The result is " + result);
}

main();

Save this piece of code in a file called factorial.js and try running it with node factorial.js 3:

node factorial.js 3

#=> The result is 120

And we just re-used our pure factorial function without any modification - we got to make all the changes in the side effect heavy user interface function.

We could further extrapolate on this example with different kinds of interactions, e.g. if we had to get the input number from say some REST service using an AJAX call, then calculate the factorial and push it back to some other service, we would just need another input-output wrapper - one that uses AJAX, e.g.

By separating the core logic from the input output code, we managed to keep the core code pure, which made it more easily reusable, testable and generally better to work with. And yet, we still got to have the necessary side effects in our code. It’s all right there, it’s just neatly sorted in one spot rather than interleaved throughout the code.