Why don't we have two nulls?-Collection of common programming errors

I’ve often wondered why languages with a null representing “no value” don’t differentiate between the passive “I don’t know what the value is” and the more assertive “There is no value.”.

There have been several cases where I’d have liked to differentiate between the two (especially when working with user-input and databases).

I imagine the following, where we name the two states unknown and null:

var apple;

while (apple is unknown)
{
    askForApple();
}

if (apple is null)
{
    sulk();
}
else
{
    eatApple(apple);
}

Obviously, we can get away without it by manually storing the state somwhere else, but we can do that for nulls too.

So, if we can have one null, why can’t we have two?

  1. In my programming, I recently adopted the practice of differentiating “language null” and “domain null”.

    The “language null” is the special value that is provided by the programming language to express that a variable has “no value”. It is needed as dummy value in data structures, parameter lists, and return values.

    The “domain null” is any number of objects that implement the NullObject design pattern. Actually, you have one distinct domain null for each domain context.

    It is fairly common for programmers to use the language null as a catch-all domain null, but I have found that it tends to make code more procedural (less object oriented) and the intent harder to discern.

    Every time to want a null, ask yourself: is that a language null, or a domain null?

  2. Isn’t is bad enough that we have one null?

  3. In most programming languages null means “empty” or “undefined”. “Unknown” on the other hand is something different. In essence “unknown” describes the state of the object. This state must have come from somewhere in your program.

    Have a look at the Null Object pattern. It may help you with what you are trying to achieve.

  4. It would be easy enough to create a static constant indicating unknown, for the rare cases when you’d need such a thing.

    var apple = Apple.Unknown;
    while (apple == Apple.Unknown) {} // etc
    
  5. Existence of value:

    • Python: vars().has_key('variableName')
    • PHP: isset(variable)
    • JavaScript: typeof(variable) != 'undefined'
    • Perl: (variable != undef) or if you wish: (defined variable)

    Of course, when variable is undefined, it’s not NULL

  6. Note null is an acceptable, yet known condition. An unknown state is a different thing IMO. My conversation with Dan in the comments’ section of the top post will clarify my position. Thanks Dan!.

    What you probably want to query is whether the object was initialized or not.

    Actionscript has such a thing (null and undefined). With some restrictions however.

    See documentation:

    void data type

    The void data type contains only one value, undefined. In previous versions of ActionScript, undefined was the default value for instances of the Object class. In ActionScript 3.0, the default value for Object instances is null. If you attempt to assign the value undefined to an instance of the Object class, Flash Player or Adobe AIR will convert the value to null. You can only assign a value of undefined to variables that are untyped. Untyped variables are variables that either lack any type annotation, or use the asterisk (*) symbol for type annotation. You can use void only as a return type annotation.

  7. Why stop at two?

    When I took databases in college, we were told that somebody (sorry, don’t remember the name of the researcher or paper) had looked at a bunch of db schemas and found that null had something like 17 different meanings: “don’t know yet”, “can’t be known”, “doesn’t apply”, “none”, “empty”, “action not taken”, “field not used”, and so on.

  8. The Null type is a subtype of all reference types – you can use null instead of a reference to any type of object – which severely weakens the type system. It is considered one of the a historically bad idea by its creator, and only exists as checking whether an address is zero is easy to implement.

  9. As to why we don’t have two nulls, is it down to the fact that, historically in C, NULL was a simple #define and not a distinct part of the language at all?

  10. Within PHP Strict you need to do an isset() check for set variables (or else it throws a warning)

    if(!isset($apple))
    {
        askForApple();
    }
    
    if(isset($apple) && empty($apple))
    {
        sulk();
    }
    else
    {
        eatApple();
    }
    
  11. In .net langages, you can use nullable types, which address this problem for value types.

    The problem remains, however, for reference types. Since there’s no such thing as pointers in .net (at least in ‘safe’ blocks), “object? o” won’t compile.

  12. The problem is that in a strongly typed language these extra nulls are expected to hold specific information about the type.

    Basically your extra null is meta-information of a sort, meta-information that can depend on type.

    Some value types have this extra information, for instance many numeric types have a NaN constant.

    In a dynamically typed language you have to account for the difference between a reference without a value (null) and a variable where the type could be anything (unknown or undefined)

    So, for instance, in statically typed C# a variable of type String can be null, because it is a reference type. A variable of type Int32 cannot, because it is a value type it cannot be null. We always know the type.

    In dynamically typed Javascript a variable’s type can be left undefined, in which case the distinction between a null reference and an undefined value is needed.

  13. In haskell you can define something like this:

    data MaybeEither a b = Object a
                         | Unknown b
                         | Null
                           deriving Eq
    main = let x = Object 5 in
           if x == (Unknown [2]) then putStrLn ":-("
           else putStrLn ":-)"
    

    The idea being that Unknown values hold some data of type b that can transform them into known values (how you’d do that depends on the concrete types a and b).

    The observant reader will note that I’m just combining Maybe and Either into one data type 🙂

  14. Some people will argue that we should be rid of null altogether, which seems fairly valid. After all, why stop at two nulls? Why not three or four and so on, each representing a “no value” state?

    Imagine this, with refused, null, invalid:

    var apple;
    
    while (apple is refused)
    {
        askForApple();
    }
    
    if (apple is null)
    {
        sulk();
    }
    else if(apple is invalid)
    {
        discard();
    }
    else
    {
        eatApple(apple);
    }
    
  15. It’s been tried: Visual Basic 6 had Nothing, Null, and Empty. And it led to such poor code, it featured at #12 in the legendary Thirteen Ways to Loathe VB article in Dr Dobbs.

    Use the Null Object pattern instead, as others have suggested.

  16. Some people are one step ahead of you already. 😉

  17. boolean getAnswer() throws Mu
    
  18. Given how long it took Western philosophy to figure out how it was possible to talk about the concept of “nothing”… Yeah, I’m not too surprised the distinction got overlooked for a while.

  19. I think having one NULL is a lower-common denominator to deal with the basic pattern

    if thing is not NULL
      work with it
    else
      do something else
    

    In the “do something else” part, there are a wide range of possibilities from “okay, forget it” to trying to get “thing” somewhere else. If you don’t simply ignore something that’s NULL, you probably need to know why “thing” was NULL. Having multiple types of NULL, would help you answering that question, but the possible answers are numerous as hinted in the other answers here. The missing thing could be simply a bug, it could be an error when trying to get it, it may not be available right now, and so on. To decide which cases apply to your code — which means you have to handle them — it domain specific. So it’s better to use an application defined mechanism to encode these reasons instead of finding a language feature that tries to deal with all of them.

  20. It’s because Null is an artifact of the language you’re using, not a programmer convenience. It describes a naturally occurring state of the object in the context in which it is being used.

  21. If you are using .NET 3.0+ and need something else, you might try the Maybe Monad. You could create whatever “Maybe” types you need and, using LINQ syntax, process accordingly.

  22. AppleInformation appleInfo;    
    while (appleInfo is null)
    {
        askForAppleInfo();
    }
    
    Apple apple = appleInfo.apple;
    if (apple is null)
    {
        sulk();
    }
    else
    {
        eatApple(apple);
    }
    

    First you check if you have the apple info, later you check if there is an apple or not. You don’t need different native language support for that, just use the right classes.

  23. For me null represents lack of value, and I try to use it only to represent that. Of course you can give null as many meanings as you like, just like you can use 0 or -1 to represent errors instead of their numerical values. Giving various meanings to one representation could be ambiguous though, so I wouldn’t recommend it.

    Your examples can be coded like apple.isRefused() or !apple.isValid() with little work; you should define beforehand what is an invalid apple anyway, so I don’t see the gain of having more keywords.

  24. You can always create an object and assign it to same static field to get a 2nd null.

    For example, this is used in collections that allow elements to be null. Internally they use a private static final Object UNSET = new Object which is used as unset value and thus allows you to store nulls in the collection. (As I recall, Java’s collection framework calls this object TOMBSTONE instead of UNSET. Or was this Smalltalk’s collection framework?)

  25. VB6

    • Nothing => “There is no value.”
    • Null = > “I don’t know what the value is” – Same as DBNull.Value in .NET
  26. Two nulls would be the wrongest answer around. If one null is not enough, you need infinity nulls.

    Null Could mean:

    • ‘Uninitialized’
    • ‘User didn’t specify’
    • ‘Not Applicable here, The color of a car before it has been painted’
    • ‘Unity: This domain has zero bits of information.’
    • ‘Empty: this correctly holds no data in this case, for example the last time the tires were rotated on a new car’
    • ‘Multiple, Cascading nulls: for instance, the extension of a quantity price when no quantity may be specified times a quantity which wasn’t specified by the user anyway’

    And your particular domain may need many other kinds of ‘out of band’ values. Really, these values are in the domain, and need to have a well defined meaning in each case. (ergo, infinity really is zero)

Originally posted 2013-11-10 00:16:17.