strictEquality and explicit-nulls do not work well together

soronpo · February 21, 2025, 4:33pm

With -Yexplicit-nulls

import language.strictEquality

val x: String | Null = null

val check = x == null //error

Should the explicit nulls feature introduce CanEqual[T | Null, Null] and CanEqual[T | Null, T]?

Could be relevant for the discussion, how to generally handle strict equality for unions:

github.com/scala/scala3

CanEqual is undefined by default for unions?

opened 09:02PM - 04 Dec 23 UTC

soronpo

itype:enhancement area:typeclass-derivation backlog

## Compiler version

v3.3.1

## Minimized code

https://scastie.scala-lang….org/94w48wQbRdq9rQ2IKcbNCg

```Scala
import language.strictEquality

sealed trait Foo derives CanEqual
sealed trait Bar derives CanEqual
object FooObj extends Foo
object BarObj extends Bar

val x: Foo | Bar = FooObj

@main def main: Unit =
  x match
    case FooObj =>
    case BarObj =>
```

## Output

```scala
error: Values of types object FooObj and Foo | Bar cannot be compared with == or !=
```

## Expectation

I would expect that by default we have something like

```scala
given CanEqualUnion[A, B](using CanEqual[A, A], CanEqual[B, B]): CanEqual[A | B, A | B] = CanEqual.derived
```

So no error should happen under strict equality for any union of types that have `CanEqual` defined for them.

Sporarum · February 21, 2025, 4:45pm

This seems required for these two features to interact in a safe way

Is there a way to allow that, but disallow "null" == null (since "null" is a String | Null)?

Odomontois · February 24, 2025, 11:22am

I’ve tried a trivial fix with

given [A, B](using CanEqual[A, B]): CanEqual[A | Null, B | Null] = CanEqual.derived

And it seems to work in most obvious cases

Except it allows null == ""

odersky · February 25, 2025, 9:57pm

That’s actually useful. Even if things are declared with non-null types they can still be null at runtime because they might not be initialized. So, a test like null == s where s is a String is sometimes necessary and it would be annoying if it was not allowed.
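A minimal sketch of how a field declared as String can still be observed as null, assuming default compiler settings (neither -Yexplicit-nulls nor strictEquality enabled). The names Base, Derived, summary and describe are made up for illustration and do not come from the thread:

```scala
class Base:
  // Evaluated while Base's constructor runs, before Derived's fields are assigned.
  val summary: String = describe()
  def describe(): String = "base"

class Derived extends Base:
  val name: String = "derived"
  override def describe(): String =
    // name is declared as String, but it has not been assigned yet
    // when this method is called from Base's constructor.
    if name == null then "name is not initialized yet" else name

@main def initDemo(): Unit =
  println(Derived().summary)
```

Here the overridden describe() runs before Derived has assigned name, so the null test is the only way for the method to notice that state.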

mberndt · February 25, 2025, 10:50pm

In my humble opinion, any access to an uninitialized field is a bug. It is in the same category as accessing an array out of bounds. This isn’t something that one should need to branch on, and I wouldn’t let any code that does this pass code review.

And regarding the original issue, I don’t think it’s a real problem because while x == null doesn’t compile, x eq null works fine, and it makes more sense too because checking for null is a check for reference equality, not for value equality, which is what == is meant to be used for.
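A short sketch of the eq/ne alternative described above. The helper name describe is hypothetical; the point is only that reference comparison does not ask for a CanEqual instance:

```scala
import scala.language.strictEquality

// Reference-equality null check: no CanEqual instance is involved.
def describe(x: String | Null): String =
  if x eq null then "null" else "non-null"

// The negated form uses ne.
def isPresent(x: String | Null): Boolean = x ne null

@main def eqDemo(): Unit =
  println(describe(null))
  println(describe("hello"))
  println(isPresent("hello"))
```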

There is only one potential issue here: many users don’t know eq because it’s not something that is needed very often. If this is of real concern, it can be fixed with a simple compiler hack. When the compiler sees an == null check and a suitable CanEqual cannot be found, the error message should suggest trying eq.

Eastsun · February 26, 2025, 12:16am

Maybe we can add methods isNull and isNotNull in Any.
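As a rough approximation of that idea, the two methods could be sketched as extension methods instead of members of Any. This is an assumption about how they might behave, not an existing or proposed library API:

```scala
import scala.language.strictEquality

// Hypothetical helpers: only the names come from the suggestion above.
extension (x: Any)
  // Cast to a nullable reference type so that eq can be used,
  // which avoids the CanEqual requirement of ==.
  def isNull: Boolean = x.asInstanceOf[AnyRef | Null] eq null
  def isNotNull: Boolean = !x.isNull

@main def isNullDemo(): Unit =
  val s: String | Null = null
  println(s.isNull)
  println("hello".isNotNull)
```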

odersky · February 26, 2025, 9:13am

Not even in an assert(s != null)? In code that you don’t control fully? I agree eq/ne would be an alternative, but we want to make it easy and straightforward to write such asserts. Someone who adds such an assert in a desperate debug session would not appreciate the technicalities of CanEqual here.

mberndt · February 26, 2025, 2:52pm

Well, assert is a special case because it is a tool specifically made to detect bugs at run-time. In pretty much all other cases, I don’t think one should ever branch on a condition that can only ever be true when there’s a bug in the program. And adding a language feature that is always available but whose only legitimate use is inside an assert statement doesn’t seem reasonable to me, especially given that eq and ne work just fine.
Again, I think a compiler hint to use eq or ne is fine.

noti0na1 · February 26, 2025, 3:36pm

We had many issues with language.strictEquality before; unfortunately, we couldn’t find anyone who actively maintains this feature and could discuss the expected behaviour.

Hence, the current behaviour of explicit nulls is based on rules without strictEquality.