Most BRDFs that I have seen get darker as they get rougher like the article mentions. There are some that take into account the interreflections of the theoretical facets that make the distribution of normals that become the BRDF (mentioned in the link also), and they do better but I'm not sure even they pass this test straight up.
Most of what is actually being used out there loses energy. Sometimes people have used a lookup table on top of the BRDF to compensate.
So weird to see this top of HN! Also there are actually two furnace tests, the other one is with 0.5 emission and 0.5 albedo, also should produce 1.0 +/- noise pixels.
In the end, nobody actually cared about automatically fitting material data from BRDFs or if they do, they actually prefer a symmetric and conserving-enough BSDF, which is what most future research improved. In particular, breaking symmetry means making bidirectional path tracing and other techniques kinda weird.
This is based on classic black body radiation experiments which use ovens/kilns/furnace.
Here the concept is similar and the point is that the object is enclosed in a cavity that has an uniform emission spectrum, in this case white instead of black body.
Funny enough this Youtube video presented itself to my feed the other day that goes over this very topic in the planned Blender 4.0 release. They have a new Multiscatter GGX function for BSDF. I think this goes a lot more in-depth and presents a lot of examples.
it starts grey, then you see the outline of a sphere, background fades in, a red, blue, and white sphere appear around the central grey/white sphere, the scene fades back to grey, then the red and blue sphere come back and you can see the reflections or whatever on the central sphere, but you can't tell it's a sphere since it's "faded in to the background".
Also tried it out in both up-to-date Chrome and Safari on my up-to-date MacBook, which can play everything on https://webglsamples.org/ just fine, but on this page, nothing happens.
It doesn't pass the white furnace test.
That made me realize I didn't understand much of the math I implemented. Random multiplications by pi and 2 as the author stated didn't work out.