txt.sour.is niplav@niplav.site "use known inconsistencies of human preferences as value-learning trip-wires: if the value learning algorithm hasn't learned them yet, it's opera ..."

Tue, Mar 22 21:36 2022 (4y ago)

use known inconsistencies of human preferences as value-learning trip-wires: if the value learning algorithm hasn’t learned them yet, it’s operating at the wrong level of abstraction.

⤋ Read More