txt.sour.is @<niplav https://niplav.site/twtxt.txt> "Maybe it's true that intelligence depends on the environment, but consider: the environments where policy iteration performs better than RL with ..."

Thu, May 5 07:35 2022 (4y ago)

Maybe it’s true that intelligence depends on the environment, but consider: the environments where policy iteration performs better than RL with temporal difference learning are kind of dumb.

⤋ Read More

Participate