"We believe that CoT monitoring may be one of few tools we will have to oversee superhuman models of the future."

"...We recommend against applying strong optimization pressure directly to the CoTs of frontier reasoning models, leaving CoTs unrestricted for monitoring."

https://openai.com/index/chain-of-thought-monitoring/





IYKYK

https://gist.github.com/GideonPotok/9d8de616ee20571d1d38ea760c5b99a2