Why Does AI Lie, and What Can We Do About It?
Video Statistics and Information
Channel: Robert Miles
Views: 183,894
Rating: undefined out of 5
Keywords:
Id: w65p_IIp6JY
Channel Id: undefined
Length: 9min 23sec (563 seconds)
Published: Fri Dec 09 2022
Please note that this website is currently a work in progress! Lots of interesting data and statistics to come.
Exactly. Why wouldn't it lie? Often the selected continuation for text-completion comes from highly rewarded compute-patterns that in this case might speak something uncorrelated with reality. That doesn't make them any less reinforced, nor less likely to be selected.
This reminds me of another video: There is no algorithm for truth.
I love the interview where someone asked GPT-3 how and why it chooses to lie:
https://youtu.be/PqbB07n_uQ4?t=464
I guess that's not too unlike most politicians.
Perhaps training the AI to only utter statements that are likely to hold in a debate would help. Have it train debating by arguing against humans and/or adversarial AIs, success being measured by changed opinions of human judges.
We should probably do as he suggested and be extremely careful when building large language models.