Expand each heading to view my notes.
https://openai.com/research/learning-from-human-preferences#:~:text=For example%2C a robot which was supposed to grasp items instead positioned its manipulator in between the camera and the object so that it only appeared to be grasping it%2C as shown below.