Jump to content

File:Robot hand trained with human feedback 'pretends' to grasp ball.ogg

From Wikipedia, the free encyclopedia

Robot_hand_trained_with_human_feedback_'pretends'_to_grasp_ball.ogg (Ogg Theora video file, length 4.2 s, 320 × 320 pixels, 205 kbps, file size: 106 KB)

Summary

[edit]
Media data and Non-free use rationale
Description An AI system learns to pretend to grasp an object by placing the hand between the camera and the object. So it receives positive feedback from its user.
Author or
copyright owner
Dario Amodei, Paul Christiano, Alex Ray
Source (WP:NFCC#4) Original publication: Where: https://openai.com/blog/deep-reinforcement-learning-from-human-preferences/ When: 21 December 2016 How: As part of a blog post

Immediate source: https://openai.com/content/images/2017/06/gifhandlerresized.gif

Date of publication 13 June 2017
Use in article (WP:NFCC#7) AI alignment
Purpose of use in article (WP:NFCC#8) This GIF illustrates what happens when an AI system is trained by human feedback. The system learns to fool the human into giving positive feedback. The fallibility of human feedback is a central problem in scalable supervision.
Not replaceable with
free media because
(WP:NFCC#1)
Other examples of unintended AI behavior are not from AI systems trained with human feedback. This is because human feedback is not widely used yet.

Furthermore, other examples also do not have a free use license either. I have gone through the largest list of examples to confirm this: https://docs.google.com/spreadsheets/d/e/2PACX-1vRPiprOaC3HsCf5Tuum8bRfzYUiKLRqJmbOoC-32JorNdfyTiRRsR7Ea5eWtvsWzuxo8bjOxCG84dAg/pubhtml The authors of such examples do not seem to be interested in attaching a free-use license to their video uploads.

A replacement cannot be created on purpose because unintended AI behavior is unintended - i.e. not on purpose.

Minimal use (WP:NFCC#3) The file will be used in only one article. It shows a screenshot clip of only a few seconds.
Respect for
commercial opportunities
(WP:NFCC#2)
The content was created by OpenAI Nonprofit. This is a research blog post from a research organization. The content is not related to any commercial product. It was released as part of a blog post by the authors who wanted to illustrate the dangers of training AI by human feedback.
Fair useFair use of copyrighted material in the context of AI alignment//en.wikipedia.org/wiki/File:Robot_hand_trained_with_human_feedback_%27pretends%27_to_grasp_ball.oggtrue

Licensing

[edit]

File history

Click on a date/time to view the file as it appeared at that time.

Date/TimeThumbnailDimensionsUserComment
current12:05, 9 September 20224.2 s, 320 × 320 (106 KB)SoerenMind (talk | contribs)Uploading a non-free file using File Upload Wizard

The following page uses this file:

Transcode status

Update transcode status
Format Bitrate Download Status Encode time
VP9 240P 32 kbps Completed 18:47, 18 February 2024 2.0 s
WebM 360P 61 kbps Completed 07:28, 31 October 2023 1.0 s
QuickTime 144p (MJPEG) 172 kbps Completed 02:18, 9 October 2024 1.0 s

Metadata