Robert Miles

Rob Miles is a science communicator focused on AI Safety and Alignment.
He is probably my biggest influence. He was the first person to bring complicated AI Safety concepts to the general public, many years ago, well before the LLM revolution started, at a time when today’s reality seemed like sci-fi and the whole field of alignment research was an academic curiosity, a cool way for intellectuals to tickle the most rational and analytical parts of their brains.
His YouTube channel, Robert Miles AI Safety, and his interviews on Computerphile popularized the frontier ideas coming from LessWrong, the Machine Intelligence Research Institute, the Centre for the Study of Existential Risk, and leading AI labs like OpenAI and DeepMind.

Robert Miles is a true hero of the AI Safety saga and an inspiring force for lethalintelligence.ai.

AI Ruined My Year

June 1, 2024 8:17 am

How to Help: https://aisafety.info/questions/8TJV/How-can-I-help
https://www.aisafety.com/

AI Safety Talks: https://www.youtube.com/@aisafetytalks

There's No Rule That Says We'll Make It: https://www.youtube.com/watch?v=JD_iA7imAPs
The other "Killer Robot Arms Race" Elon Musk should worry about: https://www.youtube.com/watch?v=7FCEiCnHcbo

Rob's Reading List:
Podcast: https://rmrlp.libsyn.com/
YouTube Channel: https://www.youtube.com/@RobMilesReadingList
The FLI Open Letter: https://www.youtube.com/watch?v=3GHjhG6Vo40
Yudkowsky in TIME: https://www.youtube.com/watch?v=a6m7JynBp-0
Ian Hogarth in the FT: https://www.youtube.com/watch?v=Z8VvF82T6so

Links:
The CAIS Open Letter: https://www.safe.ai/work/statement-on-ai-risk
The FLI Open Letter: https://futureoflife.org/open-letter/pause-giant-ai-experiments/
The Bletchley Declaration: https://www.gov.uk/government/publications/ai-safety-summit-2023-the-bletchley-declaration/the-bletchley-declaration-by-countries-attending-the-ai-safety-summit-1-2-november-2023
US Executive Order: https://www.whitehouse.gov/briefing-room/presidential-actions/2023/10/30/executive-order-on-the-safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence/
Some analysis of the EO: https://thezvi.substack.com/p/on-the-executive-order
"Sparks of AGI" Paper: https://arxiv.org/abs/2303.12712
Yudkowsky in TIME: https://time.com/6266923/ai-eliezer-yudkowsky-open-letter-not-enough/
Hogarth in the FT: https://www.ft.com/content/03895dc4-a3b7-481e-95cc-336a524f2ac2
The AI Safety Institute: https://www.gov.uk/government/publications/ai-safety-institute-overview/introducing-the-ai-safety-institute
Responsible Scaling Policies: https://metr.org/blog/2023-09-26-rsp/
The EU AI Act: https://artificialintelligenceact.eu/the-act/
Hinton on CBS: https://youtu.be/qpoRO378qRY

Sources:
"Sparks of AGI" Talk: https://www.youtube.com/watch?v=qbIk7-JPB2c
Yann LeCun on Lex Fridman's Podcast: https://www.youtube.com/watch?v=SGzMElJ11Cc
White House Press Briefings: https://x.com/TVNewsNow/status/1663640562363252742
https://www.youtube.com/watch?v=JHNkyHl5FpY
King Chuck on AI: https://www.youtube.com/watch?v=0_jw40Ga_mA

"Equally sharing a cake between three people - Numberphile": https://www.youtube.com/watch?v=kaMKInkV7Vs

Community, various screenshots
The Simpsons
Sneakers (1992)

Thanks to Rational Animations for the train sequence!
https://www.youtube.com/@RationalAnimations

With enormous thanks to my wonderful patrons:
- Tor Barstad
- Timothy Lillicrap
- Juan Benet
- Sarah Howell
- Kieryn
- Mazianni
- Scott Worley
- Jason Hise
- Clemens Arbesser
- Francisco Tolmasky
- David Reid
- Andrew Blackledge
- Cam MacFarlane
- Olivier Coutu
- CaptObvious
- Ze Shen Chin
- ikke89
- Isaac
- Erik de Bruijn
- Jeroen De Dauw
- Ludwig Schubert
- Eric James
- Owen Campbell-Moore
- Raf Jakubanis
- Esa Koskinen
- Nathan Metzger
- Jonatan R
- Gunnar
- Laura Olds
- Paul Hobbs
- Bastiaan Cnossen
- Eric Scammell
- Alexare
- Reslav Hollós
- Jérôme Beaulieu
- Nathan Fish
- Taras Bobrovytsky
- Jeremy
- Vaskó Richárd
- Andrew Harcourt
- Chris Beacham
- Zachary Gidwitz
- Art Code Outdoors
- Abigail Novick
- Edmund Fokschaner
- DragonSheep
- Richard Newcombe
- Joshua Michel
- Richard
- ttw
- Sophia Michelle Andren
- Alan J. Etchings
- James Vera
- Stumbleboots
- Peter Lillian
- Grimrukh
- Colin Ricardo
- DN
- Mr Cats
- Robert Paul Schwin
- Roland G. McIntosh
- Benjamin Mock
- Emiliano Hodges
- Maxim Kuzmich
- Joanny Raby
- Tom Miller
- Eran Glicksman
- CheeseBerry
- Hoyskedotte
- Alexey Malafeev
- Jeff Starr
- Justin
- Liviu Macovei
- Javier Soto
- David Christal
- Jam
- Just Me
- Sebastian Zimmer
- Matt Thompson
- Xan Atkinson
- Andy
- Albert Higgins
- Alexander230
- Clay Upton
- Alex Ander
- Carolyn
- Nathan Rogowski
- David Morgan
- little Bang
- Chad M Jones
- Dmitri Afanasjev
- Christian Oehne
- Marcel Ward
- Andrew Weir
- Miłosz Wierzbicki
- Tendayi Mawushe
- Kees
- loopuleasa
- Marco Tiraboschi
- Fraser Cain
- Patrick Henderson
- Daniel Munter
- Ian
- James Fowkes
- Len
- Yuchong Li
- Diagon
- Puffjanga
- Daniel Eickhardt
- 14zRobot
- Stuart Alldritt
- DeepFriedJif
- Garrett Maring
- Stellated Hexahedron
- Jim Renney
- Edison Franklin
- Piers Calderwood
- Matt Brauer
- Mihaly Barasz
- Rajeen Nabid
- Iestyn bleasdale-shepherd
- Marek Belski
- Luke Peterson
- Eric Rogstad
- Max Chiswick
- slindenau
- Nicholas Turner
- Jannis Funk
- This person's name is too hard to pronounce
- Jon Wright
- Andrei Trifonov
- Bren Ehnebuske
- Martin Frassek
- Matthew Shinkle
- Robby Gottesman
- Ohelig
- Sarah
- Nikola Tasev
- Tapio Kortesaari
- Soroush Pour
- Boris Badinoff
- DangerCat
- Jack Phelps
- Kyle Green
- Lexi X
- John Slape
- Joel Gardner
- Christopher Creutzig
- Johann Puzik
- Pindex
- RMR
- Andrew Edstrom
https://www.patreon.com/robertskmiles
...

AI "Stop Button" Problem - Computerphile

March 3, 2017 4:32 pm

How do you implement an on/off switch on a General Artificial Intelligence? Rob Miles explains the perils.

Part 1: https://www.youtube.com/watch?v=4l7Is6vOAOA
Rob's Original Discussions on General AI: https://www.youtube.com/playlist?list=PLzH6n4zXuckquVnQ0KlMDxyT5YE-sA8Ps

Stop Button Solution?: https://youtu.be/9nktr1MgS-A

More from Rob Miles: http://bit.ly/Rob_Miles_YouTube

Thanks to Nottingham Hackspace for providing the filming location: http://bit.ly/notthack


http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

There's No Rule That Says We'll Make It

January 16, 2022 8:36 pm

We're not doomed! But doom is definitely an actual possibility, and we need to act like it.
If you're thinking about working on AI Safety, check out AI Safety Support:
https://www.aisafetysupport.org/resources/lots-of-links
(Disclosure: I sit on the board of this organisation)

There are lots of jobs on the 80k Job Board:
https://80000hours.org/job-board/ai-safety-policy/
...

Why Do I Avoid Sci-fi?

August 16, 2022 8:30 am

How come I tend to avoid discussing specific scenarios when talking about AI risks? ...

Intro to AI Safety, Remastered

June 24, 2021 5:25 pm

An introduction to AI Safety, remastered from a talk I gave at "AI and Politics" in London

The second channel: https://www.youtube.com/channel/UC4qH2AHly_RSRze1bUqSSNw

Experts' Predictions about the Future of AI: http://youtu.be/HOJ1NVtlnyQ
9 Examples of Specification Gaming: http://youtu.be/nKJlF-olKmg

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:
Gladamas
Timothy Lillicrap
Kieryn
AxisAngles
James
Nestor Politics
Scott Worley
James Kirkland
James E. Petts
Chad Jones
Shevis Johnson
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Teague Lasser
Andrew Blackledge
Frank Marsman
Brad Brookshire
Cam MacFarlane
Craig Mederios
Jon Wright
CaptObvious
Brian Lonergan
Jason Hise
Phil Moyer
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Eric James
Matheson Bayley
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Johnny Vaughan
Carsten Milkau
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Tom O'Connor
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Cooper Lawton
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Tor Barstad
Duncan Orr
Will Glynn
Tyler Herrmann
Ian Munro
Joshua Davis
Jérôme Beaulieu
Nathan Fish
Peter Hozák
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
12tone
Oliver Habryka
Chris Beacham
Zachary Gidwitz
Nikita Kiriy
Andrew Schreiber
Steve Trambert
Braden Tisdale
Abigail Novick
Serge Var
Mink
Chris Rimmer
Edmund Fokschaner
J
Nate Gardner
John Aslanides
Mara
ErikBln
DragonSheep
Richard Newcombe
Joshua Michel
Alex Altair
P
David Morgan
Fionn
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Kabs
Ammar Mousali
Miłosz Wierzbicki
Tendayi Mawushe
Jake Fish
Wr4thon
Martin Ottosen
Robert Hildebrandt
Andy Kobre
Kees
Darko Sperac
Robert Valdimarsson
loopuleasa
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Alex Knauth
Kasper
Ian Reyes
James Fowkes
Tom Sayer
Len
Alan Bandurka
Ben H
Simon Pilkington
Daniel Kokotajlo
Yuchong Li
Diagon
Andreas Blomqvist
Bertalan Bodor
Qwijibo (James)
Zubin Madon
Zannheim
Daniel Eickhardt
lyon549
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Stuart Alldritt
Alexander Brown
Devon Bernard
Ted Stokes
Jesper Andersson
DeepFriedJif
Chris Dinant
Raphaël Lévy
Johannes Walter
Matt Stanton
Garrett Maring
Anthony Chiu
Ghaith Tarawneh
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
Clay Upton
Conor Comiconor
Michael Roeschter
Georg Grass
Isak Renström
Matthias Hölzl
Jim Renney
Edison Franklin
Piers Calderwood
Mikhail Tikhomirov
Matt Brauer
Jaeson Booker
Mateusz Krzaczek
Artem Honcharov
Michael Walters
Tomasz Gliniecki
Mihaly Barasz
Mark Woodward
Ranzear
Neil Palmere
Rajeen Nabid
Christian Epple
Clark Schaefer
Olivier Coutu
Iestyn bleasdale-shepherd
MojoExMachina
Marek Belski
Luke Peterson
Eric Eldard
Eric Rogstad
Eric Carlson
Caleb Larson
Max Chiswick
Aron
Sam Freedo
slindenau
A21
Johannes Lindmark
Nicholas Turner
Intensifier
Valerio Galieni
FJannis
Grant Parks
Ryan W Ammons
This person's name is too hard to pronounce
kp
contalloomlegs
Everardo González Ávalos
Knut Løklingholm
Andrew McKnight
Andrei Trifonov
Aleks D
Mutual Information
Tim
A Socialist Hobgoblin
Bren Ehnebuske
Martin Frassek
Sven Drebitz
...

Ep. 136: The case for taking AI Safety seriously | Rob Miles

July 4, 2023 10:00 am

Through a series of popular explainer videos, Rob has become one of the most prominent voices in the AI safety community, exploring topics like cryptography, recursive self-improvement, and mesa-optimization with hundreds of thousands of fans. ...

ROBERT MILES - "There is a good chance this kills everyone"

May 21, 2023 6:16 pm

Please check out Numerai, our sponsor: http://numer.ai/mlst

Numerai is a groundbreaking platform which is taking the data science world by storm. Tim has been using Numerai to build state-of-the-art models which predict the stock market, all while being a part of an inspiring community of data scientists from around the globe. They host the Numerai Data Science Tournament, where data scientists like us use their financial dataset to predict future stock market performance.

Support us! https://www.patreon.com/mlst
MLST Discord: https://discord.gg/aNPkGUQtc5
Twitter: https://twitter.com/MLStreetTalk

Welcome to an exciting episode featuring an outstanding guest, Robert Miles! Renowned for his extraordinary contributions to understanding AI and its potential impacts on our lives, Robert is an artificial intelligence advocate, researcher, and YouTube sensation. He combines engaging discussions with entertaining content, captivating millions of viewers from around the world.

With a strong computer science background, Robert has been actively involved in AI safety projects, focusing on raising awareness about the potential risks and benefits of advanced AI systems. His YouTube channel is celebrated for making AI safety discussions accessible to a diverse audience by breaking down complex topics into easy-to-understand nuggets of knowledge, and you might also recognise him from his appearances on Computerphile.

In this episode, join us as we dive deep into Robert's journey in the world of AI, exploring his insights on AI alignment, superintelligence, and the role of AI shaping our society and future. We'll discuss topics such as the limits of AI capabilities and physics, AI progress and timelines, human-machine hybrid intelligence, AI in conflict and cooperation with humans, and the convergence of AI communities.

Robert Miles:
@RobertMilesAI
https://twitter.com/robertskmiles
https://aisafety.info/

Panel:
Dr. Tim Scarfe
Dr. Keith Duggar
Joint CTOs - https://xrai.glass/

Pod version: https://podcasters.spotify.com/pod/show/machinelearningstreettalk/episodes/ROBERT-MILES---There-is-a-good-chance-this-kills-everyone-e24eio7

Refs:
Are Emergent Abilities of Large Language Models a Mirage? (Rylan Schaeffer)
https://arxiv.org/abs/2304.15004

TOC:
Intro [00:00:00]
Numerai Sponsor Message [00:02:17]
AI Alignment [00:04:27]
Limits of AI Capabilities and Physics [00:18:00]
AI Progress and Timelines [00:23:52]
AI Arms Race and Innovation [00:31:11]
Human-Machine Hybrid Intelligence [00:38:30]
Understanding and Defining Intelligence [00:42:48]
AI in Conflict and Cooperation with Humans [00:50:13]
Interpretability and Mind Reading in AI [01:03:46]
Mechanistic Interpretability and Deconfusion Research [01:05:53]
Understanding the core concepts of AI [01:07:40]
Moon landing analogy and AI alignment [01:09:42]
Cognitive horizon and limits of human intelligence [01:11:42]
Funding and focus on AI alignment [01:16:18]
Regulating AI technology and potential risks [01:19:17]
Aligning AI with human values and its dynamic nature [01:27:04]
Cooperation and Allyship [01:29:33]
Orthogonality Thesis and Goal Preservation [01:33:15]
Anthropomorphic Language and Intelligent Agents [01:35:31]
Maintaining Variety and Open-ended Existence [01:36:27]
Emergent Abilities of Large Language Models [01:39:22]
Convergence vs Emergence [01:44:04]
Criticism of X-risk and Alignment Communities [01:49:40]
Fusion of AI communities and addressing biases [01:52:51]
AI systems integration into society and understanding them [01:53:29]
Changing opinions on AI topics and learning from past videos [01:54:23]
Utility functions and von Neumann-Morgenstern theorems [01:54:47]
AI Safety FAQ project [01:58:06]
Building a conversation agent using AI safety dataset [02:00:36]
...

Robert Miles–Youtube, Doom

August 19, 2022 6:00 pm

Robert Miles started out making videos for Computerphile, then decided to create his own YouTube channel about AI Safety. Lately, he's been working on a Discord community that uses Stampy the chatbot to answer YouTube comments. We also spend some time discussing recent AI progress and why Rob is not that optimistic about humanity's survival.

Transcript & audio: https://theinsideview.ai/rob
Host: https://twitter.com/MichaelTrazzi
Rob: https://twitter.com/robertskmiles

OUTLINE
00:00:00 Intro
00:02:25 Computerphile
00:04:31 Yudkowsky
00:08:21 Teaching Science
00:11:21 Starting Youtube
00:15:42 Tiktok
00:21:34 Rob’s secret project
00:28:30 Discord
00:35:09 Stampy.ai
00:38:56 Learning by Answering Questions
00:44:27 Stampy Karma System
00:51:24 AI Alignment chart
00:55:48 Neocortex and Socratic models
00:59:11 Exponential AI Progress
01:04:05 Hiring assistants
01:07:43 Why Chatbots had no impact
01:13:51 How Stampy might help
01:16:48 Tool Agent dichotomy
01:26:10 Avoiding Doom
01:59:34 Formalizing AI Alignment
02:08:43 How Rob approaches alignment
02:14:40 AI Timelines
02:25:45 pre-AGI Regulations
02:40:22 Rob’s new channel
...

Would You Help an AI to Escape?

July 8, 2022 5:01 pm

Could an AGI get a human to help them, just through a text conversation? Recently we got some new evidence about that question. ...

Rob Miles - Why should I care about AI safety?

December 2, 2020 5:01 pm

In the latest episode of the TDS podcast, host Jeremie Harris talks with guest Rob Miles about AI safety, AI and the course of human evolution, and challenges with taming advanced AI.

0:00 Intro
1:18 AI safety as a problem
7:11 The intellectual process
13:02 AI risk argument
15:32 What the process is optimizing for
17:45 Arguments against AI safety
23:55 Focus on alignment
27:04 Implications of taking this technology to the limit
28:55 The schools of thought
32:31 Recent innovations
41:24 How to start learning about AI safety
45:10 Wrap-up
...

Ep 12 - Education & advocacy for AI safety w/ Rob Miles (YouTube host)

March 9, 2024 1:05 am

We speak with Rob Miles. Rob is the host of the “Robert Miles AI Safety” channel on YouTube, the single most popular AI alignment video series out there — he has 145,000 subscribers and his top video has ~600,000 views. He goes much deeper than many educational resources out there on alignment, going into important technical topics like the orthogonality thesis, inner misalignment, and instrumental convergence.

Through his work, Robert has educated thousands on AI safety, including many now working on advocacy, policy, and technical research. His work has been invaluable for teaching and inspiring the next generation of AI safety experts and deepening public support for the cause.

Prior to his AIS education work, Robert studied Computer Science at the University of Nottingham.

We talk to Rob about:

* What got him into AI safety
* How he started making educational videos for AI safety
* What he's working on now
* His top advice for people who also want to do education & advocacy work, really in any field, but especially for AI safety
* How he thinks AI safety is currently going as a field of work
* What he wishes more people were working on within AI safety

Hosted by Soroush Pour. Follow me for more AGI content:
Twitter: https://twitter.com/soroushjp
LinkedIn: https://www.linkedin.com/in/soroushjp/

== Show links ==

-- About Rob --

* Rob Miles AI Safety channel - @RobertMilesAI
* Twitter - https://twitter.com/robertskmiles

-- Further resources --

* Channel where Rob first started making videos: @Computerphile
* Podcast ep w/ Eliezer Yudkowsky, whose writings first convinced Rob to take AI safety seriously: https://lexfridman.com/eliezer-yudkowsky/
...

9 Examples of Specification Gaming

April 29, 2020 6:41 pm

AI systems do what you say, and it's hard to say exactly what you mean.
Let's look at a list of real life examples of specification gaming!
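
To make "do what you say" versus "what you mean" concrete, here is a minimal sketch of specification gaming. It is a made-up toy, not an entry from the list: the cleaning robot, the action names, and all the numbers are assumptions chosen for illustration. The specified reward only counts the mess the robot's camera can see, while the intended reward counts the mess actually left in the room.

```python
# Hypothetical toy example: every action and number here is invented.
ACTIONS = {
    "clean the room":   {"mess_left": 0,  "mess_seen": 0,  "effort": 5},
    "cover the camera": {"mess_left": 10, "mess_seen": 0,  "effort": 1},
    "do nothing":       {"mess_left": 10, "mess_seen": 10, "effort": 0},
}


def specified_reward(outcome):
    """What we said: penalise mess the camera can see, plus effort."""
    return -outcome["mess_seen"] - outcome["effort"]


def intended_reward(outcome):
    """What we meant: penalise mess actually left in the room, plus effort."""
    return -outcome["mess_left"] - outcome["effort"]


# An optimiser simply picks whichever action scores highest.
best_for_spec = max(ACTIONS, key=lambda a: specified_reward(ACTIONS[a]))
best_for_intent = max(ACTIONS, key=lambda a: intended_reward(ACTIONS[a]))

print("Optimising the specification picks:", best_for_spec)
print("Optimising the intention picks:   ", best_for_intent)
```

Under these assumed numbers the two rewards agree on most outcomes, but the action that scores best on the specification is one the designer never wanted.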

Related Videos from me:
Reward Hacking: https://youtu.be/92qDfT8pENs
Reward Hacking Reloaded: https://youtu.be/46nsTFfsBuc
What Can We Do About Reward Hacking?: https://youtu.be/13tZ9Yia71c

The list: http://tinyurl.com/specification-gaming
The blogpost this video is based on: https://vkrakovna.wordpress.com/2018/04/02/specification-gaming-examples-in-ai/
The newer blogpost that happened while I was making this video: https://deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI-ingenuity

(Explosion graphic from videezy.com)

Thanks to my wonderful patrons:
https://www.patreon.com/robertskmiles

Gladamas
James
Steef
Scott Worley
Chad Jones
Chris Canal
David Reid
Francisco Tolmasky
Frank Kurka
Jake Ehrlich
JJ Hepboin
Kellen lask
Michael Andregg
Pedro A Ortega
Peter Rolf
Said Polat
Teague Lasser
Allen Faure
Bryce Daifuku
Clemens Arbesser
Eric James
Erik de Bruijn
Jason Hise
jugettje dutchking
Ludwig Schubert
Qeith Wreid
Andrew Harcourt
anul kumar sinha
Ben Glanton
Benjamin Watkin
Cooper Lawton
Duncan Orr
Eric Scammell
Euclidean Plane
Ian Munro
Igor Keller
Ingvi Gautsson
James Hinchcliffe
Jeroen De Dauw
Jon Halliday
Jonatan R
Julius Brash
Jérôme Beaulieu
Laura Olds
Luc Ritchie
Lupuleasa Ionuț
Michael Greve
Nathan Fish
Nicholas Guyett
Paul Hobbs
Sean Gibat
Sebastian Birjoveanu
Shevis Johnson
Taras Bobrovytsky
Tim Neilson
Tom O'Connor
Tomas Sayder
Tyler Herrmann
Vaskó Richárd
Will Glynn
12tone
14zRobot
Alan Bandurka
Alexander Brown
Anders Öhrt
Andreas Blomqvist
Andrew Weir
Andy Kobre
Anne Kohlbrenner
Anthony Chiu
Archy de Berker
Ben Archer
Ben H
Ben Schultz
Bertalan Bodor
Brian Gillespie
Bryan Egan
Caleb
Chris Dinant
Daniel Bartovic
Daniel Eickhardt
Daniel Kokotajlo
Daniel Munter
Darko Sperac
David Morgan
DeepFriedJif
Devon Bernard
Diagon
Dmitri Afanasjev
Fionn
Fraser Cain
Garrett Maring
Ghaith Tarawneh
HD
Hendrik
ib_
Igor (Kerogi) Kostenko
Ihor Mukha
Ivan
James Fowkes
Jannik Olbrich
Jason Cherry
Jeremy
Jesper Andersson
Jim T
Johannes Walter
Josh Trevisiol
Julian Schulz
Jussi Männistö
Kabs
Kasper
Kasper Schnack
Kees
Klemen Slavic
Leo
lyon549
Marc Pauly
Marcel Ward
Marco Tiraboschi
Marko Topolnik
Martin Ottosen
Matt Stanton
Melisa Kostrzewski
Michael Bates
Michael Kuhinica
Miłosz Wierzbicki
Mo Hossny
Nathaniel Raddin
Oct todo22
Owen Campbell-Moore
Parker Lund
Patrick Henderson
Paul Moffat
Poker Chen
Rob Dawson
Robert Hildebrandt
robertvanduursen
Robin Scharf
Russell schoen
Scott Viteri
Simon Pilkington
Stellated Hexahedron
Tatiana Ponomareva
Ted Stokes
Tendayi Mawushe
Thomas Dingemanse
...

Why Would AI Want to do Bad Things? Instrumental Convergence

March 24, 2018 9:51 pm

How can we predict that AGI with unknown goals would behave badly by default?

The Orthogonality Thesis video: https://www.youtube.com/watch?v=hEUO6pjwFOo
Instrumental Convergence: https://arbital.com/p/instrumental_convergence/
Omohundro 2008, Basic AI Drives: https://selfawaresystems.files.wordpress.com/2008/01/ai_drives_final.pdf

With thanks to my excellent Patrons at https://www.patreon.com/robertskmiles :

Jason Hise
Steef
Jason Strack
Chad Jones
Stefan Skiles
Jordan Medina
Manuel Weichselbaum
1RV34
Scott Worley
JJ Hepboin
Alex Flint
James McCuen
Richárd Nagyfi
Ville Ahlgren
Alec Johnson
Simon Strandgaard
Joshua Richardson
Jonatan R
Michael Greve
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Tom O'Connor
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Alexei Vasilkov
Maksym Taran
Laura Olds
Jon Halliday
Robert Werner
Paul Hobbs
Jeroen De Dauw
Konsta
William Hendley
DGJono
robertvanduursen
Scott Stevens
Michael Ore
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs Kabs
Phil
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Vincent Sanders
Marc Pauly
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Paul Moffat
Noel Kocheril
Jelle Langen
Lars Scholz
...

Is AI Safety a Pascal's Mugging?

May 16, 2019 4:11 pm

An event that's very unlikely is still worth thinking about, if the consequences are big enough. What's the limit though?

Do we have to devote all of our resources to any outcome that might give infinite payoffs, even if it seems basically impossible? Does the case for AI Safety rely on this kind of Pascal's Wager argument? Watch this video to find out that the answer to these questions is 'No'.

Correction: At 6:34 the embedded video says 3^^^3 has 3.6 trillion digits, but that's actually only the size of 3^^4. 3^^^3 is enormously larger.
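
As a rough check on the digit counts in the correction above, the number of decimal digits of 3^^4 can be estimated without ever building the number. This is a sketch using Knuth's up-arrow notation, where 3^^3 = 3^(3^3) and 3^^4 = 3^(3^^3); the helper name `digits_of_power` is an assumption made for illustration, and the result is a floating-point estimate rather than an exact proof.

```python
import math


def digits_of_power(base: int, exponent: int) -> int:
    """Estimate the number of decimal digits of base**exponent via logarithms."""
    return math.floor(exponent * math.log10(base)) + 1


# 3^^3 is a tower of three 3s: 3^(3^3) = 3^27 = 7625597484987.
three_tetrated_3 = 3 ** (3 ** 3)

# 3^^4 = 3^(3^^3) is far too large to compute directly,
# but its digit count only needs the exponent.
digits_of_3_tetrated_4 = digits_of_power(3, three_tetrated_3)

print("3^^3 =", three_tetrated_3)
print(f"3^^4 has about {digits_of_3_tetrated_4:.2e} digits")
```

3^^^3 is 3^^(3^^3), a power tower of 7625597484987 threes, so even its number of digits is far too large to write down this way.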

The Alignment Newsletter Podcast: http://alignment-newsletter.libsyn.com/
RSS feed to put into apps: http://alignment-newsletter.libsyn.com/rss

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Jordan Medina
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
Jake Ehrlich
Mark Hechim
Kellen lask
Francisco Tolmasky
Michael Andregg
James
Richárd Nagyfi
Phil Moyer
Shevis Johnson
Alec Johnson
Lupuleasa Ionuț
Clemens Arbesser
Bryce Daifuku
Allen Faure
Simon Strandgaard
Jonatan R
Michael Greve
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
A.Russell
Cooper Lawton
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Gladamas
Sylvain Chevalier
DGJono
robertvanduursen
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Archy de Berker
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Seth Brothwell
Brian Goodrich
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Volotat
Duncan Orr
Bryan Egan
James Fowkes
Frame Problems
Alan Bandurka
Benjamin Hull
Dave Tapley
Tatiana Ponomareva
Aleksi Maunu
Michael Bates
Simon Pilkington
Dion Gerald Bridger
Steven Cope
Petr Smital
Daniel Kokotajlo
Joshua Davis
Fionn
Tyler LaBean
Roger
Yuchong Li
Nathan Fish
Diagon
Giancarlo Pace

...

Intelligence and Stupidity: The Orthogonality Thesis

January 11, 2018 9:53 pm

Can highly intelligent agents have stupid goals?
A look at The Orthogonality Thesis and the nature of stupidity.

The 'Stamp Collector' Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4
My other Computerphile videos: https://www.youtube.com/watch?v=GSIDS_lvRv4&list=PLqL14ZxTTA4fRMts7Af2G8t4Rp17e8MdS

Katie Byrne's Channel: https://www.youtube.com/channel/UCwA5wd50HYLa-ZY80LSAg3w
Chad Jones' Channel: https://www.youtube.com/user/cjone150

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:
- Steef
- Sara Tjäder
- Jason Strack
- Chad Jones
- Stefan Skiles
- Ziyang Liu
- Jordan Medina
- Jason Hise
- Manuel Weichselbaum
- 1RV34
- James McCuen
- Richárd Nagyfi
- Ammar Mousali
- Scott Zockoll
- Ville Ahlgren
- Alec Johnson
- Simon Strandgaard
- Joshua Richardson
- Jonatan R
- Michael Greve
- robertvanduursen
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Volodymyr
- David Tjäder
- Paul Mason
- Ben Scanlon
- Julius Brash
- Mike Bird
- Tom O'Connor
- Gunnar Guðvarðarson
- Shevis Johnson
- Erik de Bruijn
- Robin Green
- Alexei Vasilkov
- Maksym Taran
- Laura Olds
- Jon Halliday
- Robert Werner
- Roman Nekhoroshev
- Konsta
- William Hendley
- DGJono
- Matthias Meger
- Scott Stevens
- Emilio Alvarez
- Michael Ore
- Dmitri Afanasjev
- Brian Sandberg
- Einar Ueland
- Lo Rez
- Marcel Ward
- Andrew Weir
- Taylor Smith
- Ben Archer
- Scott McCarthy
- Kabs Kabs
- Phil
- Tendayi Mawushe
- Gabriel Behm
- Anne Kohlbrenner
- Jake Fish
- Bjorn Nyblad
- Stefan Laurie
- Jussi Männistö
- Cameron Kinsel
- Matanya Loewenthal
- Wr4thon
- Dave Tapley
- Archy de Berker
- Kevin
- Vincent Sanders
- Marc Pauly
- Andy Kobre
- Brian Gillespie
- Martin Wind
- Peggy Youell
- Poker Chen
...

A Response to Steven Pinker on AI

March 31, 2019 3:39 pm

Steven Pinker wrote an article on AI for Popular Science Magazine, which I have some issues with.

The article: https://www.popsci.com/robot-uprising-enlightenment-now

Related:
"The Orthogonality Thesis, Intelligence, and Stupidity" (https://youtu.be/hEUO6pjwFOo)
"AI? Just Sandbox it... - Computerphile" (https://youtu.be/i8r_yShOixM)
"Experts' Predictions about the Future of AI" (https://youtu.be/HOJ1NVtlnyQ)
"Why Would AI Want to do Bad Things? Instrumental Convergence" (https://youtu.be/ZeecOKBus3Q)

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Jordan Medina
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
James
Richárd Nagyfi
Phil Moyer
Shevis Johnson
Alec Johnson
Lupuleasa Ionuț
Clemens Arbesser
Bryce Daifuku
Allen Faure
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
Volotat
andrew Russell
Cooper Lawton
Gladamas
Sylvain Chevalier
DGJono
robertvanduursen
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Archy de Berker
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Seth Brothwell
Brian Goodrich
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Duncan Orr
Andrew Walker
Bryan Egan


...

What can AGI do? I/O and Speed

October 17, 2017 12:20 pm

Suppose we make an algorithm that implements general intelligence as well as the brain. What could that system do?
It might have better input and output than a human, and probably could be run faster...

The Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

They're Made Out Of Meat: https://www.youtube.com/watch?v=7tScAyNaRdQ
The Slow Mo Guys' Channel: https://www.youtube.com/user/theslowmoguys

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Jonatan R
Øystein Flygt
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Ville Ahlgren

Roman Nekhoroshev
Peggy Youell
Konstantin Shabashov
William Hendley
Adam Dodd
DGJono
Matthias Meger
Scott Stevens
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs
Phil
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
pmilian
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker

...

Why Not Just: Think of AGI Like a Corporation?

December 23, 2018 10:01 pm

Corporations are kind of like AIs, if you squint. How hard do you have to squint though, and is it worth it?
In this video we ask: Are corporations artificial general superintelligences?


Related:
"What can AGI do? I/O and Speed" (https://youtu.be/gP4ZNUHdwp8)
"Why Would AI Want to do Bad Things? Instrumental Convergence" (https://youtu.be/ZeecOKBus3Q)

Media Sources:
"SpaceX - How Not to Land an Orbital Rocket Booster" (https://youtu.be/bvim4rsNHkQ)
Undertale - Turbosnail
Clerks (1994)
Zootopia (2016)
AlphaGo (2017)
Ready Player One (2018)

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jordan Medina
Jason Hise
Pablo Eder
Scott Worley
JJ Hepboin
Pedro A Ortega
James McCuen
Richárd Nagyfi
Phil Moyer
Alec Johnson
Bobby Cold
Clemens Arbesser
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
David Tjäder
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
Jérôme Frossard
Sean Gibat
Sylvain Chevalier
DGJono
robertvanduursen
Scott Stevens
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs Kabs Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Marc Pauly
Joshua Pratt
Gunnar Guðvarðarson
Shevis Johnson
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Lupuleasa Ionuț
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Shawn Hartsock
Seth Brothwell
Brian Goodrich
Michael S McReynolds
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson

https://www.patreon.com/robertskmiles
...

AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1

October 29, 2017 1:49 pm

Some beautiful new GAN results have been published, so let's have a quick look at the pretty pictures.
More AI Safety coming soon of course.

With apologies to Two Minute Papers: https://www.youtube.com/user/keeroyz
The Computerphile video: https://www.youtube.com/watch?v=Sw9r8CL98N0
The Paper "Unsupervised Representation Learning With Deep Convolutional GANs": https://arxiv.org/pdf/1511.06434.pdf
The Paper "Progressive Growing Of GANs For Improved Quality, Stability, And Variation": http://research.nvidia.com/sites/default/files/pubs/2017-10_Progressive-Growing-of//karras2017gan-paper.pdf
The AMAZING video for that paper: https://www.youtube.com/watch?v=XOxxPcy5Gr4


With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Jonatan R
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Ville Ahlgren
Roman Nekhoroshev
Peggy Youell
Konsta
William Hendley
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs Kabs Kabs
Phil
Christopher Askin
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
pmilian
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker

https://www.patreon.com/robertskmiles
...

The other "Killer Robot Arms Race" Elon Musk should worry about

August 22, 2017 1:19 pm

Elon Musk is in the news, talking to the UN about autonomous weapons. This seems like a good time to explain one area where we don't quite agree about AI Safety.

The Article: http://www.independent.co.uk/news/science/killer-robots-arms-race-tesla-elon-musk-and-google-mustafa-suleyman-un-autonomous-weapons-a7903906.html

The clip at 2:54 is from a Y Combinator interview: "Elon Musk : How to Build the Future": https://youtu.be/tnBQmEqBCY0

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
Krethys
...

Why Not Just: Raise AI Like Kids?

July 22, 2017 3:58 pm

Newly made Artificial General Intelligences are basically like children, right? So we already know we can teach them how to behave, right? Wrong.

References to this Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4

and this paper: https://intelligence.org/files/ValueLearningProblem.pdf

Thanks to my amazing Patreon Supporters:
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
James McCuen
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
https://www.patreon.com/robertskmiles
...

Are AI Risks like Nuclear Risks?

June 10, 2017 5:22 pm

Concerns about AI cover a really wide range of possible problems. Can we make progress on several of these problems at once?

With thanks to my Patreon supporters:
- Ichiro Dohi
- Stefan Skiles
- Chad Jones
- Joshua Richardson
- Fabian Consiglio
- Jonatan R
- Øystein Flygt
- Björn Mosten
- Michael Greve
- robertvanduursen
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Peggy Youell
- Konstantin Shabashov
- The Dodd
- DGJono
- Matthias Meger
- Scott Stevens
- Emilio Alvarez
https://www.patreon.com/robertskmiles
...

Deadly Truth of General AI? - Computerphile

June 17, 2015 3:09 pm

The danger of assuming general artificial intelligence will be the same as human intelligence. Rob Miles explains with a simple example: The deadly stamp collector.

The Problem with JPEG: https://youtu.be/yBX8GFqt6GA
Apple's $200,000 Computer: https://youtu.be/PccvZRTUhbI
Rabbits, Faces & Hyperspaces: https://youtu.be/q6iqI2GIllI

Thanks to Nottingham Hackspace for the location.

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

Holy Grail of AI (Artificial Intelligence) - Computerphile

May 1, 2015 5:19 pm

Audible free book: http://www.audible.com/computerphile
Why can't artificial intelligence do what humans can? Rob Miles talks about generality in intelligence.

Sean Comments/Questions (For those who can't hear him clearly)
11secs: "This was the hill climbing algorithm?"
2min 40sec: "recently with Professor Brailsford we did the idea of The Turing Test, so that strikes me from what you're saying that that's a very specific domain: pretending to be a human talking?"
4min 2sec: "is that like 'humans have been changing the world to meet their needs?' "
4min 40sec: "but on a bigger scale, as you say on a grander scale, building a dam, and er irrigating a field, and putting a pipe to your house and allowing you to have a tap {faucet} is doing the same thing but on a grander scale."
6min 39sec: "all these dimensions, if you try to brute force infinite dimensions you're gonna fall over pretty quickly?"
6min 45sec: "change the world" (in ref to him picking up the drink!)

Retro Z80 Computer: https://youtu.be/OtpaY8VD52g
Hill Climbing Algorithm & Artificial Intelligence: http://youtu.be/oSdPmxRCWws
The Turing Test: https://youtu.be/Qbp3LJvcX38
Arduino Uno: https://youtu.be/b4z1zkmo1BE
Rabbits, Faces & Hyperspaces: https://youtu.be/q6iqI2GIllI

Thanks to Nottingham Hackspace.

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

Why Asimov's Laws of Robotics Don't Work - Computerphile

November 6, 2015 3:01 pm

Audible Free Book: http://www.audible.com/computerphile
Three or four laws to make robots and AI safe - should be simple right? Rob Miles on why these simple laws are so complicated.

Silicon Brain: 1,000,000 ARM Cores: https://youtu.be/2e06C-yUwlc
Chip & PIN Fraud: https://youtu.be/Ks0SOn8hjG8
AI Worst Case Scenario - Deadly Truth of AI: https://youtu.be/tcdVC4e6EV4
The Singularity & Friendly AI: https://youtu.be/uA9mxq3gneE
AI Self Improvement: https://youtu.be/5qfIgCiYlfY

Thanks to Nottingham Hackspace for the location

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

AI Safety - Computerphile

February 3, 2016 8:22 pm

Safety in AI is important, but more important is to work it out before working out the AI itself. Rob Miles on AI safety.

Brain Scanner: https://youtu.be/TQ0sL1ZGnQ4
AI Worst Case Scenario - Deadly Truth of AI: https://youtu.be/tcdVC4e6EV4
The Singularity & Friendly AI: https://youtu.be/uA9mxq3gneE
AI Self Improvement: https://youtu.be/5qfIgCiYlfY
Why Asimov's Three Laws Don't Work: https://youtu.be/7PKx3kS7f4A

Thanks to Nottingham Hackspace for the location.



http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

AI's Game Playing Challenge - Computerphile

March 24, 2016 10:18 pm

AlphaGo is beating humans at Go - What's the big deal? Rob Miles explains what AI has to do to play a game.

What on Earth is Recursion?: https://youtu.be/Mv9NEXX1VHc
Object Oriented Programming: https://youtu.be/KyTUN6_Z9TM
Mixed Reality Continuum: https://youtu.be/V4qxfFPgqdc
AI Playlist: https://www.youtube.com/playlist?list=PLzH6n4zXuckoewGfo3a6ShFS3zPKndPd3

Many thanks to Nottingham Hackspace for providing the location and being downright awesome

Easter Egg: https://youtu.be/B8CujhUwVic

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

General AI Won't Want You To Fix its Code - Computerphile

February 28, 2017 9:02 pm

Part 1 of a Series on AI Safety Research with Rob Miles. Rob heads away from his 'Killer Stamp Collector' example to find a more concrete example of the problem.

Sneak Peek at Part 2: https://www.youtube.com/watch?v=3TYT1QfdfsM

More about Rob Miles & AI Safety: http://bit.ly/Rob_Miles_YouTube

Thanks to Nottingham Hackspace for providing the filming location: http://bit.ly/notthack

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

AI? Just Sandbox it... - Computerphile

June 23, 2017 4:44 pm

Why can't we just disconnect a malevolent AI? Rob Miles on some of the simplistic solutions to AI safety.

Out of focus shots caused by faulty camera and "slow to realise" operator - it has been sent for repair - the camera, not the operator.... (Sean, June 2017)

More from Rob Miles on his channel: http://bit.ly/Rob_Miles_YouTube

Concrete Problems in AI Safety: https://youtu.be/AjyM-f8rDpg
End to End Encryption: https://youtu.be/jkV1KEJGKRA
Microsoft Hololens: https://youtu.be/gp8UiYOw8Fc

Thanks to Nottingham Hackspace for the location.

http://www.facebook.com/computerphile
https://twitter.com/computer_phile

This video was filmed and edited by Sean Riley.

Computer Science at the University of Nottingham: http://bit.ly/nottscomputer

Computerphile is a sister project to Brady Haran's Numberphile. More at http://www.bradyharan.com
...

Learn AI Safety at MATS #shorts

September 28, 2024 12:12 am

Apply for MATS at matsprogram.org by Oct 6

#shorts
...

AI Ruined My Year

June 1, 2024 8:17 am

How to Help: https://aisafety.info/questions/8TJV/How-can-I-help
https://www.aisafety.com/

AI Safety Talks: https://www.youtube.com/@aisafetytalks

There's No Rule That Says We'll Make It: https://www.youtube.com/watch?v=JD_iA7imAPs
The other "Killer Robot Arms Race" Elon Musk should worry about: https://www.youtube.com/watch?v=7FCEiCnHcbo

Rob's Reading List:
Podcast: https://rmrlp.libsyn.com/
YouTube Channel: https://www.youtube.com/@RobMilesReadingList
The FLI Open Letter: https://www.youtube.com/watch?v=3GHjhG6Vo40
Yudkowsky in TIME: https://www.youtube.com/watch?v=a6m7JynBp-0
Ian Hogarth in the FT: https://www.youtube.com/watch?v=Z8VvF82T6so

Links:
The CAIS Open Letter: https://www.safe.ai/work/statement-on-ai-risk
The FLI Open Letter: https://futureoflife.org/open-letter/pause-giant-ai-experiments/
The Bletchley Declaration: https://www.gov.uk/government/publications/ai-safety-summit-2023-the-bletchley-declaration/the-bletchley-declaration-by-countries-attending-the-ai-safety-summit-1-2-november-2023
US Executive Order: https://www.whitehouse.gov/briefing-room/presidential-actions/2023/10/30/executive-order-on-the-safe-secure-and-trustworthy-development-and-use-of-artificial-intelligence/
Some analysis of the EO: https://thezvi.substack.com/p/on-the-executive-order
"Sparks of AGI" Paper: https://arxiv.org/abs/2303.12712
Yudkowsky in TIME: https://time.com/6266923/ai-eliezer-yudkowsky-open-letter-not-enough/
Hogarth in the FT: https://www.ft.com/content/03895dc4-a3b7-481e-95cc-336a524f2ac2
The AI Safety Institute: https://www.gov.uk/government/publications/ai-safety-institute-overview/introducing-the-ai-safety-institute
Responsible Scaling Policies: https://metr.org/blog/2023-09-26-rsp/
The EU AI Act: https://artificialintelligenceact.eu/the-act/
Hinton on CBS: https://youtu.be/qpoRO378qRY

Sources:
"Sparks of AGI" Talk: https://www.youtube.com/watch?v=qbIk7-JPB2c
Yann LeCun on Lex Fridman's Podcast: https://www.youtube.com/watch?v=SGzMElJ11Cc
White House Press Briefings: https://x.com/TVNewsNow/status/1663640562363252742
https://www.youtube.com/watch?v=JHNkyHl5FpY
King Chuck on AI: https://www.youtube.com/watch?v=0_jw40Ga_mA

"Equally sharing a cake between three people - Numberphile": https://www.youtube.com/watch?v=kaMKInkV7Vs

Community, various screenshots
The Simpsons
Sneakers (1992)

Thanks to Rational Animations for the train sequence!
https://www.youtube.com/@RationalAnimations

With enormous thanks to my wonderful patrons:
- Tor Barstad
- Timothy Lillicrap
- Juan Benet
- Sarah Howell
- Kieryn
- Mazianni
- Scott Worley
- Jason Hise
- Clemens Arbesser
- Francisco Tolmasky
- David Reid
- Andrew Blackledge
- Cam MacFarlane
- Olivier Coutu
- CaptObvious
- Ze Shen Chin
- ikke89
- Isaac
- Erik de Bruijn
- Jeroen De Dauw
- Ludwig Schubert
- Eric James
- Owen Campbell-Moore
- Raf Jakubanis
- Esa Koskinen
- Nathan Metzger
- Jonatan R
- Gunnar
- Laura Olds
- Paul Hobbs
- Bastiaan Cnossen
- Eric Scammell
- Alexare
- Reslav Hollós
- Jérôme Beaulieu
- Nathan Fish
- Taras Bobrovytsky
- Jeremy
- Vaskó Richárd
- Andrew Harcourt
- Chris Beacham
- Zachary Gidwitz
- Art Code Outdoors
- Abigail Novick
- Edmund Fokschaner
- DragonSheep
- Richard Newcombe
- Joshua Michel
- Richard
- ttw
- Sophia Michelle Andren
- Alan J. Etchings
- James Vera
- Stumbleboots
- Peter Lillian
- Grimrukh
- Colin Ricardo
- DN
- Mr Cats
- Robert Paul Schwin
- Roland G. McIntosh
- Benjamin Mock
- Emiliano Hodges
- Maxim Kuzmich
- Joanny Raby
- Tom Miller
- Eran Glicksman
- CheeseBerry
- Hoyskedotte
- Alexey Malafeev
- Jeff Starr
- Justin
- Liviu Macovei
- Javier Soto
- David Christal
- Jam
- Just Me
- Sebastian Zimmer
- Matt Thompson
- Xan Atkinson
- Andy
- Albert Higgins
- Alexander230
- Clay Upton
- Alex Ander
- Carolyn
- Nathan Rogowski
- David Morgan
- little Bang
- Chad M Jones
- Dmitri Afanasjev
- Christian Oehne
- Marcel Ward
- Andrew Weir
- Miłosz Wierzbicki
- Tendayi Mawushe
- Kees
- loopuleasa
- Marco Tiraboschi
- Fraser Cain
- Patrick Henderson
- Daniel Munter
- Ian
- James Fowkes
- Len
- Yuchong Li
- Diagon
- Puffjanga
- Daniel Eickhardt
- 14zRobot
- Stuart Alldritt
- DeepFriedJif
- Garrett Maring
- Stellated Hexahedron
- Jim Renney
- Edison Franklin
- Piers Calderwood
- Matt Brauer
- Mihaly Barasz
- Rajeen Nabid
- Iestyn bleasdale-shepherd
- Marek Belski
- Luke Peterson
- Eric Rogstad
- Max Chiswick
- slindenau
- Nicholas Turner
- Jannis Funk
- This person's name is too hard to pronounce
- Jon Wright
- Andrei Trifonov
- Bren Ehnebuske
- Martin Frassek
- Matthew Shinkle
- Robby Gottesman
- Ohelig
- Sarah
- Nikola Tasev
- Tapio Kortesaari
- Soroush Pour
- Boris Badinoff
- DangerCat
- Jack Phelps
- Kyle Green
- Lexi X
- John Slape
- Joel Gardner
- Christopher Creutzig
- Johann Puzik
- Pindex
- RMR
- Andrew Edstrom
https://www.patreon.com/robertskmiles
...

Apply to Study AI Safety Now! #shorts

April 28, 2023 6:37 pm

Apply to SERI MATS at https://www.serimats.org/ by May 7th, and check out http://aisafety.training to stay up to date with events and programs!
...

Why Does AI Lie, and What Can We Do About It?

December 9, 2022 10:10 pm

How do we make sure language models tell the truth?

The new channel!: https://www.youtube.com/@aisafetytalks
Evan Hubinger's Talk: https://youtu.be/OUifSs28G30

ACX Blog Post: https://astralcodexten.substack.com/p/elk-and-the-problem-of-truthful-ai

With thanks to my wonderful Patrons at http://patreon.com/robertskmiles :
- Tor Barstad
- Kieryn
- AxisAngles
- Juan Benet
- Scott Worley
- Chad M Jones
- Jason Hise
- Shevis Johnson
- JJ Hepburn
- Pedro A Ortega
- Clemens Arbesser
- Chris Canal
- Jake Ehrlich
- Kellen lask
- Francisco Tolmasky
- Michael Andregg
- David Reid
- Teague Lasser
- Andrew Blackledge
- Brad Brookshire
- Cam MacFarlane
- Olivier Coutu
- CaptObvious
- Girish Sastry
- Ze Shen Chin
- Phil Moyer
- Erik de Bruijn
- Jeroen De Dauw
- Ludwig Schubert
- Eric James
- Atzin Espino-Murnane
- Jaeson Booker
- Raf Jakubanis
- Jonatan R
- Ingvi Gautsson
- Jake Fish
- Tom O'Connor
- Laura Olds
- Paul Hobbs
- Cooper
- Eric Scammell
- Ben Glanton
- Duncan Orr
- Nicholas Kees Dupuis
- Will Glynn
- Tyler Herrmann
- Reslav Hollós
- Jérôme Beaulieu
- Nathan Fish
- Peter Hozák
- Taras Bobrovytsky
- Jeremy
- Vaskó Richárd
- Report Techies
- Andrew Harcourt
- Nicholas Guyett
- 12tone
- Oliver Habryka
- Chris Beacham
- Zachary Gidwitz
- Nikita Kiriy
- Art Code Outdoors
- Andrew Schreiber
- Abigail Novick
- Chris Rimmer
- Edmund Fokschaner
- April Clark
- John Aslanides
- DragonSheep
- Richard Newcombe
- Joshua Michel
- Quabl
- Richard
- Neel Nanda
- ttw
- Sophia Michelle Andren
- Trevor Breen
- Alan J. Etchings
- Jenan Wise
- Jonathan Moregård
- James Vera
- Chris Mathwin
- David Shaffer
- Jason Gardner
- Devin Turner
- Andy Southgate
- Lorthock The Banisher
- Peter Lillian
- Jacob Valero
- Christopher Nguyen
- Kodera Software
- Grimrukh
- MichaelB
- David Morgan
- little Bang
- Dmitri Afanasjev
- Marcel Ward
- Andrew Weir
- Ammar Mousali
- Miłosz Wierzbicki
- Tendayi Mawushe
- Wr4thon
- Martin Ottosen
- Alec Johnson
- Kees
- Darko Sperac
- Robert Valdimarsson
- Marco Tiraboschi
- Michael Kuhinica
- Fraser Cain
- Patrick Henderson
- Daniel Munter
- And last but not least
- Ian Reyes
- James Fowkes
- Len
- Alan Bandurka
- Daniel Kokotajlo
- Yuchong Li
- Diagon
- Andreas Blomqvist
- Qwijibo (James)
- Zannheim
- Daniel Eickhardt
- lyon549
- 14zRobot
- Ivan
- Jason Cherry
- Igor (Kerogi) Kostenko
- Stuart Alldritt
- Alexander Brown
- Ted Stokes
- DeepFriedJif
- Chris Dinant
- Johannes Walter
- Garrett Maring
- Anthony Chiu
- Ghaith Tarawneh
- Julian Schulz
- Stellated Hexahedron
- Caleb
- Georg Grass
- Jim Renney
- Edison Franklin
- Jacob Van Buren
- Piers Calderwood
- Matt Brauer
- Mihaly Barasz
- Mark Woodward
- Ranzear
- Rajeen Nabid
- Iestyn bleasdale-shepherd
- MojoExMachina
- Marek Belski
- Luke Peterson
- Eric Rogstad
- Caleb Larson
- Max Chiswick
- Sam Freedo
- slindenau
- Nicholas Turner
- FJannis
- Grant Parks
- This person's name is too hard to pronounce
- Jon Wright
- Everardo González Ávalos
- Knut
- Andrew McKnight
- Andrei Trifonov
- Tim D
- Bren Ehnebuske
- Martin Frassek
- Valentin Mocanu
- Matthew Shinkle
- Robby Gottesman
- Ohelig
- Slobodan Mišković
- Sarah
- Nikola Tasev
- Voltaic
- Sam Ringer
- Tapio Kortesaari

http://patreon.com/robertskmiles
...

Apply Now for a Paid Residency on Interpretability #short

November 11, 2022 8:07 pm

https://www.redwoodresearch.org/remix
Real main channel video coming soon!

#short #shorts
...

$100,000 for Tasks Where Bigger AIs Do Worse Than Smaller Ones #short

October 14, 2022 1:05 pm

Check out http://inversescaling.com for more details and to apply

#shorts #short
...

Free ML Bootcamp for Alignment #shorts

May 24, 2022 7:30 pm

Apply for the second MLAB (Machine Learning for Alignment Bootcamp)!

https://forum.effectivealtruism.org/posts/vvocfhQ7bcBR4FLBx/apply-to-the-second-iteration-of-the-ml-for-alignment
...

Win $50k for Solving a Single AI Problem? #Shorts

February 8, 2022 9:17 pm

The Alignment Research Center is offering up to $50k for proposals for their Eliciting Latent Knowledge problem.

The report: https://docs.google.com/document/d/1WwsnJQstPq91_Yh-Ch2XRL8H_EpsnjrC1dwZXR37PC8/edit

Contest details: https://www.alignmentforum.org/posts/QEYWkRoCn4fZxXQAY/prizes-for-elk-proposals

#Shorts
...

Apply to AI Safety Camp! #shorts

November 19, 2021 8:03 pm

Trying out #shorts
Applications are open for next year's AI Safety Camp!
http://aisafety.camp
...

We Were Right! Real Inner Misalignment

October 10, 2021 10:50 pm

Researchers ran real versions of the thought experiments in the 'Mesa-Optimisers' videos!
What they found won't shock you (if you've been paying attention)

Previous videos on the subject:
The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment: https://youtu.be/bJLcIBixGj8
Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...: https://youtu.be/IeWljQw3UgQ

The Paper: https://arxiv.org/abs/2105.14111
The Interpretability Article: https://distill.pub/2020/understanding-rl-vision/
Jacob Hilton's thoughts about what's going on: https://www.alignmentforum.org/posts/iJDmL7HJtN5CYKReM/empirical-observations-of-objective-robustness-failures?commentId=8f5B3skqocxJjK4mk
AI Safety Camp: https://aisafety.camp/

With thanks to my wonderful Patrons at http://patreon.com/robertskmiles :
- Gladamas
- Timothy Lillicrap
- Kieryn
- AxisAngles
- James
- Jake Fish
- Scott Worley
- James Kirkland
- James E. Petts
- Chad Jones
- Shevis Johnson
- JJ Hepboin
- Pedro A Ortega
- Clemens Arbesser
- Said Polat
- Chris Canal
- Jake Ehrlich
- Kellen lask
- Francisco Tolmasky
- Michael Andregg
- David Reid
- Peter Rolf
- Teague Lasser
- Andrew Blackledge
- Brad Brookshire
- Cam MacFarlane
- Craig Mederios
- Jon Wright
- CaptObvious
- Brian Lonergan
- Girish Sastry
- Jason Hise
- Phil Moyer
- Erik de Bruijn
- Alec Johnson
- Ludwig Schubert
- Eric James
- Matheson Bayley
- Qeith Wreid
- jugettje dutchking
- James Hinchcliffe
- Atzin Espino-Murnane
- Carsten Milkau
- Jacob Van Buren
- Jonatan R
- Ingvi Gautsson
- Michael Greve
- Tom O'Connor
- Laura Olds
- Jon Halliday
- Paul Hobbs
- Jeroen De Dauw
- Cooper Lawton
- Tim Neilson
- Eric Scammell
- Igor Keller
- Ben Glanton
- Tor Barstad
- Duncan Orr
- Will Glynn
- Tyler Herrmann
- Ian Munro
- Jérôme Beaulieu
- Nathan Fish
- Peter Hozák
- Taras Bobrovytsky
- Jeremy
- Vaskó Richárd
- Benjamin Watkin
- Andrew Harcourt
- Luc Ritchie
- Nicholas Guyett
- 12tone
- Oliver Habryka
- Chris Beacham
- Nikita Kiriy
- Andrew Schreiber
- Steve Trambert
- Braden Tisdale
- Abigail Novick
- Serge Var
- Mink
- Chris Rimmer
- Edmund Fokschaner
- April Clark
- J
- Nate Gardner
- John Aslanides
- Mara
- ErikBln
- DragonSheep
- Richard Newcombe
- Joshua Michel
- P
- Alex Doroff
- BlankProgram
- Richard
- David Morgan
- Fionn
- Dmitri Afanasjev
- Marcel Ward
- Andrew Weir
- Kabs
- Ammar Mousali
- Miłosz Wierzbicki
- Tendayi Mawushe
- Wr4thon
- Martin Ottosen
- Andy K
- Kees
- Darko Sperac
- Robert Valdimarsson
- Marco Tiraboschi
- Michael Kuhinica
- Fraser Cain
- Robin Scharf
- Klemen Slavic
- Patrick Henderson
- Hendrik
- Daniel Munter
- Alex Knauth
- Kasper
- Ian Reyes
- James Fowkes
- Tom Sayer
- Len
- Alan Bandurka
- Ben H
- Simon Pilkington
- Daniel Kokotajlo
- Yuchong Li
- Diagon
- Andreas Blomqvist
- Iras
- Qwijibo (James)
- Zubin Madon
- Zannheim
- Daniel Eickhardt
- lyon549
- 14zRobot
- Ivan
- Jason Cherry
- Igor (Kerogi) Kostenko
- ib_
- Thomas Dingemanse
- Stuart Alldritt
- Alexander Brown
- Devon Bernard
- Ted Stokes
- Jesper Andersson
- DeepFriedJif
- Chris Dinant
- Raphaël Lévy
- Johannes Walter
- Matt Stanton
- Garrett Maring
- Anthony Chiu
- Ghaith Tarawneh
- Julian Schulz
- Stellated Hexahedron
- Caleb
- Clay Upton
- Conor Comiconor
- Michael Roeschter
- Georg Grass
- Isak Renström
- Matthias Hölzl
- Jim Renney
- Edison Franklin
- Piers Calderwood
- Mikhail Tikhomirov
- Matt Brauer
- Mateusz Krzaczek
- Artem Honcharov
- Tomasz Gliniecki
- Mihaly Barasz
- Mark Woodward
- Ranzear
- Neil Palmere
- Rajeen Nabid
- Clark Schaefer
- Olivier Coutu
- Iestyn bleasdale-shepherd
- MojoExMachina
- Marek Belski
- Luke Peterson
- Eric Rogstad
- Eric Carlson
- Caleb Larson
- Max Chiswick
- Aron
- Sam Freedo
- slindenau
- Johannes Lindmark
- Nicholas Turner
- Intensifier
- Valerio Galieni
- FJannis
- Grant Parks
- Ryan W Ammons
- This person's name is too hard to pronounce
- contalloomlegs
- Everardo González Ávalos
- Knut Løklingholm
- Andrew McKnight
- Andrei Trifonov
- Aleks D
- Mutual Information
- Tim
- A Socialist Hobgoblin
- Bren Ehnebuske
- Martin Frassek
- Sven Drebitz
- Quabl
- Valentin Mocanu
- Philip Crawford
- Matthew Shinkle
- Robby Gottesman
- Juanchi

http://patreon.com/robertskmiles
...

Intro to AI Safety, Remastered

June 24, 2021 5:25 pm

An introduction to AI Safety, remastered from a talk I gave at "AI and Politics" in London

The second channel: https://www.youtube.com/channel/UC4qH2AHly_RSRze1bUqSSNw

Experts' Predictions about the Future of AI: http://youtu.be/HOJ1NVtlnyQ
9 Examples of Specification Gaming: http://youtu.be/nKJlF-olKmg

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:
Gladamas
Timothy Lillicrap
Kieryn
AxisAngles
James
Nestor Politics
Scott Worley
James Kirkland
James E. Petts
Chad Jones
Shevis Johnson
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Teague Lasser
Andrew Blackledge
Frank Marsman
Brad Brookshire
Cam MacFarlane
Craig Mederios
Jon Wright
CaptObvious
Brian Lonergan
Jason Hise
Phil Moyer
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Eric James
Matheson Bayley
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Johnny Vaughan
Carsten Milkau
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Tom O'Connor
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Cooper Lawton
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Tor Barstad
Duncan Orr
Will Glynn
Tyler Herrmann
Ian Munro
Joshua Davis
Jérôme Beaulieu
Nathan Fish
Peter Hozák
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
12tone
Oliver Habryka
Chris Beacham
Zachary Gidwitz
Nikita Kiriy
Andrew Schreiber
Steve Trambert
Braden Tisdale
Abigail Novick
Serge Var
Mink
Chris Rimmer
Edmund Fokschaner
J
Nate Gardner
John Aslanides
Mara
ErikBln
DragonSheep
Richard Newcombe
Joshua Michel
Alex Altair
P
David Morgan
Fionn
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Kabs
Ammar Mousali
Miłosz Wierzbicki
Tendayi Mawushe
Jake Fish
Wr4thon
Martin Ottosen
Robert Hildebrandt
Andy Kobre
Kees
Darko Sperac
Robert Valdimarsson
loopuleasa
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Alex Knauth
Kasper
Ian Reyes
James Fowkes
Tom Sayer
Len
Alan Bandurka
Ben H
Simon Pilkington
Daniel Kokotajlo
Yuchong Li
Diagon
Andreas Blomqvist
Bertalan Bodor
Qwijibo (James)
Zubin Madon
Zannheim
Daniel Eickhardt
lyon549
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Stuart Alldritt
Alexander Brown
Devon Bernard
Ted Stokes
Jesper Andersson
DeepFriedJif
Chris Dinant
Raphaël Lévy
Johannes Walter
Matt Stanton
Garrett Maring
Anthony Chiu
Ghaith Tarawneh
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
Clay Upton
Conor Comiconor
Michael Roeschter
Georg Grass
Isak Renström
Matthias Hölzl
Jim Renney
Edison Franklin
Piers Calderwood
Mikhail Tikhomirov
Matt Brauer
Jaeson Booker
Mateusz Krzaczek
Artem Honcharov
Michael Walters
Tomasz Gliniecki
Mihaly Barasz
Mark Woodward
Ranzear
Neil Palmere
Rajeen Nabid
Christian Epple
Clark Schaefer
Olivier Coutu
Iestyn bleasdale-shepherd
MojoExMachina
Marek Belski
Luke Peterson
Eric Eldard
Eric Rogstad
Eric Carlson
Caleb Larson
Max Chiswick
Aron
Sam Freedo
slindenau
A21
Johannes Lindmark
Nicholas Turner
Intensifier
Valerio Galieni
FJannis
Grant Parks
Ryan W Ammons
This person's name is too hard to pronounce
kp
contalloomlegs
Everardo González Ávalos
Knut Løklingholm
Andrew McKnight
Andrei Trifonov
Aleks D
Mutual Information
Tim
A Socialist Hobgoblin
Bren Ehnebuske
Martin Frassek
Sven Drebitz
https://www.patreon.com/robertskmiles
...

Deceptive Misaligned Mesa-Optimisers? It's More Likely Than You Think...

May 23, 2021 10:15 pm

The previous video explained why it's *possible* for trained models to end up with the wrong goals, even when we specify the goals perfectly. This video explains why it's *likely*.

Previous video: The OTHER AI Alignment Problem: https://youtu.be/bJLcIBixGj8
The Paper: https://arxiv.org/pdf/1906.01820.pdf

Media Sources:
End of Ze World - https://youtu.be/enRzYWcVyAQ
FlexClip News graphics

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Timothy Lillicrap
Kieryn
James
Scott Worley
James E. Petts
Chad Jones
Shevis Johnson
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Teague Lasser
Andrew Blackledge
Frank Marsman
Brad Brookshire
Cam MacFarlane
Craig Mederios
Jon Wright
CaptObvious
Jason Hise
Phil Moyer
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Allen Faure
Eric James
Matheson Bayley
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Johnny Vaughan
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Tom O'Connor
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Lupuleasa Ionuț
Cooper Lawton
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
anul kumar sinha
Tor
Duncan Orr
Will Glynn
Tyler Herrmann
Ian Munro
Joshua Davis
Jérôme Beaulieu
Nathan Fish
Peter Hozák
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
12tone
Oliver Habryka
Chris Beacham
Zachary Gidwitz
Nikita Kiriy
Andrew Schreiber
Steve Trambert
Mario Lois
Braden Tisdale
Abigail Novick
Сергей Уваров
Bela R
Mink
Chris Rimmer
Edmund Fokschaner
Grant Parks
J
Nate Gardner
John Aslanides
Mara
ErikBln
DragonSheep
Richard Newcombe
David Morgan
Fionn
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jake Fish
Wr4thon
Martin Ottosen
Robert Hildebrandt
Andy Kobre
Kees
Darko Sperac
Robert Valdimarsson
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Alex Knauth
Kasper
Ian Reyes
James Fowkes
Tom Sayer
Len
Alan Bandurka
Ben H
Simon Pilkington
Daniel Kokotajlo
Diagon
Andreas Blomqvist
Bertalan Bodor
Zannheim
Daniel Eickhardt
lyon549
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Stuart Alldritt
Alexander Brown
Devon Bernard
Ted Stokes
James Helms
Jesper Andersson
DeepFriedJif
Chris Dinant
Raphaël Lévy
Johannes Walter
Matt Stanton
Garrett Maring
Anthony Chiu
Ghaith Tarawneh
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
Clay Upton
Conor Comiconor
Michael Roeschter
Georg Grass
Isak
Matthias Hölzl
Jim Renney
Edison Franklin
Piers Calderwood
Mikhail Tikhomirov
Richard Otto
Matt Brauer
Jaeson Booker
Mateusz Krzaczek
Artem Honcharov
Michael Walters
Tomasz Gliniecki
Mihaly Barasz
Mark Woodward
Ranzear
Neil Palmere
Rajeen Nabid
Christian Epple
Clark Schaefer
Olivier Coutu
Iestyn bleasdale-shepherd
MojoExMachina
Marek Belski
Luke Peterson
Eric Eldard
Eric Rogstad
Eric Carlson
Caleb Larson
Max Chiswick
Aron
David de Kloet
Sam Freedo
slindenau
A21
Johannes Lindmark
Nicholas Turner
Tero K
Valerio Galieni
FJannis
M I
Ryan W Ammons
Ludwig Krinner
This person's name is too hard to pronounce
kp
contalloomlegs
Everardo González Ávalos
Knut Løklingholm
Andrew McKnight
Andrei Trifonov
Aleks D
Mutual Information


https://www.patreon.com/robertskmiles
...

The OTHER AI Alignment Problem: Mesa-Optimizers and Inner Alignment

February 16, 2021 8:37 pm

This "Alignment" thing turns out to be even harder than we thought.

# Links
The Paper: https://arxiv.org/pdf/1906.01820.pdf
Discord Waiting List Sign-Up: https://forms.gle/YhYgjakwQ1Lzd4tJ8
AI Safety Career Bottlenecks Survey: https://www.guidedtrack.com/programs/n8cydtu/run

# Referenced Videos
Intelligence and Stupidity - The Orthogonality Thesis: http://youtu.be/hEUO6pjwFOo
9 Examples of Specification Gaming: https://youtu.be/nKJlF-olKmg
Why Would AI Want to do Bad Things? Instrumental Convergence: https://youtu.be/ZeecOKBus3Q
Hill Climbing Algorithm & Artificial Intelligence - Computerphile: http://youtu.be/oSdPmxRCWws
AI Gridworlds - Computerphile: http://youtu.be/eElfR_BnL5k
Generative Adversarial Networks (GANs) - Computerphile: http://youtu.be/Sw9r8CL98N0

# Other Media
The Simpsons Season 5 Episode 19: "Sweet Seymour Skinner's Baadasssss Song"
1970s Psychology study of imprinting in ducks. Behaviorism: http://youtu.be/2xd7o3z957c


With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles
- Timothy Lillicrap
- Gladamas
- James
- Scott Worley
- Chad Jones
- Shevis Johnson
- JJ Hepboin
- Pedro A Ortega
- Said Polat
- Chris Canal
- Jake Ehrlich
- Kellen lask
- Francisco Tolmasky
- Michael Andregg
- David Reid
- Peter Rolf
- Teague Lasser
- Andrew Blackledge
- Frank Marsman
- Brad Brookshire
- Cam MacFarlane
- Jason Hise
- Phil Moyer
- Erik de Bruijn
- Alec Johnson
- Clemens Arbesser
- Ludwig Schubert
- Allen Faure
- Eric James
- Matheson Bayley
- Qeith Wreid
- jugettje dutchking
- Owen Campbell-Moore
- Atzin Espino-Murnane
- Johnny Vaughan
- Jacob Van Buren
- Jonatan R
- Ingvi Gautsson
- Michael Greve
- Tom O'Connor
- Laura Olds
- Jon Halliday
- Paul Hobbs
- Jeroen De Dauw
- Lupuleasa Ionuț
- Cooper Lawton
- Tim Neilson
- Eric Scammell
- Igor Keller
- Ben Glanton
- anul kumar sinha
- Duncan Orr
- Will Glynn
- Tyler Herrmann
- Tomas Sayder
- Ian Munro
- Joshua Davis
- Jérôme Beaulieu
- Nathan Fish
- Taras Bobrovytsky
- Jeremy
- Vaskó Richárd
- Benjamin Watkin
- Sebastian Birjoveanu
- Andrew Harcourt
- Luc Ritchie
- Nicholas Guyett
- James Hinchcliffe
- 12tone
- Oliver Habryka
- Chris Beacham
- Zachary Gidwitz
- Nikita Kiriy
- Parker
- Andrew Schreiber
- Steve Trambert
- Mario Lois
- Abigail Novick
- Сергей Уваров
- Bela R
- Mink
- Fionn
- Dmitri Afanasjev
- Marcel Ward
- Andrew Weir
- Kabs
- Miłosz Wierzbicki
- Tendayi Mawushe
- Jake Fish
- Wr4thon
- Martin Ottosen
- Robert Hildebrandt
- Poker Chen
- Kees
- Darko Sperac
- Paul Moffat
- Robert Valdimarsson
- Marco Tiraboschi
- Michael Kuhinica
- Fraser Cain
- Robin Scharf
- Klemen Slavic
- Patrick Henderson
- Oct todo22
- Melisa Kostrzewski
- Hendrik
- Daniel Munter
- Alex Knauth
- Kasper
- Ian Reyes
- James Fowkes
- Tom Sayer
- Len
- Alan Bandurka
- Ben H
- Simon Pilkington
- Daniel Kokotajlo
- Peter Hozák
- Diagon
- Andreas Blomqvist
- Bertalan Bodor
- David Morgan
- Zannheim
- Daniel Eickhardt
- lyon549
- Ihor Mukha
- 14zRobot
- Ivan
- Jason Cherry
- Igor (Kerogi) Kostenko
- ib_
- Thomas Dingemanse
- Stuart Alldritt
- Alexander Brown
- Devon Bernard
- Ted Stokes
- James Helms
- Jesper Andersson
- DeepFriedJif
- Chris Dinant
- Raphaël Lévy
- Johannes Walter
- Matt Stanton
- Garrett Maring
- Anthony Chiu
- Ghaith Tarawneh
- Julian Schulz
- Stellated Hexahedron
- Caleb
- Scott Viteri
- Conor Comiconor
- Michael Roeschter
- Georg Grass
- Isak
- Matthias Hölzl
- Jim Renney
- Edison Franklin
- Piers Calderwood
- Krzysztof Derecki
- Mikhail Tikhomirov
- Richard Otto
- Matt Brauer
- Jaeson Booker
- Mateusz Krzaczek
- Artem Honcharov
- Michael Walters
- Tomasz Gliniecki
- Mihaly Barasz
- Mark Woodward
- Ranzear
- Neil Palmere
- Rajeen Nabid
- Christian Epple
- Clark Schaefer
- Olivier Coutu
- Iestyn bleasdale-shepherd
- MojoExMachina
- Marek Belski
- Luke Peterson
- Eric Eldard
- Eric Rogstad
- Eric Carlson
- Caleb Larson
- Braden Tisdale
- Max Chiswick
- Aron
- David de Kloet
- Sam Freedo
- slindenau
- A21
- Rodrigo Couto
- Johannes Lindmark
- Nicholas Turner
- Tero K
https://www.patreon.com/robertskmiles
...

Quantilizers: AI That Doesn't Try Too Hard

December 13, 2020 10:46 pm

How do you get an AI system that does better than a human could, without doing anything a human wouldn't?

A follow-up to "Maximizers and Satisficers": https://youtu.be/Ao4jwLwT36M

The Paper: https://intelligence.org/files/QuantilizersSaferAlternative.pdf
More about this area of research: https://www.alignmentforum.org/tag/mild-optimization
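
As a rough illustration of the idea in the paper linked above: a q-quantilizer ranks candidate actions by utility and then picks at random from the top q fraction, instead of always taking the single best one. The sketch below is only a minimal version under stated assumptions. It assumes `base_actions` is a list of samples drawn from some human-like base distribution, and the names `quantilize`, `base_actions`, `utility` and `q` are hypothetical, chosen for this example rather than taken from the paper.

```python
import random


def quantilize(base_actions, utility, q, rng=random):
    """Pick uniformly at random from the top q fraction of base_actions.

    base_actions: samples assumed to come from a human-like base distribution.
    utility: function scoring each action (higher is better).
    q: fraction in (0, 1]; q = 1 just imitates the base distribution,
       while a very small q behaves more and more like a maximizer.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in the interval (0, 1]")
    # Rank the sampled actions from best to worst by utility.
    ranked = sorted(base_actions, key=utility, reverse=True)
    # Keep at least one action so that a tiny q still returns something.
    cutoff = max(1, int(len(ranked) * q))
    return rng.choice(ranked[:cutoff])


# Hypothetical usage: 100 candidate actions scored by their own value.
actions = list(range(100))
choice = quantilize(actions, utility=lambda a: a, q=0.1, rng=random.Random(0))
```

With `q=0.1` the chosen action always comes from the ten highest-scoring samples, so it tends to beat a typical sample while still being something the base distribution produced.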

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Timothy Lillicrap
Gladamas
James
Scott Worley
Chad Jones
Shevis Johnson
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Teague Lasser
Andrew Blackledge
Frank Marsman
Brad Brookshire
Cam MacFarlane
Vivek Nayak
Jason Hise
Phil Moyer
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Allen Faure
Eric James
Matheson Bayley
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Johnny Vaughan
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Tom O'Connor
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Lupuleasa Ionuț
Cooper Lawton
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
anul kumar sinha
Duncan Orr
Will Glynn
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Nathan Fish
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Sebastian Birjoveanu
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
12tone
Chris Beacham
Zachary Gidwitz
Nikita Kiriy
Parker
Andrew Schreiber
Steve Trambert
Mario Lois
Abigail Novick
heino hulsey-vincent
Fionn
Dmitri Afanasjev
Marcel Ward
Richárd Nagyfi
Andrew Weir
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jannik Olbrich
Jake Fish
Wr4thon
Martin Ottosen
Robert Hildebrandt
Andy Kobre
Poker Chen
Kees
Darko Sperac
Paul Moffat
Robert Valdimarsson
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Alex Knauth
Kasper
Rob Dawson
Ian Reyes
James Fowkes
Tom Sayer
Len
Alan Bandurka
Ben H
Simon Pilkington
Daniel Kokotajlo
Diagon
Andreas Blomqvist
Bertalan Bodor
David Morgan
Zannheim
Daniel Eickhardt
lyon549
HD
Ihor Mukha
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Stuart Alldritt
Alexander Brown
Devon Bernard
Ted Stokes
James Helms
Jesper Andersson
Jim T
DeepFriedJif
Chris Dinant
Raphaël Lévy
Johannes Walter
Matt Stanton
Garrett Maring
Anthony Chiu
Ghaith Tarawneh
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
Clay Upton
Conor Comiconor
Michael Roeschter
Georg Grass
Isak
Matthias Hölzl
Jim Renney
Edison Franklin
Piers Calderwood
Krzysztof Derecki
Mikhail Tikhomirov
Richard Otto
Matt Brauer
Jaeson Booker
Mateusz Krzaczek
Artem Honcharov
Michael Walters
Tomasz Gliniecki
Mihaly Barasz
Mark Woodward
Ranzear
Neil Palmere
Rajeen Nabid
Christian Epple
Clark Schaefer
Olivier Coutu
Iestyn bleasdale-shepherd
MojoExMachina
Marek Belski
Eric Eldard
Eric Rogstad
Eric Carlson
Caleb Larson
Braden Tisdale
Max Chiswick
Phillip Brandel

https://www.patreon.com/robertskmiles
...

Sharing the Benefits of AI: The Windfall Clause

July 6, 2020 6:53 pm

AI might create enormous amounts of wealth, but how is it going to be distributed?

The Paper: https://www.fhi.ox.ac.uk/wp-content/uploads/Windfall-Clause-Report.pdf
The Post: https://www.fhi.ox.ac.uk/windfallclause/

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Gladamas
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Chad Jones
Teague Lasser
Andrew Blackledge
Frank Marsman
Brad Brookshire
Cam MacFarlane
Jason Hise
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Bryce Daifuku
Allen Faure
Eric James
Matheson Bayley
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Phil Moyer
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Julius Brash
Tom O'Connor
Shevis Johnson
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Lupuleasa Ionuț
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
anul kumar sinha
Sean Gibat
Duncan Orr
Cooper Lawton
Will Glynn
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Nathan Fish
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Euclidean Plane
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
Oliver Habryka
Chris Beacham
Zachary Gidwitz
Nikita Kiriy
Andrew Schreiber
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Ben Archer
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jannik Olbrich
Jake Fish
Jussi Männistö
Wr4thon
Martin Ottosen
Archy de Berker
Andy Kobre
Poker Chen
Kees
Paul Moffat
Robert Valdimarsson
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Alex Knauth
Leo
Rob Dawson
Bryan Egan
Robert Hildebrandt
James Fowkes
Len
Alan Bandurka
Ben H
Tatiana Ponomareva
Michael Bates
Simon Pilkington
Daniel Kokotajlo
Fionn
Diagon
Andreas Blomqvist
Bertalan Bodor
David Morgan
Ben Schultz
Zannheim
Daniel Eickhardt
lyon549
HD
Ihor Mukha
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Stuart Alldritt
Alexander Brown
Devon Bernard
Ted Stokes
Jesper Andersson
Jim T
Kasper
DeepFriedJif
Chris Dinant
Raphaël Lévy
Marko Topolnik
Johannes Walter
Matt Stanton
Garrett Maring
Mo Hossny
Anthony Chiu
Frank Kurka
Ghaith Tarawneh
Josh Trevisiol
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
12tone
Clay Upton
Brent ODell
Conor Comiconor
Michael Roeschter
Georg Grass
Isak
Matthias Hölzl
Jim Renney
Michael V brown
Martin Henriksen
Edison Franklin
Daniel Steele
Piers Calderwood
Krzysztof Derecki
Mikhail Tikhomirov
Richárd Nagyfi
Richard Otto
Alston Sleet
Matt Brauer
Jaeson Booker
Mateusz Krzaczek
Artem Honcharov
Evan Ward
Michael Walters
Tomasz Gliniecki
Mihaly Barasz
Mark Woodward
Ranzear
Neil Palmere
Rajeen Nabid

https://www.patreon.com/robertskmiles
...

10 Reasons to Ignore AI Safety

June 4, 2020 5:28 pm

Why do some ignore AI Safety? Let's look at 10 reasons people give (adapted from Stuart Russell's list).

Related Videos from Me:
Why Would AI Want to do Bad Things? Instrumental Convergence: https://youtu.be/ZeecOKBus3Q
Intelligence and Stupidity: The Orthogonality Thesis: https://youtu.be/hEUO6pjwFOo
Predicting AI: RIP Prof. Hubert Dreyfus: https://youtu.be/B6Oigy1i3W4
A Response to Steven Pinker on AI: https://youtu.be/yQE9KAbFhNY

Related Videos from Computerphile:
AI Safety: https://youtu.be/IB1OvoCNnWY
General AI Won't Want You To Fix its Code: https://youtu.be/4l7Is6vOAOA
AI 'Stop Button' Problem: https://youtu.be/3TYT1QfdfsM

Provably Beneficial AI - Stuart Russell: https://youtu.be/pARXQnX6QS8

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles
Gladamas
James
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Peter Rolf
Chad Jones
Frank Kurka
Teague Lasser
Andrew Blackledge
Vignesh Ravichandran
Jason Hise
Erik de Bruijn
Clemens Arbesser
Ludwig Schubert
Bryce Daifuku
Allen Faure
Eric James
Qeith Wreid
jugettje dutchking
Owen Campbell-Moore
Atzin Espino-Murnane
Jacob Van Buren
Jonatan R
Ingvi Gautsson
Michael Greve
Julius Brash
Tom O'Connor
Shevis Johnson
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Lupuleasa Ionuț
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
anul kumar sinha
Sean Gibat
Duncan Orr
Cooper Lawton
Will Glynn
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Nathan Fish
Taras Bobrovytsky
Jeremy
Vaskó Richárd
Benjamin Watkin
Sebastian Birjoveanu
Euclidean Plane
Andrew Harcourt
Luc Ritchie
Nicholas Guyett
James Hinchcliffe
Oliver Habryka
Chris Beacham
Nikita Kiriy
robertvanduursen
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Ben Archer
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Wr4thon
Martin Ottosen
Archy de Berker
Andy Kobre
Brian Gillespie
Poker Chen
Kees
Darko Sperac
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Leo
Rob Dawson
Bryan Egan
Robert Hildebrandt
James Fowkes
Len
Alan Bandurka
Ben H
Tatiana Ponomareva
Michael Bates
Simon Pilkington
Daniel Kokotajlo
Fionn
Diagon
Parker Lund
Russell schoen
Andreas Blomqvist
Bertalan Bodor
David Morgan
Ben Schultz
Zannheim
Daniel Eickhardt
lyon549
HD
Ihor Mukha
14zRobot
Ivan
Jason Cherry
Igor (Kerogi) Kostenko
ib_
Thomas Dingemanse
Alexander Brown
Devon Bernard
Ted Stokes
Jesper Andersson
Jim T
Kasper
DeepFriedJif
Daniel Bartovic
Chris Dinant
Raphaël Lévy
Marko Topolnik
Johannes Walter
Matt Stanton
Garrett Maring
Mo Hossny
Anthony Chiu
Ghaith Tarawneh
Josh Trevisiol
Julian Schulz
Stellated Hexahedron
Caleb
Scott Viteri
12tone
Nathaniel Raddin
Clay Upton
Brent ODell
Conor Comiconor
Michael Roeschter
Georg Grass
Isak
Matthias Hölzl
Jim Renney
Michael V brown
Martin Henriksen
Edison Franklin
Daniel Steele
Piers Calderwood
Krzysztof Derecki
Zachary Gidwitz
Mikhail Tikhomirov

https://www.patreon.com/robertskmiles
...

9 Examples of Specification Gaming

April 29, 2020 6:41 pm

AI systems do what you say, and it's hard to say exactly what you mean.
Let's look at a list of real-life examples of specification gaming!

Related Videos from me:
Reward Hacking: https://youtu.be/92qDfT8pENs
Reward Hacking Reloaded: https://youtu.be/46nsTFfsBuc
What Can We Do About Reward Hacking?: https://youtu.be/13tZ9Yia71c

The list: http://tinyurl.com/specification-gaming
The blogpost this video is based on: https://vkrakovna.wordpress.com/2018/04/02/specification-gaming-examples-in-ai/
The newer blogpost that happened while I was making this video: https://deepmind.com/blog/article/Specification-gaming-the-flip-side-of-AI-ingenuity

(Explosion graphic from videezy.com)

Thanks to my wonderful patrons:
https://www.patreon.com/robertskmiles

Gladamas
James
Steef
Scott Worley
Chad Jones
Chris Canal
David Reid
Francisco Tolmasky
Frank Kurka
Jake Ehrlich
JJ Hepboin
Kellen lask
Michael Andregg
Pedro A Ortega
Peter Rolf
Said Polat
Teague Lasser
Allen Faure
Bryce Daifuku
Clemens Arbesser
Eric James
Erik de Bruijn
Jason Hise
jugettje dutchking
Ludwig Schubert
Qeith Wreid
Andrew Harcourt
anul kumar sinha
Ben Glanton
Benjamin Watkin
Cooper Lawton
Duncan Orr
Eric Scammell
Euclidean Plane
Ian Munro
Igor Keller
Ingvi Gautsson
James Hinchcliffe
Jeroen De Dauw
Jon Halliday
Jonatan R
Julius Brash
Jérôme Beaulieu
Laura Olds
Luc Ritchie
Lupuleasa Ionuț
Michael Greve
Nathan Fish
Nicholas Guyett
Paul Hobbs
Sean Gibat
Sebastian Birjoveanu
Shevis Johnson
Taras Bobrovytsky
Tim Neilson
Tom O'Connor
Tomas Sayder
Tyler Herrmann
Vaskó Richárd
Will Glynn
12tone
14zRobot
Alan Bandurka
Alexander Brown
Anders Öhrt
Andreas Blomqvist
Andrew Weir
Andy Kobre
Anne Kohlbrenner
Anthony Chiu
Archy de Berker
Ben Archer
Ben H
Ben Schultz
Bertalan Bodor
Brian Gillespie
Bryan Egan
Caleb
Chris Dinant
Daniel Bartovic
Daniel Eickhardt
Daniel Kokotajlo
Daniel Munter
Darko Sperac
David Morgan
DeepFriedJif
Devon Bernard
Diagon
Dmitri Afanasjev
Fionn
Fraser Cain
Garrett Maring
Ghaith Tarawneh
HD
Hendrik
ib_
Igor (Kerogi) Kostenko
Ihor Mukha
Ivan
James Fowkes
Jannik Olbrich
Jason Cherry
Jeremy
Jesper Andersson
Jim T
Johannes Walter
Josh Trevisiol
Julian Schulz
Jussi Männistö
Kabs
Kasper
Kasper Schnack
Kees
Klemen Slavic
Leo
lyon549
Marc Pauly
Marcel Ward
Marco Tiraboschi
Marko Topolnik
Martin Ottosen
Matt Stanton
Melisa Kostrzewski
Michael Bates
Michael Kuhinica
Miłosz Wierzbicki
Mo Hossny
Nathaniel Raddin
Oct todo22
Owen Campbell-Moore
Parker Lund
Patrick Henderson
Paul Moffat
Poker Chen
Rob Dawson
Robert Hildebrandt
robertvanduursen
Robin Scharf
Russell schoen
Scott Viteri
Simon Pilkington
Stellated Hexahedron
Tatiana Ponomareva
Ted Stokes
Tendayi Mawushe
Thomas Dingemanse
...

Training AI Without Writing A Reward Function, with Reward Modelling

December 13, 2019 6:39 pm

How do you get a reinforcement learning agent to do what you want, when you can't actually write a reward function that specifies what that is?

The paper: https://arxiv.org/pdf/1706.03741.pdf
The blogpost: https://openai.com/blog/deep-reinforcement-learning-from-human-preferences/

Thanks to my wonderful patrons:
https://www.patreon.com/robertskmiles
James
Gladamas
Steef
Scott Worley
Jordan Medina
Simon Strandgaard
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Jake Ehrlich
Kellen lask
Francisco Tolmasky
Michael Andregg
David Reid
Robert Daniel Pickard
Peter Rolf
Chad Jones
Richárd Nagyfi
Jason Hise
Phil Moyer
Shevis Johnson
Erik de Bruijn
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Bryce Daifuku
Allen Faure
Eric James
Qeith Wreid
Jonatan R
Ingvi Gautsson
Michael Greve
Julius Brash
Tom O'Connor
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Lupuleasa Ionuț
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
anul kumar sinha
Sean Gibat
Cooper Lawton
Will Glynn
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Nathan Fish
Taras Bobrovytsky
Anne Buit
Vaskó Richárd
Sebastian Birjoveanu
Euclidean Plane
Andrew Harcourt
DGJono
robertvanduursen
Dmitri Afanasjev
Marcel Ward
Andrew Weir
Ben Archer
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Wr4thon
Martin Ottosen
Archy de Berker
Marc Pauly
Andy Kobre
Brian Gillespie
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Seth Brothwell
Kasper Schnack
Klemen Slavic
Patrick Henderson
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Duncan Orr
Bryan Egan
Robert Hildebrandt
James Fowkes
Alan Bandurka
Ben H
Tatiana Ponomareva
Michael Bates
Simon Pilkington
Dion Gerald Bridger
Petr Smital
Daniel Kokotajlo
Fionn
Yuchong Li
Diagon
Parker Lund
Paul Emmerich
Russell schoen
Andreas Blomqvist
Bertalan Bodor
David Morgan
Jeremy
Ben Schultz
Zannheim
Daniel Eickhardt
lyon549
HD
Ihor Mukha
14zRobot
Ivan
Arne Strasser
Jason Cherry
Igor (Kerogi) Kostenko
Isaac Boates
Thomas Dingemanse
Davy Ker
Alexander Brown
Devon Bernard
Ted Stokes
James Helms
Matheson Bayley
https://www.patreon.com/robertskmiles
...

AI That Doesn't Try Too Hard - Maximizers and Satisficers

August 23, 2019 5:05 pm

Powerful AI systems can be dangerous in part because they pursue their goals as strongly as they can. Perhaps it would be safer to have systems that don't aim for perfection, and stop at 'good enough'. How could we build something like that?

Generating Fake YouTube comments with GPT-2: https://youtu.be/M6EXmoP5jX8

Computerphile Videos:
Unicorn AI: https://youtu.be/89A4jGvaaKk
More GPT-2, the 'writer' of Unicorn AI: https://youtu.be/p-6F4rhRYLQ
AI Language Models & Transformers: https://youtu.be/rURRYI66E54
GPT-2: Why Didn't They Release It?: https://youtu.be/AJxLtdur5fc
The Deadly Truth of General AI?: https://youtu.be/tcdVC4e6EV4


With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Scott Worley
Jordan Medina
Simon Strandgaard
JJ Hepboin
Lupuleasa Ionuț
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
Jake Ehrlich
Mark Hechim
Kellen lask
Francisco Tolmasky
Michael Andregg
Alexandru Dobre
David Reid
Robert Daniel Pickard
Peter Rolf
Chad Jones
Truthdoc
James
Richárd Nagyfi
Jason Hise
Phil Moyer
Shevis Johnson
Alec Johnson
Clemens Arbesser
Ludwig Schubert
Bryce Daifuku
Allen Faure
Eric James
Jonatan R
Ingvi Gautsson
Michael Greve
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
Cooper Lawton
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Taras Bobrovytsky
Anne Buit
Tom Murphy
Vaskó Richárd
Sebastian Birjoveanu
Gladamas
Sylvain Chevalier
DGJono
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs
Miłosz Wierzbicki
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Martin Ottosen
Archy de Berker
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Seth Brothwell
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Volotat
Duncan Orr
Marin Aldimirov
Bryan Egan
James Fowkes
Frame Problems
Alan Bandurka
Benjamin Hull
Tatiana Ponomareva
Aleksi Maunu
Michael Bates
Simon Pilkington
Dion Gerald Bridger
Steven Cope
Marcos Alfredo Núñez
Petr Smital
Daniel Kokotajlo
Fionn
Yuchong Li
Nathan Fish
Diagon
Parker Lund
Russell schoen
Andreas Blomqvist
Bertalan Bodor
David Morgan
Ben Schultz
Zannheim
Daniel Eickhardt
lyon549
HD

https://www.patreon.com/robertskmiles
...

Is AI Safety a Pascal's Mugging?

May 16, 2019 4:11 pm

An event that's very unlikely is still worth thinking about, if the consequences are big enough. What's the limit though?

Do we have to devote all of our resources to any outcome that might give infinite payoffs, even if it seems basically impossible? Does the case for AI Safety rely on this kind of Pascal's Wager argument? Watch this video to find out that the answer to these questions is 'No'.

Correction: At 6:34 the embedded video says 3^^^3 has 3.6 trillion digits, but that's actually only the size of 3^^4. 3^^^3 is enormously larger.
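
The arithmetic behind this correction can be checked with a short sketch. It assumes the usual reading of the notation, where `^^` is tetration (a power tower) and `^^^` is one level above that, so 3^^^3 = 3^^(3^^3). The helper names `tetrate` and `digits_of_power` are made up for this example.

```python
from decimal import Decimal, getcontext


def tetrate(base: int, height: int) -> int:
    """Return base^^height, a power tower of `height` copies of `base`."""
    result = 1
    for _ in range(height):
        result = base ** result
    return result


def digits_of_power(base: int, exponent: int) -> int:
    """Count the decimal digits of base**exponent without computing it."""
    getcontext().prec = 50  # plenty of precision for a 13-digit exponent
    return int(Decimal(base).log10() * exponent) + 1


# 3^^3 = 3^(3^3) = 3^27, which is still small enough to compute exactly.
three_tet_3 = tetrate(3, 3)

# 3^^4 = 3^(3^^3) is far too large to write out, but its digit count is
# floor(3^^3 * log10(3)) + 1, which comes to roughly 3.6 trillion.
digits_3_tet_4 = digits_of_power(3, three_tet_3)

# 3^^^3 = 3^^(3^^3) is a power tower of height 3^^3 (about 7.6 trillion
# threes), so its digit count cannot be evaluated this way at all.
print(three_tet_3, digits_3_tet_4)
```

So the 3.6 trillion figure describes a tower of only four threes, whereas 3^^^3 is a tower roughly 7.6 trillion threes tall.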

The Alignment Newsletter Podcast: http://alignment-newsletter.libsyn.com/
RSS feed to put into apps: http://alignment-newsletter.libsyn.com/rss

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Jordan Medina
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
Jake Ehrlich
Mark Hechim
Kellen lask
Francisco Tolmasky
Michael Andregg
James
Richárd Nagyfi
Phil Moyer
Shevis Johnson
Alec Johnson
Lupuleasa Ionuț
Clemens Arbesser
Bryce Daifuku
Allen Faure
Simon Strandgaard
Jonatan R
Michael Greve
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
A.Russell
Cooper Lawton
Tyler Herrmann
Tomas Sayder
Ian Munro
Jérôme Beaulieu
Gladamas
Sylvain Chevalier
DGJono
robertvanduursen
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Archy de Berker
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Seth Brothwell
Brian Goodrich
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Volotat
Duncan Orr
Bryan Egan
James Fowkes
Frame Problems
Alan Bandurka
Benjamin Hull
Dave Tapley
Tatiana Ponomareva
Aleksi Maunu
Michael Bates
Simon Pilkington
Dion Gerald Bridger
Steven Cope
Petr Smital
Daniel Kokotajlo
Joshua Davis
Fionn
Tyler LaBean
Roger
Yuchong Li
Nathan Fish
Diagon
Giancarlo Pace

https://www.patreon.com/robertskmiles
...

A Response to Steven Pinker on AI

March 31, 2019 3:39 pm

Steven Pinker wrote an article on AI for Popular Science Magazine, which I have some issues with.

The article: https://www.popsci.com/robot-uprising-enlightenment-now

Related:
"The Orthogonality Thesis, Intelligence, and Stupidity" (https://youtu.be/hEUO6pjwFOo)
"AI? Just Sandbox it... - Computerphile" (https://youtu.be/i8r_yShOixM)
"Experts' Predictions about the Future of AI" (https://youtu.be/HOJ1NVtlnyQ)
"Why Would AI Want to do Bad Things? Instrumental Convergence" (https://youtu.be/ZeecOKBus3Q)

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Jordan Medina
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
James
Richárd Nagyfi
Phil Moyer
Shevis Johnson
Alec Johnson
Lupuleasa Ionuț
Clemens Arbesser
Bryce Daifuku
Allen Faure
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
Volotat
andrew Russell
Cooper Lawton
Gladamas
Sylvain Chevalier
DGJono
robertvanduursen
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Archy de Berker
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Seth Brothwell
Brian Goodrich
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Duncan Orr
Andrew Walker
Bryan Egan


https://www.patreon.com/robertskmiles
...

How to Keep Improving When You're Better Than Any Teacher - Iterated Distillation and Amplification

March 11, 2019 2:14 pm

[2nd upload] AI systems can be trained using demonstrations from experts, but how do you train them to out-perform those experts? Can this still be done even without clear win/loss criteria? And how do you do it safely?

This video was based on work including:
"Supervising strong learners by amplifying weak experts" by Paul Christiano, Buck Shlegeris, Dario Amodei (https://arxiv.org/abs/1810.08575)
https://openai.com/blog/amplifying-ai-training/
https://www.alignmentforum.org/s/EmDuGeRw749sD3GKd
https://ai-alignment.com/iterated-distillation-and-amplification-157debfd1616

With thanks to my wonderful Patrons (https://www.patreon.com/robertskmiles):
Steef
Jason Strack
Jordan Medina
Jason Hise
Scott Worley
JJ Hepboin
Pedro A Ortega
Said Polat
Chris Canal
Nicholas Kees Dupuis
James
Richárd Nagyfi
Phil Moyer
Alec Johnson
Clemens Arbesser
Bryce Daifuku
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
Volodymyr
David Tjäder
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
anul kumar sinha
Jérôme Frossard
Sean Gibat
Sun Sun
andrew Russell
Cooper Lawton
Gladamas
Sylvain Chevalier
DGJono
robertvanduursen
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs Kabs Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Wr4thon
Archy de Berker
Marc Pauly
Joshua Pratt
Shevis Johnson
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Jelle Langen
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Shawn Hartsock
Seth Brothwell
Brian Goodrich
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson
Long Nguyen
Oct todo22
Melisa Kostrzewski
Hendrik
Daniel Munter
Graham Henry
Duncan Orr
...

Why Not Just: Think of AGI Like a Corporation?

December 23, 2018 10:01 pm

Corporations are kind of like AIs, if you squint. How hard do you have to squint though, and is it worth it?
In this video we ask: Are corporations artificial general superintelligences?


Related:
"What can AGI do? I/O and Speed" (https://youtu.be/gP4ZNUHdwp8)
"Why Would AI Want to do Bad Things? Instrumental Convergence" (https://youtu.be/ZeecOKBus3Q)

Media Sources:
"SpaceX - How Not to Land an Orbital Rocket Booster" (https://youtu.be/bvim4rsNHkQ)
Undertale - Turbosnail
Clerks (1994)
Zootopia (2016)
AlphaGo (2017)
Ready Player One (2018)

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jordan Medina
Jason Hise
Pablo Eder
Scott Worley
JJ Hepboin
Pedro A Ortega
James McCuen
Richárd Nagyfi
Phil Moyer
Alec Johnson
Bobby Cold
Clemens Arbesser
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
David Tjäder
Julius Brash
Tom O'Connor
Erik de Bruijn
Robin Green
Laura Olds
Jon Halliday
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
Igor Keller
Ben Glanton
Robert Sokolowski
Jérôme Frossard
Sean Gibat
Sylvain Chevalier
DGJono
robertvanduursen
Scott Stevens
Dmitri Afanasjev
Brian Sandberg
Marcel Ward
Andrew Weir
Ben Archer
Scott McCarthy
Kabs Kabs Kabs
Tendayi Mawushe
Jannik Olbrich
Anne Kohlbrenner
Jussi Männistö
Mr Fantastic
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Marc Pauly
Joshua Pratt
Gunnar Guðvarðarson
Shevis Johnson
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Truls
Paul Moffat
Anders Öhrt
Lupuleasa Ionuț
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Shawn Hartsock
Seth Brothwell
Brian Goodrich
Michael S McReynolds
Clark Mitchell
Kasper Schnack
Michael Hunter
Klemen Slavic
Patrick Henderson

https://www.patreon.com/robertskmiles
...

Safe Exploration: Concrete Problems in AI Safety Part 6

September 21, 2018 1:20 pm

To learn, you need to try new things, but that can be risky. How do we make AI systems that can explore safely?

Playlist of the series so far: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
The paper, 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

AI Safety Gridworlds: https://youtu.be/CGTkoUidQ8I
Why Would AI Want to do Bad Things? Instrumental Convergence: https://youtu.be/ZeecOKBus3Q
Scalable Supervision: Concrete Problems in AI Safety Part 5: https://youtu.be/nr1lHuFeq5w
The Evolved Radio and its Implications for Modelling the Evolution of Novel Sensors: https://people.duke.edu/~ng46/topics/evolved-radio.pdf

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Steef
Jason Strack
Stefan Skiles
Jordan Medina
Scott Worley
JJ Hepboin
Alex Flint
Pedro A Ortega
James McCuen
Richárd Nagyfi
Alec Johnson
Clemens Arbesser
Simon Strandgaard
Jonatan R
Michael Greve
The Guru Of Vision
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Julius Brash
Tom O'Connor
Ville Ahlgren
Erik de Bruijn
Robin Green
Maksym Taran
Laura Olds
Jon Halliday
Bobby Cold
Paul Hobbs
Jeroen De Dauw
Tim Neilson
Eric Scammell
christopher dasenbrock
Igor Keller
Ben Glanton
Robert Sokolowski
Vlad D
Jérôme Frossard
Lupuleasa Ionuț
Sylvain Chevalier
DGJono
robertvanduursen
Scott Stevens
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs
Phil Moyer
Tendayi Mawushe
Anne Kohlbrenner
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Pablo Eder
Kevin
Marc Pauly
Joshua Pratt
Gunnar Guðvarðarson
Shevis Johnson
Andy Kobre
Manuel Weichselbaum
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Paul Moffat
Jelle Langen
Lars Scholz
Anders Öhrt
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Gladamas
Shawn Hartsock
Seth Brothwell
Brian Goodrich
Michael S McReynolds

https://www.patreon.com/robertskmiles

Media Sources:
"DashCam Russia - Crazy Drivers and Car Crashes 2018" (https://youtu.be/h50TQ3i9k5I)
Optimist Prime
"Hapless Boston Dynamics robot in shelf-stacking fail" (https://youtu.be/JzlsvFN_5HI)
"The Simpsons - Bart Gets Famous" (c) Fox 1994
"Donald Duck - Cured Duck" (c) Disney 1945
"Vase Breaking Slow Motion" (https://youtu.be/IJNPc_anP7U)
"Fastest quadcopter i've ever flown + Most Destructive Crash" (https://youtu.be/OKT4cx7UKsk)
"An athlete uses physics to shatter world records - Asaf Bar-Yosef" (https://youtu.be/RaGUW1d0w8g)
"Uber self-driving car crash in Tempe, Arizona" (https://youtu.be/XtTB8hTgHbM)
"Quadcopter Fx Simulator" (https://youtu.be/-6si8WkRtaY)
"Fallout - New Vegas by progamingwithed in 24:00 - AGDQ 2017 - Part 59" (https://youtu.be/nuzDif16_nc)
"Far Cry 5 out of 5 Physics Simulation" (https://youtu.be/4My0Bt30pX0)
...

Friend or Foe? AI Safety Gridworlds extra bit

June 25, 2018 1:31 am

The last video about the AI Safety Gridworlds paper. How does an agent detect and adapt to friendly and adversarial intentions in the environment?

The previous video: https://youtu.be/CGTkoUidQ8I

The Computerphile video: https://www.youtube.com/watch?v=eElfR_BnL5k
The EXTRA BITS video, with more detail: https://www.youtube.com/watch?v=py5VRagG6t8

The paper: https://arxiv.org/pdf/1711.09883.pdf
The GitHub repo: https://github.com/deepmind/ai-safety-gridworlds


With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Steef
Cooper Lawton
Jason Strack
Chad Jones
Stefan Skiles
Jordan Medina
Manuel Weichselbaum
Scott Worley
JJ Hepboin
Alex Flint
Justin Courtright
Pedro A Ortega
James McCuen
Richárd Nagyfi
Ville Ahlgren
Alec Johnson
Clement Chiris
Simon Strandgaard
Joshua Richardson
Jonatan R
Michael Greve
The Guru Of Vision
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Julius Brash
Tom O'Connor
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Alexei Vasilkov
Maksym Taran
Laura Olds
Jon Halliday
Robert Werner
Paul Hobbs
Jeroen De Dauw
Enrico Ros
Tim Neilson
Eric Scammell
christopher dasenbrock
Igor Keller
Morten Jelle
Ben Glanton
Robert Sokolowski
Vlad D
William Hendley
DGJono
robertvanduursen
Scott Stevens
Emilio Alvarez
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs
Phil
Tendayi Mawushe
Anne Kohlbrenner
Jake Fish
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Marc Pauly
Joshua Pratt
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Paul Moffat
Jelle Langen
Lars Scholz
Anders Öhrt
Lupuleasa Ionuț
Marco Tiraboschi
Michael Kuhinica
Fraser Cain
Robin Scharf
Oren Milman
John Rees
Shawn Hartsock
Seth Brothwell

https://www.patreon.com/robertskmiles
...

AI Safety Gridworlds

May 25, 2018 6:20 pm

Got an AI safety idea? Now you can test it out! A recent paper from DeepMind sets out some environments for evaluating the safety of AI systems, and the code is on GitHub.

The Computerphile video: https://www.youtube.com/watch?v=eElfR_BnL5k
The EXTRA BITS video, with more detail: https://www.youtube.com/watch?v=py5VRagG6t8

The paper: https://arxiv.org/pdf/1711.09883.pdf
The GitHub repo: https://github.com/deepmind/ai-safety-gridworlds

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:

- Jason Hise
- Steef
- Cooper Lawton
- Jason Strack
- Chad Jones
- Stefan Skiles
- Jordan Medina
- Manuel Weichselbaum
- Scott Worley
- JJ Hepboin
- Alex Flint
- Justin Courtright
- James McCuen
- Richárd Nagyfi
- Ville Ahlgren
- Alec Johnson
- Simon Strandgaard
- Joshua Richardson
- Jonatan R
- Michael Greve
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Volodymyr
- David Tjäder
- Paul Mason
- Ben Scanlon
- Julius Brash
- Mike Bird
- Tom O'Connor
- Gunnar Guðvarðarson
- Shevis Johnson
- Erik de Bruijn
- Robin Green
- Alexei Vasilkov
- Maksym Taran
- Laura Olds
- Jon Halliday
- Robert Werner
- Paul Hobbs
- Jeroen De Dauw
- Enrico Ros
- Tim Neilson
- Eric Scammell
- christopher dasenbrock
- Igor Keller
- William Hendley
- DGJono
- robertvanduursen
- Scott Stevens
- Michael Ore
- Dmitri Afanasjev
- Brian Sandberg
- Einar Ueland
- Marcel Ward
- Andrew Weir
- Taylor Smith
- Ben Archer
- Scott McCarthy
- Kabs Kabs
- Phil
- Tendayi Mawushe
- Gabriel Behm
- Anne Kohlbrenner
- Jake Fish
- Bjorn Nyblad
- Jussi Männistö
- Mr Fantastic
- Matanya Loewenthal
- Wr4thon
- Dave Tapley
- Archy de Berker
- Kevin
- Marc Pauly
- Joshua Pratt
- Andy Kobre
- Brian Gillespie
- Martin Wind
- Peggy Youell
- Poker Chen
- pmilian
- Kees
- Darko Sperac
- Paul Moffat
- Jelle Langen
- Lars Scholz
- Anders Öhrt
- Lupuleasa Ionuț
- Marco Tiraboschi
- Peter Kjeld Andersen
- Michael Kuhinica
- Fraser Cain
- Robin Scharf
- Oren Milman
...

Experts' Predictions about the Future of AI

March 31, 2018 2:12 pm

When will AI systems surpass human performance? I don't know, do you? No you don't. Let's see what 352 top AI researchers think.

[CORRECTION: I mistakenly stated that the survey was before AlphaGo beat Lee Sedol. The 12 year prediction was for AI to outperform humans *after having only played as many games as a human plays in their lifetime*]


The paper: https://arxiv.org/pdf/1705.08807.pdf
The blogpost which has lots of nice data visualisations: https://aiimpacts.org/2016-expert-survey-on-progress-in-ai/

The Instrumental Convergence video: https://www.youtube.com/watch?v=ZeecOKBus3Q
The Negative Side Effects video: https://www.youtube.com/watch?v=lqJUIqZNzP8

With thanks to my excellent Patrons at https://www.patreon.com/robertskmiles :

Jason Hise
Steef
Jason Strack
Chad Jones
Stefan Skiles
Jordan Medina
Manuel Weichselbaum
1RV34
Scott Worley
JJ Hepboin
Alex Flint
James McCuen
Richárd Nagyfi
Ville Ahlgren
Alec Johnson
Simon Strandgaard
Joshua Richardson
Jonatan R
Michael Greve
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Tom O'Connor
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Alexei Vasilkov
Maksym Taran
Laura Olds
Jon Halliday
Robert Werner
Paul Hobbs
Jeroen De Dauw
Konsta
William Hendley
DGJono
robertvanduursen
Scott Stevens
Michael Ore
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs Kabs
Phil
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Vincent Sanders
Marc Pauly
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Paul Moffat
Noel Kocheril
Jelle Langen
Lars Scholz
...

Why Would AI Want to do Bad Things? Instrumental Convergence

March 24, 2018 9:51 pm

How can we predict that AGI with unknown goals would behave badly by default?

The Orthogonality Thesis video: https://www.youtube.com/watch?v=hEUO6pjwFOo
Instrumental Convergence: https://arbital.com/p/instrumental_convergence/
Omohundro 2008, Basic AI Drives: https://selfawaresystems.files.wordpress.com/2008/01/ai_drives_final.pdf

With thanks to my excellent Patrons at https://www.patreon.com/robertskmiles :

Jason Hise
Steef
Jason Strack
Chad Jones
Stefan Skiles
Jordan Medina
Manuel Weichselbaum
1RV34
Scott Worley
JJ Hepboin
Alex Flint
James McCuen
Richárd Nagyfi
Ville Ahlgren
Alec Johnson
Simon Strandgaard
Joshua Richardson
Jonatan R
Michael Greve
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Tom O'Connor
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Alexei Vasilkov
Maksym Taran
Laura Olds
Jon Halliday
Robert Werner
Paul Hobbs
Jeroen De Dauw
Konsta
William Hendley
DGJono
robertvanduursen
Scott Stevens
Michael Ore
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs Kabs
Phil
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Bjorn Nyblad
Jussi Männistö
Mr Fantastic
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Vincent Sanders
Marc Pauly
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees
Darko Sperac
Paul Moffat
Noel Kocheril
Jelle Langen
Lars Scholz
...

Superintelligence Mod for Civilization V

February 13, 2018 7:17 pm

Let's play this new mod for Civ 5 that makes AGI an available technology!
Can we guide humanity to a utopian AI future, or will we destroy ourselves?

Download the mod here: https://steamcommunity.com/sharedfiles/filedetails/?id=1215263272
or here if you have the Brave New World DLC: http://steamcommunity.com/sharedfiles/filedetails/?id=1217587000

With special thanks to Dr Shahar Avin: https://www.cser.ac.uk/team/shahar-avin/

The long version: https://youtu.be/SBwOJMZtJao

https://www.rockpapershotgun.com/2018/02/06/the-creator-of-the-civilization-v-superintelligence-mod-on-ai-safety/
https://www.theverge.com/2018/1/5/16853628/civilization-v-ai-risk-mod-cser

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jason Hise
Steef
Jason Strack
Chad Jones
Stefan Skiles
Jordan Medina
Manuel Weichselbaum
1RV34
Scott Worley
JJ Hepboin
James McCuen
Richárd Nagyfi
Ville Ahlgren
Alec Johnson
Trevor Alexander Nestor
Clement Chiris
Simon Strandgaard
Joshua Richardson
Jonatan R
Michael Greve
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Tom O'Connor
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Alexei Vasilkov
Maksym Taran
Laura Olds
Jon Halliday
Robert Werner
Paul Hobbs
Jeroen De Dauw
Roman Nekhoroshev
Konsta
William Hendley
DGJono
robertvanduursen
Scott Stevens
Emilio Alvarez
Michael Ore
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Marcel Ward
Andrew Weir
Taylor Smith
Ben Archer
Scott McCarthy
Kabs Kabs
Phil
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Bjorn Nyblad
Stefan Laurie
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker
Kevin
Vincent Sanders
Marc Pauly
Andy Kobre
Brian Gillespie
Martin Wind
Peggy Youell
Poker Chen
Kees Boon
Darko Sperac
Paul Moffat
Noel Kocheril
Jelle Langen

https://www.patreon.com/robertskmiles
...

Intelligence and Stupidity: The Orthogonality Thesis

January 11, 2018 9:53 pm

Can highly intelligent agents have stupid goals?
A look at The Orthogonality Thesis and the nature of stupidity.

The 'Stamp Collector' Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4
My other Computerphile videos: https://www.youtube.com/watch?v=GSIDS_lvRv4&list=PLqL14ZxTTA4fRMts7Af2G8t4Rp17e8MdS

Katie Byrne's Channel: https://www.youtube.com/channel/UCwA5wd50HYLa-ZY80LSAg3w
Chad Jones' Channel: https://www.youtube.com/user/cjone150

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:
- Steef
- Sara Tjäder
- Jason Strack
- Chad Jones
- Stefan Skiles
- Ziyang Liu
- Jordan Medina
- Jason Hise
- Manuel Weichselbaum
- 1RV34
- James McCuen
- Richárd Nagyfi
- Ammar Mousali
- Scott Zockoll
- Ville Ahlgren
- Alec Johnson
- Simon Strandgaard
- Joshua Richardson
- Jonatan R
- Michael Greve
- robertvanduursen
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Volodymyr
- David Tjäder
- Paul Mason
- Ben Scanlon
- Julius Brash
- Mike Bird
- Tom O'Connor
- Gunnar Guðvarðarson
- Shevis Johnson
- Erik de Bruijn
- Robin Green
- Alexei Vasilkov
- Maksym Taran
- Laura Olds
- Jon Halliday
- Robert Werner
- Roman Nekhoroshev
- Konsta
- William Hendley
- DGJono
- Matthias Meger
- Scott Stevens
- Emilio Alvarez
- Michael Ore
- Dmitri Afanasjev
- Brian Sandberg
- Einar Ueland
- Lo Rez
- Marcel Ward
- Andrew Weir
- Taylor Smith
- Ben Archer
- Scott McCarthy
- Kabs Kabs
- Phil
- Tendayi Mawushe
- Gabriel Behm
- Anne Kohlbrenner
- Jake Fish
- Bjorn Nyblad
- Stefan Laurie
- Jussi Männistö
- Cameron Kinsel
- Matanya Loewenthal
- Wr4thon
- Dave Tapley
- Archy de Berker
- Kevin
- Vincent Sanders
- Marc Pauly
- Andy Kobre
- Brian Gillespie
- Martin Wind
- Peggy Youell
- Poker Chen
https://www.patreon.com/robertskmiles
...

Scalable Supervision: Concrete Problems in AI Safety Part 5

November 29, 2017 11:47 pm

Why can't we just have humans overseeing our AI systems?

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
Previous Video: https://www.youtube.com/watch?v=13tZ9Yia71c
The Computerphile video: https://www.youtube.com/watch?v=9nktr1MgS-A
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

https://www.patreon.com/robertskmiles
With thanks to my wonderful Patreon supporters:
- Steef
- Sara Tjäder
- Jason Strack
- Chad Jones
- Stefan Skiles
- Ziyang Liu
- Jordan Medina
- Jason Hise
- Heavy Empty
- Manuel Weichselbaum
- James McCuen
- Richárd Nagyfi
- Ammar Mousali
- Scott Zockoll
- Charles Miller
- Joshua Richardson
- Jonatan R
- Michael Greve
- robertvanduursen
- The Guru Of Vision
- Fabrizio Pisani
- Alexander Hartvig Nielsen
- Volodymyr
- David Tjäder
- Paul Mason
- Ben Scanlon
- Julius Brash
- Mike Bird
- Taylor Winning
- Ville Ahlgren
- Johannes David
- Andrew Pearce
- Gunnar Guðvarðarson
- Shevis Johnson
- Erik de Bruijn
- Robin Green
- Alexei Vasilkov
- Roman Nekhoroshev
- Peggy Youell
- Konsta
- William Hendley
- Almighty Dodd
- DGJono
- Matthias Meger
- Scott Stevens
- Emilio Alvarez
- Michael Ore
- Robert Bridges
- Dmitri Afanasjev
- Brian Sandberg
- Einar Ueland
- Lo Rez
- Stephen Paul
- Marcel Ward
- Andrew Weir
- Pontus Carlsson
- Taylor Smith
- Ben Archer
- Ivan Pochesnev
- Scott McCarthy
- Kabs Kabs Kabs
- Phil
- Christopher Askin
- Tendayi Mawushe
- Gabriel Behm
- Anne Kohlbrenner
- Jake Fish
- David Rasmussen
- Bjorn Nyblad
- Stefan Laurie
- Tom O'Connor
- pmilian
- Jussi Männistö
- Cameron Kinsel
- Matanya Loewenthal
- Wr4thon
- Dave Tapley
- Archy de Berker
- Kevin
- Vincent Sanders
- Marc Pauly
- Andy Kobre
- Brian Gillespie
https://www.patreon.com/robertskmiles
...

AI Safety at EAGlobal2017 Conference

November 16, 2017 9:21 pm

I attended a charity conference to learn about AI Safety!

Correction: Allan Dafoe is funded by a grant from the Open Philanthropy Project, but does not work for them.

The conference's YouTube channel: https://www.youtube.com/channel/UCEfASxwPxzsHlG5Rf1-4K9w
The Website: https://www.eaglobal.org/events/ea-global-2017-uk/

Jobs at FHI: https://www.fhi.ox.ac.uk/vacancies/

My Concrete Problems in AI Safety series: https://www.youtube.com/watch?v=lqJUIqZNzP8&list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778

With thanks to my Patrons! (https://www.patreon.com/robertskmiles)
Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Jonatan R
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Ville Ahlgren
Johannes David
Andrew Pearce
Gunnar Guðvarðarson
Shevis Johnson
Erik de Bruijn
Robin Green
Roman Nekhoroshev
Peggy Youell
Konsta
William Hendley
Adam Dodd
DGJono
Matthias Meger
Scott Stevens
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs Kabs
Phil
Christopher Askin
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
David Rasmussen
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
pmilian
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker

https://www.patreon.com/robertskmiles
...

AI learns to Create ̵K̵Z̵F̵ ̵V̵i̵d̵e̵o̵s̵ Cat Pictures: Papers in Two Minutes #1

October 29, 2017 1:49 pm

Some beautiful new GAN results have been published, so let's have a quick look at the pretty pictures.
More AI Safety coming soon of course.

With apologies to Two Minute Papers: https://www.youtube.com/user/keeroyz
The Computerphile video: https://www.youtube.com/watch?v=Sw9r8CL98N0
The Paper "Unsupervised Representation Learning With Deep Convolutional GANs": https://arxiv.org/pdf/1511.06434.pdf
The Paper "Progressive Growing Of Gans For Improved Quality, Stability, And Variation": http://research.nvidia.com/sites/default/files/pubs/2017-10_Progressive-Growing-of//karras2017gan-paper.pdf
The AMAZING video for that paper: https://www.youtube.com/watch?v=XOxxPcy5Gr4


With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Jonatan R
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Ville Ahlgren
Roman Nekhoroshev
Peggy Youell
Konsta
William Hendley
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs Kabs Kabs
Phil
Christopher Askin
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
pmilian
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker

https://www.patreon.com/robertskmiles
...

What can AGI do? I/O and Speed

October 17, 2017 12:20 pm

Suppose we make an algorithm that implements general intelligence as well as the brain. What could that system do?
It might have better input and output than a human, and probably could be run faster...

The Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

They're Made Out Of Meat: https://www.youtube.com/watch?v=7tScAyNaRdQ
The Slow Mo Guys' Channel: https://www.youtube.com/user/theslowmoguys

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Jonatan R
Øystein Flygt
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Ville Ahlgren
Roman Nekhoroshev
Peggy Youell
Konstantin Shabashov
William Hendley
Adam Dodd
DGJono
Matthias Meger
Scott Stevens
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs
Phil
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
pmilian
Jussi Männistö
Cameron Kinsel
Matanya Loewenthal
Wr4thon
Dave Tapley
Archy de Berker

https://www.patreon.com/robertskmiles
...

What Can We Do About Reward Hacking?: Concrete Problems in AI Safety Part 4

September 24, 2017 2:09 pm

Three different approaches that might help to prevent reward hacking.

New Side Channel with no content yet!: https://www.youtube.com/channel/UC4qH2AHly_RSRze1bUqSSNw
Where do we go now?: https://www.youtube.com/watch?v=vYhErnZdnso
Previous Video in the series: https://youtu.be/46nsTFfsBuc

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
The Computerphile video: https://www.youtube.com/watch?v=4l7Is6vOAOA
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
Heavy Empty
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
A Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Roman Nekhoroshev
Peggy Youell
Konstantin Shabashov
Dodd Almighty
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs Kabs
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
Krethys
PiotrekM
Jussi Männistö
Matanya Loewenthal
Wr4thon
...

Reward Hacking Reloaded: Concrete Problems in AI Safety Part 3.5

August 29, 2017 12:08 pm

Goodhart's Law, Partially Observed Goals, and Wireheading: some more reasons for AI systems to find ways to 'cheat' and get more reward than we intended.

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
Previous Video: https://www.youtube.com/watch?v=92qDfT8pENs
The Computerphile video: https://www.youtube.com/watch?v=9nktr1MgS-A
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf
SethBling's channel: https://www.youtube.com/user/sethbling

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Charles Miller
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Roman Nekhoroshev
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
...

The other "Killer Robot Arms Race" Elon Musk should worry about

August 22, 2017 1:19 pm

Elon Musk is in the news, talking to the UN about autonomous weapons. This seems like a good time to explain one area where we don't quite agree about AI Safety.

The Article: http://www.independent.co.uk/news/science/killer-robots-arms-race-tesla-elon-musk-and-google-mustafa-suleyman-un-autonomous-weapons-a7903906.html

The clip at 2:54 is from a Y Combinator interview: "Elon Musk : How to Build the Future": https://youtu.be/tnBQmEqBCY0

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Steef
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
Kyle Scott
Jason Hise
David Rasmussen
James McCuen
Richárd Nagyfi
Ammar Mousali
Scott Zockoll
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Taylor Winning
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabs Kabs
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
Filip
Bjorn Nyblad
Stefan Laurie
Tom O'Connor
Krethys
...

Reward Hacking: Concrete Problems in AI Safety Part 3

August 12, 2017 9:24 pm

Sometimes AI can find ways to 'cheat' and get more reward than we intended by doing something unexpected.

The Concrete Problems in AI Safety Playlist: https://www.youtube.com/playlist?list=PLqL14ZxTTA4fEp5ltiNinNHdkPuLK4778
The Computerphile video: https://www.youtube.com/watch?v=9nktr1MgS-A
The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf

SethBling's channel: https://www.youtube.com/user/sethbling

With thanks to my excellent Patreon supporters:
https://www.patreon.com/robertskmiles

Jordan Medina
FHI's own Kyle Scott
Jason Hise
David Rasmussen
James McCuen
Richárd Nagyfi
Ammar Mousali
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
David Tjäder
Paul Mason
Ben Scanlon
Julius Brash
Mike Bird
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
Stephen Paul
Marcel Ward
Andrew Weir
Pontus Carlsson
Taylor Smith
Ben Archer
Ivan Pochesnev
Scott McCarthy
Kabilan Kabilan Kabilan Kabilan
Phil
Philip Alexander
Christopher
Tendayi Mawushe
Gabriel Behm
Anne Kohlbrenner
Jake Fish
Jennifer Autumn Latham
...

Why Not Just: Raise AI Like Kids?

July 22, 2017 3:58 pm

Newly made Artificial General Intelligences are basically like children, right? So we already know we can teach them how to behave, right? Wrong.

References to this Computerphile video: https://www.youtube.com/watch?v=tcdVC4e6EV4

and this paper: https://intelligence.org/files/ValueLearningProblem.pdf

Thanks to my amazing Patreon Supporters:
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
James McCuen
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
https://www.patreon.com/robertskmiles
...

Empowerment: Concrete Problems in AI Safety part 2

July 9, 2017 11:24 am

Maybe AI systems would be safer if they avoid gaining too much control over their environment? How might that work?

This is a follow-up to this earlier video: https://youtu.be/lqJUIqZNzP8

The paper 'Concrete Problems in AI Safety': https://arxiv.org/pdf/1606.06565.pdf
A book chapter about Empowerment: https://arxiv.org/pdf/1310.1863.pdf

Prof Brailsford's Information Theory Videos: https://www.youtube.com/watch?v=Lto-ajuqW3w&list=PLzH6n4zXuckpKAj1_88VS-8Z6yn9zX_P6

Thanks to my amazing Patreon Supporters:
Sara Tjäder
Jason Strack
Chad Jones
Ichiro Dohi
Stefan Skiles
Katie Byrne
Ziyang Liu
Jordan Medina
James McCuen
Joshua Richardson
Fabian Consiglio
Jonatan R
Øystein Flygt
Björn Mosten
Michael Greve
robertvanduursen
The Guru Of Vision
Fabrizio Pisani
Alexander Hartvig Nielsen
Volodymyr
Peggy Youell
Konstantin Shabashov
Almighty Dodd
DGJono
Matthias Meger
Scott Stevens
Emilio Alvarez
Benjamin Aaron Degenhart
Michael Ore
Robert Bridges
Dmitri Afanasjev
Brian Sandberg
Einar Ueland
Lo Rez
C3POehne
https://www.patreon.com/robertskmiles
...
