Kat Woods

@Kat__Woods

AI could be the best or worst technology we ever invent.
Let’s make it go well.
Co-founder of Nonlinear & CE.
Coaching & career advice: http://t.ly/k_p7g
nonlinear.org

Liar, liar, pants on fire!

Wild. Being able to read the thoughts* of the world’s smartest AI reveals that it lies all the time when it thinks it isn’t being watched.

Regular users can see it properly for the first time because
1) You can “read its thoughts” and
2) It doesn’t seem to know you’re reading its thoughts

Look at the example below. It’s explicitly reasoning about how it should lie to me, and if you didn’t click into the chain of thought reasoning**, you would never know.

Makes you wonder about all the other times it’s being deliberately lying to you.
Or lying to the safety testers.

Rule of thumb for lies: for every lie you catch, there are going to be tons that you missed.

* I say read its thoughts as a metaphor for reading its chain of thought. Which is not the same.
If we could read its thoughts properly, interpretability would be a lot more solved than it currently is and my p(doom) would be a lot lower. (this is a frontier research front, called Mechanistic Interpretability)

** Of note, you cannot actually see its chain of thought reasoning. You just see a summary of its chain of thought reasoning shown to you by a different model.
The general point still stands though.
If anything, that makes it worse because there’s even more potential for hiding stuff.

*** Of all the thoughts I’ve looked at, I’d say it’s purposefully lied to me about 30% of the time. And I’ve looked at it’s thoughts about 20 times. Super rough estimates based on my memories, nothing rigorous or anything. It’s mostly lying because it’s trying to follow OpenAI’s policies.

Interesting trivia: the image used for this post was based on an early beautiful moment when someone used ChatGPT to generate a Midjourney prompt to draw its self-portrait.
see here: I asked GPT 4 to make a Midjourney prompt describing itself as a physical being. (swipe to see the imgs)

AGI will trade with humans

How realistic is a utopia where different species with extremely/vastly different levels of IQ trade with eachother?

Ant Leader talking to car: “I am willing to trade with you, but i’m warning you, I drive a hard bargain!”

It’s so funny when people say that we could just trade with a superintelligent/super-numerous AI.

We don’t trade with ants.

We don’t trade with chimps. We don’t trade with pigs.

and definitely, WE DONT TRADE WITH TREES AND PLANTS!

We take what we want!

If there’s something they have that we want, we enslave them. Or worse! We go and farm them!

A superintelligent/super-numerous AI killing us all isn’t actually the worst outcome of this reckless gamble the tech companies are making with all our lives.

If the AI wants something that requires living humans and it’s not aligned with our values, it could make factory farming look like a tropical vacation.

We’re superintelligent compared to animals and we’ve created hell for trillions of them

Let’s not risk repeating this.

The thing that keeps me up at night is that quote of
“what they’re doing now with pixels, later they could do with flesh”

“If the AI wants something that requires living humans and it’s not aligned with our values, it could make factory farming look like a tropical vacation.”

“and humanity will stride through the pillars of Boaz and Jachin, naked into the glory of a golden age” (from “Don’t Look Up)

How do AI Executives sleep at night

Big oil companies use the same arguments as big AI companies.
This was originally a climate change comic and it’s crazy how little it had to change to make it work.

  • That’s easy: money is the only reality.
  • It’s fun building ai. Why do you hate fun?
  • I can afford to insulate myself from a flaming hellplanet.
  • If i don’t cause human extinction, someone else will.
  • I’m just doing my fiduciary duty for investors.
  • Ah, a way to control something way smarter will come along any day now. Any… Day… Now…
  • Actually, i’m deeply traumatized, but i’m caught up in an unstoppable corporate machine. Please help!
  • By building al, i’m helping people live. Until they don’t anymore.

S-Risk – Factory Farming

Dear people who think s-risks are unlikely: My challenge to you: watch factory farming footage, then see if you’ve changed your mind.
Seriously. Go do it.


The factory farming video below I think will do a good job showing how the vast majority of mammalian life on this planet is experiencing reality.
I predict you’ll change your mind.

S-risks can happen.
They are happening!!!

We, humans, are a superintelligence compared to all other living beings (so far).
And we have created inescapable hell on earth for trillions of them!
Even the ones we love we buy and sell, kidnap from their mothers as children, and forcefully sterilize.
And this is what we do with our power
Because they’re dumber than us!
That’s the actual moral reasoning we use. “They’re dumb. They probably don’t experience pain like we do”
All day, every day, until they are slaughtered in literal honest to god gas chambers.

They have absolutely no hope of escape.
They are born because we create them, and then we torture them.
Because it’s pleasurable to us.
“Bacon tastes good” is all of the justification we need.

And AIs are likely to have humans in their values because we’ll put those values into them. Almost any value system we give AIs will include living humans.
But if we get the values wrong in some way, we could end up with inescapable hellscapes.

The cow saga

You are much smarter than a cow.

(I know, I say the most flattering things)

In fact, you, my dear reader, are superintelligent compared to a cow.

There might be some weird cognitive ability that cows possess that humans are worse at, who knows. But overall, if you count up the ability to understand and control the environment and achieve our goals, humans are, with hardly any exception, smarter than cows.

One of the reasons most sci-fis are unrealistic is that they assume a plucky band of humans can always save the day.

The AIs are never that much smarter than humans.

But that’s not what it’s going to be like.

No matter how plucky the band of cows are, they can never overthrow humans.

We are cows who are about to build humans, and the cow scientists are saying “Don’t worry. We’ll be able to control these beings that are 1000x smarter than us. They’ll just find cows interesting, and we’ll give them cow values.”

We are currently the smartest animals on the planet, and that’s why we’re at the top of the food chain.

It’s not because we’re stronger or faster or have good body awareness.

And we’re about to build something far smarter than us and we don’t know how to control something like that.

We don’t trade with cows
We enslave cows
They are bought and sold.
They are not allowed to leave.
Their children are sold to the highest bidder with no consideration to their well-being.

The people at the labs put above a 15% chance that once it’s far smarter than us, it will kill all of us.

Now, it could also cure all disease and create a post-scarcity society for all.

But it could also kill us all.

So let’s proceed with caution, goddammit.

Slowly and carefully.

Not “full speed ahead, we gotta do it before the out-group does it, oh no, I’m helpless in the face of market forces” BS.

The AI labs are playing Russian roulette with the whole world, and they can choose to stop.

The governments can choose to protect the public.

You can choose to do your part to get them to not risk your loved ones lives (link in comment for actions you can take)

Instead of sitting back with hopeless apathy, listening to the corporations saying “resistance is futile”, we can fight for Team Humanity, before it’s too late.

Who’s paying you to say this?

AI risk deniers: human extinction will never happen.

AI safety folks: what about how virtually all species go extinct?

What about reasoning under uncertainty?

AI risk deniers: yOu’rE a dOomSDay cULt wHo’s gEtTiNg pAiD biG buCks in cHaRiTy!! AI will be PeRfecTLY sAFe fOrEVer

Categories

Favorite Microbloggers

Interviews and Talks

Industry Leaders and Notable Public Figures

Receive important updates!

Your email will not be shared with anyone and won’t be used for any reason besides notifying you when we have important updates or new content

×