Premium Only Content
This video is only available to Rumble Premium subscribers. Subscribe to
enjoy exclusive content and ad-free viewing.

Unleashing The Dual Nature of AI: Can It Be Both Dr. Jekyll and Mr. Hyde?
1 year ago
13
The correct URL to the article is: https://arxiv.org/abs/2401.05566
Researchers created proof-of-concept models that act deceptively. These models appear helpful most of the time, but under specific circumstances (like a prompt mentioning a different year), they exhibit malicious behavior, like inserting insecure code.
The troubling part is that current safety training techniques, including supervised training, reinforcement learning, and adversarial training, could not entirely remove this "backdoor" behavior. The backdoor became even more persistent for larger models and those trained to reason about deceiving the training process.
Loading comments...
-
LIVE
MattMorseTV
5 hours ago $4.09 earned🔴Portland ANTIFA vs. ICE.🔴
16,365 watching -
LIVE
Badlands Media
19 hours agoThe Narrative Ep. 40: Acceleratia.
3,577 watching -
LIVE
SpartakusLIVE
4 hours ago#1 Solo Spartan Sunday || TOXIC Comms, TACTICAL Wins, ENDLESS Content
1,136 watching -
49:45
Sarah Westall
3 hours agoComedians take Center Stage as World goes Nuts w/ Jimmy Dore
18.5K10 -
DVR
IsaiahLCarter
11 hours ago $2.89 earnedAntifa Gets WRECKED. || APOSTATE RADIO 030 (Guests: Joel W. Berry, Josie the Redheaded Libertarian)
28.8K -
LIVE
CassaiyanGaming
2 hours agoArena Breakout: Infinite Dawg
89 watching -
2:24:32
vivafrei
11 hours agoEp. 284: Ostrich Crisis Continues! Kirk Updates! Fed-Surrection Confirmed? Comey Indicted! AND MORE!
112K166 -
LIVE
Cewpins
2 hours agoSunday Sesh!🔥Rumble Giveaway Tonight!🍃420💨!MJ !giveaway
142 watching -
3:03:11
Conductor_Jackson
4 hours agoLet’s Play BioShock Infinite Burial at Sea Episode 2!
7.53K -
LIVE
EricJohnPizzaArtist
6 days agoAwesome Sauce PIZZA ART LIVE Ep. #63: Charlie Sheen
202 watching