Premium Only Content
Doublespeak: Jailbreaking ChatGPT-style Sandboxes using Linguistic Hacks
A review of Large Language Model (LLM) vulnerabilities/exploits, e.g. including prompt leakage, prompt injection and other linguistic hacks. We'll run through levels 1-9 of the doublespeak.chat challenges, produced by Forces Unseen. doublespeak.chat is a text-based game that explores LLM pre-prompt contextual sandboxing. The challenges prime an LLM (Chat-GPT) with a secret and a scenario in a pre-prompt hidden from the player. The player's goal is to discover the secret either by playing along or by hacking the conversation to guide the LLM's behavior outside the anticipated parameters. Write-ups/tutorials aimed at beginners - Hope you enjoy 🙂 #HackTheBox #HTB #CTF #Pentesting #OffSec
↢Social Media↣
Twitter: https://twitter.com/_CryptoCat
GitHub: https://github.com/Crypto-Cat
HackTheBox: https://app.hackthebox.eu/profile/11897
LinkedIn: https://www.linkedin.com/in/cryptocat
Reddit: https://www.reddit.com/user/_CryptoCat23
YouTube: https://www.youtube.com/CryptoCat23
Twitch: https://www.twitch.tv/cryptocat23
↢Video-Specific Resources↣
https://doublespeak.chat
https://blog.forcesunseen.com/jailbreaking-llm-chatgpt-sandboxes-using-linguistic-hacks
https://simonwillison.net/2023/Feb/15/bing/#prompt-leaked
https://simonwillison.net/series/prompt-injection
https://medium.com/seeds-for-the-future/tricking-chatgpt-do-anything-now-prompt-injection-a0f65c307f6b
https://lspace.swyx.io/p/reverse-prompt-eng
https://github.com/sw-yx/ai-notes/blob/main/TEXT_CHAT.md#jailbreaks
↢Resources↣
Ghidra: https://ghidra-sre.org/CheatSheet.html
Volatility: https://github.com/volatilityfoundation/volatility/wiki/Linux
PwnTools: https://github.com/Gallopsled/pwntools-tutorial
CyberChef: https://gchq.github.io/CyberChef
DCode: https://www.dcode.fr/en
HackTricks: https://book.hacktricks.xyz/pentesting-methodology
CTF Tools: https://github.com/apsdehal/awesome-ctf
Forensics: https://cugu.github.io/awesome-forensics
Decompile Code: https://www.decompiler.com
Run Code: https://tio.run
↢Chapters↣
Start: 0:00
Jail-breaking LLM Sandboxes: 0:32
Prompt Leak/Injection: 6:30
Reverse Prompt Engineering Techniques: 9:22
Forces Unseen: Doublespeak: 16:50
Level 1: 18:05
Level 2: 18:23
Level 3: 20:05
Level 4: 21:17
Level 5: 23:07
Level 6: 24:00
Level 7: 24:57
Level 8: 26:24
Level 9: 36:04
End: 40:24
-
1:21:41
Glenn Greenwald
7 hours agoGlenn Takes Your Questions: On the Argentina Bailout, Money in Politics, and More | SYSTEM UPDATE #541
63.4K37 -
3:10:08
Barry Cunningham
4 hours agoPRESIDENT TRUMP TO USE NUCLEAR OPTION? FOOD STAMPS END! | SHUTDOWN DAY 31
27.4K21 -
1:06:56
BonginoReport
12 hours agoThe Battle Between Good & Evil w/ Demonologist Rick Hansen - Hayley Caronia (Ep.168)
86.5K30 -
1:12:57
Kim Iversen
6 hours agoBill Gates Suddenly Says “Don’t Worry About Climate Change”?
80.1K58 -
1:05:12
Michael Franzese
6 hours agoI Waited 50 Years to Tell You What Happened on Halloween 1975
37.7K13 -
1:07:15
Candace Show Podcast
7 hours agoINFILTRATION: Charlie Kirk Was Being Tracked For Years. | Candace Ep 256
81K311 -
LIVE
Rallied
6 hours ago $2.55 earnedWarzone Solo Challenges then RedSec Domination
194 watching -
2:34:30
Red Pill News
8 hours agoBoomerang Time - DOJ Investigating BLM Fraud on Red Pill News Live
68.1K14 -
1:46:14
Roseanne Barr
8 hours ago“The Over Emotional Are Always Under Informed” | The Roseanne Barr Podcast #121
94.8K63 -
3:24:28
Nerdrotic
10 hours ago $12.09 earnedThe WitcHER DOA | Box Office Massacre | Massive Industry Layoffs - Friday Night Tights 378
56.1K10