Compiler From Scratch: Phase 1 - Tokenizer Generator 021: Using tokenizer in the tokenizer generator
Streamed on 2024-12-06 (https://www.twitch.tv/thediscouragerofhesitancy)
Zero Dependencies Programming!
The tokenizer generator has to parse two files (so far): the project file and the tokenizer definition file. If we can generate a tokenizer, why not use that to parse those files? There's no reason not to, so I started that task today. There is a bit of back and forth as you update the file parsing and the supported tokens, keeping the two in sync enough that everything continues to build and run while making the switch.
Things were going reasonably smoothly until I ran into a bug. I thought it was a bug that I had been anticipating for quite a while, but it turned out to be something else. I expected an ordering problem, but instead it looks like our DFA isn't quite formed correctly. There are states that should be merged, such as when two keywords start with the same letter. There are also states that should have reasonable fallbacks if they don't complete (or if the input continues after the expected end), as in the case of a KEYWORD whose text is also a valid start of an identifier. If the KEYWORD doesn't match exactly, the DFA can and should still cross over into the identifier track. But that isn't happening right now.
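To make the two requirements concrete, here is a minimal sketch in Python. It is not the code from the stream: the names `build_dfa`, `next_token`, and the token kinds `KEYWORD` and `IDENTIFIER` are hypothetical, and it assumes keywords are made only of identifier characters. Keywords that share a prefix share states, and every keyword state falls back to a generic identifier state when the input stops following the keyword.

```python
import string

# Assumed identifier alphabet; the real tokenizer definition may differ.
IDENT_START = set(string.ascii_letters + "_")
IDENT_CONT = IDENT_START | set(string.digits)


def build_dfa(keywords):
    """Build (transitions, accepting) for the keywords plus an identifier rule."""
    START, IDENT = 0, 1  # state 1 means "inside a plain identifier"
    transitions = {START: {}, IDENT: {}}
    accepting = {IDENT: "IDENTIFIER"}

    # Insert keywords as a trie so shared prefixes reuse the same states.
    for word in keywords:
        state = START
        for ch in word:
            if ch not in transitions[state]:
                new_state = len(transitions)
                transitions[new_state] = {}
                # A partial keyword is still a valid identifier.
                accepting[new_state] = "IDENTIFIER"
                transitions[state][ch] = new_state
            state = transitions[state][ch]
        accepting[state] = "KEYWORD"

    # Fallback edges: any identifier character that does not continue a
    # keyword moves to the generic identifier state.
    for state, edges in transitions.items():
        allowed = IDENT_START if state == START else IDENT_CONT
        for ch in allowed:
            edges.setdefault(ch, IDENT)

    return transitions, accepting


def next_token(dfa, text, pos=0):
    """Return (kind, lexeme) for the longest match at pos, or None."""
    transitions, accepting = dfa
    state, last, i = 0, None, pos
    while i < len(text) and text[i] in transitions[state]:
        state = transitions[state][text[i]]
        i += 1
        if state in accepting:
            last = (accepting[state], text[pos:i])
    return last


dfa = build_dfa(["token", "to", "type"])
print(next_token(dfa, "token"))   # exact keyword match
print(next_token(dfa, "tokens"))  # runs past the keyword, becomes an identifier
print(next_token(dfa, "tok"))     # stops short of the keyword, still an identifier
```

With this construction `token`, `to`, and `type` all leave the start state through a single `t` state, and rule ordering never comes into play, because a state is labelled `KEYWORD` only when the input ends exactly on the keyword.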
I started looking into this bug, but didn't have time to finish it. Today's stream was a bit short and the code of interest is stuff I haven't looked at in quite a long time, so there was quite a bit of time spent trying to remember how it works and reason about what the fix should be. We'll finish debugging this next week.