Transcript
Title text: This is how you all fucking sound
[A smug tech bro wearing a sideways cap, a watch, and a chain around his neck stands in front of a data center by a lake with dead fish. A smokestack blows pollution into the air]
Tech bro: AI is already here, there’s no going back.
[A smug man in a suit with a cigarette in hand stands in a restaurant while two disgruntled diners cough from the smoke]
Suit: Smoking indoors is already here, there’s no going back.
[A smug man in a top hat and suit stands in a factory with two sad and dirty children]
Hat: Child labor is already here, there’s no going back.
[A smug plantation owner stands in front of a field with two angry slaves]
Plantation owner: The Atlantic Slave trade is already here, there’s no going back.


As much as I’d like this to be true (don’t believe all the benchmarks), in reality using e.g. GPT-5.5 is still a lot less of a pain in the ass. The difference mostly comes down to the smaller model needing more reprompting (GPT is just smarter and one-shots stuff more often) and being a lot slower (on an RTX 3090, for reference).
I’ve tried using it for some time, but I think I’m faster writing code by hand than using this, and the code comes out better (although that’s also true compared with GPT-5.5). I also need the valuable VRAM for other stuff, as I’m a graphics/shader programmer most of the time.
That said, it’s already fairly impressive how much progress these smaller models have made over the last year. It’s usable; you can “vibe-code” at least simple stuff.
You’re not wrong, but perhaps you are not giving them adequate time to be wrong. One-shotting is not the be-all; being right is, especially maintainably correct. I’ve found it useful to let them fight over it.
Eh, valid. I do in fact agree, and perhaps I exaggerated with 16GB; I do in fact have two,