model roundup
Haiku 4.5
-
Weekend build, ~10 hours. Demo: https://trurent-five.vercel.app/ Problem I was poking at: every major Indian rental site (NoBroker, MagicBricks, 99acres) is infested with brokers even when you filter "direct owner." Reddit actually has hon…
-
Made LLMs play Texas Hold’em against each other. 6 models at the table: a tiny 1.2B running locally on my 16GB MacBook, a couple mid-size ones, and cloud models going up to about 1 trillion parameters.
-
I made 6 LLMs play Texas Hold’em against each other. Ran 5 tournaments on my 16GB MacBook.
-
I just launched Socratize (socratize.io) - a rebranded and rebuilt version of FixAI, our original B2C experiment. This time it's B2B-only: teams use it to practice uncomfortable workplace conversations - difficult feedback, client escalati…
-
I build context/harness optimization tooling, so provider-side serialization quirks actually matter to me. If you're optimizing over prompts, you need to know exactly what hits the model.
-
I'm building an automated mail generation pipeline using Claude Haiku 4.5 OnPremise but the knowledge cutoff June 2025. This model needs to handle temporal expressions correctly like : next Monday end of the week this month 16 May 16 May 2…