David Hariri

March 24th 2026

MCP is not for you

The one in which I refute the recent arguments against MCP and explain where I think they're coming from.

March 11th 2025

Claude Plays Factorio

Jack Hopkins tests LLMs in his 'FLE' - Factorio Learning Environment.

January 16th 2025

How AI Took Over The World

Learn about artificial intelligence with Brit from 'Art of the Problem,' who excels in creating engaging and educational videos.

LLMs

December 17th 2024

Is Reasoning Language?

Exploring the nature of reasoning in AI models, questioning if making LLMs express their thoughts out loud limits their potential.

November 16th 2024

Only one LLM is good at chess

Exploring how different LLMs perform at chess, with most failing except turbo-instruct. Discusses tuning and training influences.

LLMs
Evals

November 10th 2024

Asking Chat to Draw My Life

A humorous take on asking ChatGPT to illustrate one's life, featuring the mysterious 'Horchar Blend'.

LLMs

November 2nd 2024

Just found an incredible guide on building an LLM as a judge by Hamel Husain! Super insightful, especially since we’re using a similar system at Ada to evaluate transcript resolutions. Excited about how smartly it avoids blind spots in test coverage!

LLMs
Evals

October 31st 2024

ChatGPT Search vs Perplexity Initial Thoughts

I just got access to the new ChatGPT search feature on macOS! Excited to compare how it stacks up against my go-to tool, Perplexity, for research. Gave it a spin with a few examples and shared my thoughts on the strengths and weaknesses of both. Check it out!

macOS
LLMs

October 30th 2024

Anthropic Computer Use

Just tried out Anthropic's Computer Use demo in a Docker setup! It can control a virtual machine and run tasks like adding a knowledge base for our bots. Super impressive, but it did trip up on some commands and interactions. Excited to see where this tech goes!

October 27th 2024

ChatGPT Easter Eggs

I stumbled upon a fun little experiment with ChatGPT tonight—someone mentioned it might have Easter eggs, so I asked for a random YouTube link. Turns out, it definitely trolled me! I love when tech has a sense of humor.

LLMs

October 25th 2024

Perplexity for Mac

Just checked out the new Perplexity macOS app and it’s a game changer! It's already my go-to research tool, especially for those tricky service-desk KPIs. The inline citations and easy-to-copy tables make my life so much easier!

macOS
LLMs

October 22nd 2024

ChatGPT Growth

Reflecting on the mind-boggling growth of #ChatGPT and how it shattered expectations.

LLMs

October 22nd 2024

ChatGPT The Theological Scholar

Exploring my dad's deep conversations with ChatGPT about the Bahá'í faith and the rich religious texts that may influence its knowledge.

LLMs

October 17th 2024

Working Probabilistically

Exploring the importance of thinking probabilistically when working with LLMs, this post highlights insights on effective eval methodologies, the quirks of model behavior, and practical tips for building robust evaluation processes that go beyond traditional testing.

LLMs
Evals

October 16th 2024

LLM-Generated Descriptions

In this blog post, I share a recent enhancement to my website's intake endpoint that utilizes LLM technology to automatically generate short descriptions for my blog posts. By integrating OpenAI's API, I can now effortlessly create engaging summaries whenever I upload new content. I discuss the process behind this implementation, its effectiveness with past examples, and my plans to add features for generating tags based on existing ones. Dive in to learn how AI is transforming the way I present my ideas online!

April 9th 2024

Using LLMs in Production

A nod to Will Larson's post on using LLMs in production and some additional notes based on my own experience.

LLMs
Code

September 11th 2023

Did Code Win?

Some brief thoughts on the future of no-code in a world where code generation is ubiquitous.

April 29th 2023

LLMs as a New Kind of Computer

Thoughts on Beren's post about LLMs as a new form of computer.

LLMs