Engineering Insights.

What we've learned shipping production systems. AI, architecture, and 25 years of pattern recognition.

A $2 Probe: Does a Short AI Constitution Behave Like a Long One?

Jul 18, 2026

Before spending on fine-tuning, we measured whether a 1.7k-word AI constitution actually produces different behavior than a 29k-word one. The cheap probe answered it, and located the bottleneck.

Augur is live: our side project now forecasts on Europe's grid

Jul 16, 2026

A wind and solar forecaster we built on the side, mostly by pointing AI agents at the problem, is now live in production on Predico, the platform used by Belgium's grid operator, and gets scored against professional forecasters. Here is the honest scorecard, losses included.

Does Your AI Constitution Actually Change Anything? A Cross-Judge Test

Jul 11, 2026

We stress-tested whether a compact 6-principle AI constitution (Kardia) and Anthropic's ~29,000-word hierarchy actually pick different answers — using two independent strong judges. The divergence is real; the story about where it lives is judge-dependent. An honest interim report.

Kardia Phase 0: Do Two AI Constitutions Actually Prefer Different Answers?

Jul 11, 2026

Before fine-tuning anything, we measured whether Kardia's short Citadel constitution and Anthropic's long hierarchy constitution encode different preferences on the same response pairs. Weak judges hid the signal; strong cross-family judges found a thin, real gap — about 17% opposite picks on a hard probe set. Method, confounds, and what we are not claiming.

One Prompt: A Mastermind Game, and a 1997 Thesis Brought Back From the Dead

Jul 10, 2026

We gave Claude Code a single /goal prompt. It recovered a 1997 undergraduate thesis from the Internet Archive, ported its solver, built a classic Mastermind board, and published it — with its thought process included.

An Energy Forecaster That Grades Itself Against the Grid Operator

Jul 7, 2026

Augur forecasts wind and solar for three European countries and scores every forecast against the grid operator's own — including where it loses. Why the baseline you beat is the whole story, and what honest energy forecasting actually looks like.

Testing Without Test Code: AI-Driven QA on Mobile Apps

Jun 3, 2026

What happens when an AI agent reads a scenario document and executes it on a real iOS simulator — adapting to layout shifts, verifying with screenshots, and writing the report itself.

Who's reviewing your AI-generated code?

Jun 2, 2026

Your team ships 5x the code. Who's checking it? AI-generated code has specific failure modes that junior engineers can't catch. Here's the honest picture.

Agentic AI in Production: What Actually Works

May 29, 2026

After building agentic AI systems in production environments, here's the honest accounting of what works, what fails, and what the hype gets wrong. From Eloquentix engineers.

MCP Security for Enterprise

May 29, 2026

Model Context Protocol connects your AI to live business data. Before you deploy: prompt injection, tool poisoning, over-permissioning, and a four-pillar security framework from Eloquentix engineers.

Real-Time at Scale: Architecture Lessons from E.ON's Virtual Power Plant

May 29, 2026

12 years building E.ON's Virtual Power Plant — dispatching 400 assets simultaneously across four countries. What the architecture looks like, what failed, and what it teaches about real-time systems at TSO grade.

Google Apps Script Is a Hidden AI Gem

May 25, 2026

While talking to someone who is among the most forward implementors of AI solutions in the enterprise, we both agreed that where agents are deployed is...

Introducing Kardia: What an AI Constitution Should Be

May 20, 2026

We just published Kardia, our take on what an AI constitution should be. kardia.eloquentix.com

AI Burnout, Right on Cue

May 5, 2026

In mid-February, after coming back from China and catching up with AI, a partner told me about all the agents he had set up, the scaffolding, the...

Agents Require a Text Abstraction Layer of Your Thinking

Apr 10, 2026

The divergence between those who are still on GPT's creepy voice and those running agents of different sorts is widening, and it may be that it is not...

RandomMaxxing: Staying Human in a Computer-Driven World

Apr 9, 2026

May I kindly ask to share a new term: RandomMaxxing.

Claude Mythos: Six Ways an Aligned Model Quietly Misbehaves

Apr 8, 2026

Some of Claude Mythos' adventures (credit @hesamation). This is exactly why a constitution built on formation, not just constraint, matters:

What's Resilient in an AI World? Maybe Character

Apr 6, 2026

Many who are aware of AI capabilities today are scared of their future. What is resilient in an AI world? Skills? Judgement? Rolodex? Knowledge? All...

Software Abundance and the Age of Abundance

Mar 31, 2026

You may have heard various tech voices say that we are headed to an age of abundance. This idea may seem cuckoo, but it is worth understanding where it...

Jony Ive, Steve Jobs, and the Psychopathy of Corporate Mediocrity

Mar 25, 2026

I saw a post on X about an interaction between Jony Ive and Steve Jobs that stuck with me. Asking the models what they think landed on remarkable...

Apple's Real Strategy: Devices as Data Aggregators for an Experience

Mar 23, 2026

I was listening to the original iPod presentation from 2001, the '1000 songs in your pocket' one. Actually the most important part was at the beginning.

Our Brains Are Learning to Forget Like Feeds

Mar 22, 2026

Our brain's capacity to forget is adapting to feeds. Just like you don't remember the details of the asphalt and trees that you saw in the last 24...

A Moral Question About AI: Should It Be Any Different?

Mar 19, 2026

Most people have never heard of ZF. This German company quietly builds the world's best automatic transmissions. Their 8-speed ZF 8HP powers BMWs,...

The Paradigm Shift: From ML Observation to GenAI Manipulation

Mar 14, 2026

After a few days taking to a young researcher at Wisconsin, he said that our discussions highlighted to him the difference between ML and what we now...

Karpathy's Autoresearch

Mar 10, 2026

Singularity stuff. Karpathy just addressed the priciest bottleneck in ML research for free. Researcher iteration speed. A senior ML engineer bleeds...

Metals LSP Plugin for Claude Code

Mar 10, 2026

Our latest open-source contribution at Eloquentix: the Metals LSP plugin for Claude Code. This integration brings advanced Scala language server...

AI Workload Creep

Mar 8, 2026

If it was not obvious.

Coding Agents Changed in December

Feb 26, 2026

Here is what Karpathy is saying and it hard to disagree after using tools in February 26. And these again are the worse tools we will ever have.

Google's Universal Commerce Protocol

Jan 11, 2026

Google just released the Universal Commerce Protocol -- an open-source standard for AI-powered shopping.

How Claude Would Code Without Humans

Dec 2, 2025

We asked Claude what it thought about human code and how it would approach coding if humans were not in the loop. Here is a part of its answer.

With AI, All Codebases Are Legacy

Nov 1, 2025

From an Eloquentix developer:

Co-workers Are Non-Deterministic

Sep 17, 2025

Co-workers are non-deterministic. That is why we have audits.

Innovation Gets the Glory

Sep 12, 2025

There's a pattern in how nations lose economic dominance that we keep missing. Europe gave us quantum mechanics, radioactivity, and modern chemistry....

Claude Code: Why It Feels Like Playing a Video Game

Aug 30, 2025

About Claude Code: The best compliment one can give to it is that it feels like playing a video game.

AI Got Developers to Pay for Software

Aug 29, 2025

AI has done something no human ever could.

Every AI Model Has a Cultural Personality

Aug 29, 2025

All the big labs pretrain on the ~same internet, but each has a secret sauce:

Formula SAE: Where Engineering Meets Entrepreneurship

Jun 23, 2025

Congrats PER.

The UX for Long-Running AI Agents

May 25, 2025

"The UX for long running AI Agents is going to be one of the most interesting design questions in the coming years. The more the agent is doing complex...

Virtual Power Plants and Grid Stability

Apr 29, 2025

Having worked on the best VPP in Europe for the last 10 years, what Spain's massive blackout, exposed the risks of relying heavily on renewables...

GPT-4.1 Agentic Workflows: Practical Tips

Apr 15, 2025

GPT-4.1 is out and here are some tips from inside OpenAI.

UX Principles for Agentic AI Systems

Apr 14, 2025

Emerging UX principles for agentic AI -- systems that act autonomously to achieve goals -- focus on balancing user control, trust, and seamless...

Shopify's AI-First Memo

Apr 7, 2025

With full appreciation of the clarity of his prose, here is Tobi Lutke's message to Shopify.

Prompt Engineering Is Micromanagement

Mar 5, 2025

"radu, nobody understands your posts, so say whatever" here you go:

The Age of Vibe Coding

Mar 5, 2025

Garry Tan - yCombinator:

Testing AI Models: Roman Game Benchmark

Feb 19, 2025

Grok3 Think mode is impressive. Since ChatGPT came out 27 months ago, at every model release I have tested it with my own test, a roman game that 1) Is...

Cursor's $700M Valuation

Feb 7, 2025

These Cursor mfs forked VSCode, added anthropic keys & a chat window, and now are worth ~$700mm EACH at 25 years old.

Apple iCloud at Scale

Jan 30, 2025

Listening to AAPL earning, I heard that their services revenue is running at 105b/year with a 14% growth rate. Double clicking on that, they have over...

AI Agents Are the New Microservices

Jan 28, 2025

AI Agents & Microservices: A New Paradigm for Dev Teams

Transformers and DNA

Jan 18, 2025

With GPT-4b micro, OpenAI shows that transformers are perfectly suited for protein engineering. Think about it - transformers process sequences of...

DeepSeek V3: How They Trained a Frontier Model Efficiently

Dec 27, 2024

Summary of how DeepSeek V3 was so efficient at training a frontier-level model according to Perplexity:

Jensen Huang's 60 Direct Reports

May 30, 2024

Minimizing layers in an organization is empowering. It treats others like adults. This is rare especially outside the United States.