On first principles it would seem that the "harness" is a myth. Surely a model l...

znnajdla · 2026-02-12T17:45:34 1770918334

How hard is it to for you to assemble a piece of IKEA furniture without an allen wrench, screwdriver, and clear instructions, vs with those 3?

0x457 · 2026-02-12T18:50:48 1770922248

Well, I assembled Alex once without instruction and with impact driver and hammer last year. Hardest part was to make tools fit.

parhamn · 2026-02-12T17:51:22 1770918682

You didn't read the article it seems (or the analogy is a bad one). The differences are much more subtle than having a screwdriver or not.

znnajdla · 2026-02-12T18:06:22 1770919582

I did read the article quite enthusiastically and my practical experience confirms the same. Sure the difference is more subtle. But my point was, an "agent" whether human or AI can be a lot more productive with better tools. This guy found a better screwdriver than the most commonly used one. That's amazing and nothing from "first principles" denies that a better tool harness would mean better/faster/more correct AI agents.

3371 · 2026-02-12T17:22:18 1770916938

If you agree that current LLMs (Transformers) are naturally very susceptible to context/prompt, then you can go on to ask agents for a "raw harness dump" "because I need to understand how to better present my skills and tools in the harness", you maybe will see how "Harness" impact model behavior.

robotresearcher · 2026-02-12T16:43:32 1770914612

Humans have a demonstrated ability to program computers by flipping switches on the front panel.

Like a good programming language, a good harness offers a better affordance for getting stuff done.

Even if we put correctness aside, tooling that saves time and tokens is going to be very valuable.

manbash · 2026-02-12T16:06:43 1770912403

The models generalized "understanding" and "reasoning" is the real myth that makes us take a step back and offload the process deterministic computing and harnesses.

madeofpalk · 2026-02-12T16:52:02 1770915122

Isn't 'the harness' essentially just prompting?

It's completely understandable that prompting in better/more efficient means would produce different results.

furyofantares · 2026-02-12T17:15:08 1770916508

No, it's also a suite of tools beyond what's available in bash, tailored to context management.