Astro - Hacker News

kristjansson 42 minutes ago

> The phrase "frontier model" is starting to mean two things. One is a checkpoint. The other is a system boundary.

LLM-isms aside, I don't think we want this to be the case? An LLM, for all its complexity, is something that can be reasoned about. It's picking the next token, until it hits an EOS. The semantics imposed on those tokens (reasoning, tool call, etc.) are up to the user('s harness) to decide and act on. The more that's pushed behind the facade, the harder it is to achieve sufficient understanding of the model's behavior s.t. one can compose it into larger abstractions. Perhaps the performance (and the adherence to an interface/contract) compensate? But swapping from Opus or 5.5 to this or Fugu seems like a much bigger change than swapping between different 'base' models.

Xx_crazy420_xX 34 minutes ago

I might be wrong, but I strongly suspect that Fable 5 is already something in this shape, considering the long time to first token while having normal throughput.

getcrunk 15 minutes ago

Everyone has been saying it’s all about the harness. This is an obvious result of that.

I think an optimal solution would be to have more seamless integration between harness and router roles, since each is only half the picture.

jerpint 27 minutes ago

Solutions like these are really cementing the view that LLMs are becoming a commodity

droidjj an hour ago

Can we please stop submitting fully AI-generated text to HN?

tensegrist an hour ago

at least 50% of the front page would disappear if this were enforced
- jghn 22 minutes ago
  
  Don’t threaten me with a good time
- Escapade5160 3 minutes ago
  
  So be it.
- folkrav 29 minutes ago
  
  I'd be perfectly okay with that.

alchemist1e9 an hour ago

This should help with better utilizing a heterogeneous collection of inference hardware.