[HN Gopher] Replacing Judges with Juries: Evaluating LLM Generat...
___________________________________________________________________
Replacing Judges with Juries: Evaluating LLM Generations with a
Panel of Models
Author : Jimmc414
Score : 38 points
Date : 2024-04-30 19:20 UTC (3 hours ago)
(HTM) web link (arxiv.org)
(TXT) w3m dump (arxiv.org)
| xianshou wrote:
| I was going to knock this for incrementally improving performance
| while massively increasing costs, but it's actually 7x _less_
| expensive. Not bad.
| crooked-v wrote:
| I guess the whole "it's easier to be a critic than a writer"
| thing applies to LLMs too.
| cpeterso wrote:
| This reminds me of science fiction author Peter Watts' novella
| "The Freeze-Frame Revolution", where a space ship has two AIs:
| one that has been running for a million years and another that is
| reboot daily to start with a fresh state. The long-running AI
| confers with reboot AI for a second opinion on important issues.
| The second AI doesn't know it's "killed" daily, but eventually
| starts to suspect. And this is just a small subplot! If you like
| hard SF jam-packed with big ideas, I highly recommend "The
| Freeze-Frame Revolution" and Watts' other novels.
___________________________________________________________________
(page generated 2024-04-30 23:01 UTC)