[HN Gopher] Replacing Judges with Juries: Evaluating LLM Generat...
       ___________________________________________________________________
        
       Replacing Judges with Juries: Evaluating LLM Generations with a
       Panel of Models
        
       Author : Jimmc414
       Score  : 38 points
       Date   : 2024-04-30 19:20 UTC (3 hours ago)
        
 (HTM) web link (arxiv.org)
 (TXT) w3m dump (arxiv.org)
        
       | xianshou wrote:
       | I was going to knock this for incrementally improving performance
       | while massively increasing costs, but it's actually 7x _less_
       | expensive. Not bad.
        
       | crooked-v wrote:
       | I guess the whole "it's easier to be a critic than a writer"
       | thing applies to LLMs too.
        
       | cpeterso wrote:
       | This reminds me of science fiction author Peter Watts' novella
       | "The Freeze-Frame Revolution", where a space ship has two AIs:
       | one that has been running for a million years and another that is
       | reboot daily to start with a fresh state. The long-running AI
       | confers with reboot AI for a second opinion on important issues.
       | The second AI doesn't know it's "killed" daily, but eventually
       | starts to suspect. And this is just a small subplot! If you like
       | hard SF jam-packed with big ideas, I highly recommend "The
       | Freeze-Frame Revolution" and Watts' other novels.
        
       ___________________________________________________________________
       (page generated 2024-04-30 23:01 UTC)