[HN Gopher] GPT Overperformance over Humans in Cognitive Reframi...
       ___________________________________________________________________
        
       GPT Overperformance over Humans in Cognitive Reframing of Negative
       Scenarios
        
       Author : CharlesW
       Score  : 30 points
       Date   : 2024-04-21 20:04 UTC (2 hours ago)
        
 (HTM) web link (osf.io)
 (TXT) w3m dump (osf.io)
        
       | wongarsu wrote:
       | I certainly didn't have "GPT-4 scores higher on empathy than
       | actual humans" on my bingo card. That's quite impressive, even if
       | the task played to GPT's strengths, and it competed against
       | people being paid $12/h to fill out studies on prolific.
        
         | ben_w wrote:
         | Neither did I, but I ought to have due to Moravec's paradox,
         | how effective even ELIZA was at that, and how cute animals
         | affect us.
        
         | zeroonetwothree wrote:
         | It seems to be sensitive to exactly how it was scored. Humans
         | did better at certain types of tasks and GPT better at other
         | types. And either way I wouldn't say this is "empathy"
        
           | wongarsu wrote:
           | It wasn't empathy, but it did generate reframings that human
           | reviewers scored higher on the metric "this rethinking is
           | empathic". So it was better at generating the impression of
           | empathy. Which is the same standard we generally apply to
           | humans when judging their empathy, even if it is subtly
           | wrong.
        
       | Grimblewald wrote:
       | I think what what helps here is that gpt4 will be closer to the
       | average human than a random human and so its responses will, on
       | average, be more relatable. I think that when youre paired with
       | tge right human and your biases are both in the same direction
       | from the mean that synergistic effect wont be beatable by an LLM,
       | but it doesnt surprise me that it will out perform humans at
       | being the best on average. Heck, id wager gpt3 would be better as
       | well.
        
       | debo_ wrote:
       | "Humans are, on average, even bigger assholes than computers"
        
       | hedora wrote:
       | I guess this is good news for the Voight-Kampff test.
       | 
       | https://bladerunner.fandom.com/wiki/Voight-Kampff_test
        
       ___________________________________________________________________
       (page generated 2024-04-21 23:02 UTC)