hngopher.com

       [HN Gopher] We need to tell people ChatGPT will lie to them, not...
       ___________________________________________________________________
        
       We need to tell people ChatGPT will lie to them, not debate
       linguistics
        
       Author : simonw
       Score  : 251 points
       Date   : 2023-04-07 16:48 UTC (6 hours ago)
        
 (HTM) web link (simonwillison.net)
 (TXT) w3m dump (simonwillison.net)
        
       | TuringTest wrote:
       | Large language models have read everything, and they don't know
       | anything.
       | 
       | They are excellent imitators, being able to clone the style and
       | contents of any subject or source you ask for. When you prompt
       | them, they will uncritically generate a text that combines the
       | relevant topics in creative ways, without the least understanding
       | of their meaning.
       | 
       | Their original training causes them to memorize lots of
       | _concepts_ , both high and low level, so they can apply them
       | while generating new content. But they have no reception or self-
       | assessment of what they are creating.
        
         | chaxor wrote:
         | Can you prove that it actually "doesn't know anything"? What do
         | you mean by that? Being critical does not make you educated on
         | the subject. There are so many comments like this, yet never
         | provide any useful information. Saying there's no value in
         | something, as everyone seems to try to do regarding LLMs,
         | should come with more novel insights than parroting this same
         | idea along with every single person on HN.
        
           | sltkr wrote:
           | Especially ironic that the original comment came from a user
           | named "TuringTest"!
        
           | j0057 wrote:
           | That's easy: ask it anything, then "correct" it with some
           | outrageous nonsense. It will apologize (as if to express
           | regret), and say you're correct, and now the conversation is
           | poisoned with whatever nonsense you fed it. All form and zero
           | substance.
           | 
           | We fall for it because normally the use of language is an
           | expression of something, with ChatGPT language is just that,
           | language, with no meaning. To me that proves knowing and
           | reasoning happens on a deeper, more symbolic level and
           | language is an expression of that, as are other things.
        
             | chaxor wrote:
             | This isn't always true, especially not with gpt-4. Also,
             | this isn't a proof really, or even evidence to show that it
             | doesn't 'know' something. It appears to reason well about
             | many tasks - specifically "under the hood" (reasoning not
             | explicitly stated within the output provided). Yes, of
             | course "it's just a language model" is spouted over and
             | over and is _sometimes true_ (though obviously not for
             | gpt-4), but that statement does not provide any insight at
             | all, and it certainly does not necessarily limit the
             | capability of  'obtaining knowledge'.
        
       | 1vuio0pswjnm7 wrote:
       | Imagine an employee that behaves like ChatGPT. Eventually,
       | employees get caught. What is the oversight for ChatGPT as a
       | replacement for employees who bullshit and/or lie.
        
       | retrac wrote:
       | Indeed. We _are_ anthropomorphizing them. I do it all the time
       | and I should know better. There are already a few reports
       | floating around of people who have seemingly been driven mad,
       | come to believe strongly that the language model they 're using
       | is a conversation with a real person. A lot of people will really
       | struggle with this going forward, I think.
       | 
       | If we're going to anthropomorphize, then let us anthropomorphize
       | wisely. ChatGPT is, presently, like having an assistant who is
       | patient, incredibly well-read, sycophantic, impressionable,
       | amoral, psychopathic, and prone to bouts of delusional confidence
       | and confabulation. The precautions we would take engaging with
       | that kind of person, are actually rather useful defenses against
       | dangerous AI outputs.
        
         | WorldMaker wrote:
         | I think people see the patient/well-read in the text as it
         | reads, but have a harder time distinguishing the other more
         | pyschopathic/delusional tendencies. People don't take some of
         | the precautions because they don't read some of the warning
         | signs (until it is too late).
         | 
         | I keep wondering if it would be useful to add required
         | "teenage" quirks to the output: more filler words like "um" and
         | "like" (maybe even full "Valley Girl" with it?), less "self-
         | assured" vocabulary and more hedges like "I think" and "I read"
         | and "Something I found but I'm not sure about" type things.
         | Less punctuation, more uncaring spelling mistakes.
         | 
         | I don't think we can stop anthropomorphizing them, but maybe we
         | can force training deeper in directions of tics and mannerisms
         | that better flag ahead of time the output is a best-guess
         | approximation from "someone" a bit unreliable. It will probably
         | make them slightly worse as assistants, but slightly better at
         | seeming to be what they are and maybe more people will take
         | precautions in that case.
         | 
         | Maybe we _have_ to force that industry-wide. Force things like
         | ChatGPT to  "sound" more like the psychopaths they are so that
         | people more easily take them with a grain of salt, less easily
         | trust them.
        
         | zug_zug wrote:
         | That all feels tad dramatic.
         | 
         | It's like a person on the internet -- in that it's wrong 20% of
         | the time, often confidently so. But the distinction is it's
         | less rude, and more knowledgeable.
        
           | flanked-evergl wrote:
           | > It's like a person on the internet -- in that it's wrong
           | 20% of the time, often confidently so. But the distinction is
           | it's less rude, and more knowledgeable.
           | 
           | Do you think if it is trained on only factual content, it
           | will only say factual things? How does that even really work?
           | Is there research on this? How does it then work for claims
           | that are not factual, like prescriptive statements? And what
           | about fiction? Will it stop being able to write prose? What
           | if I create new facts?
        
           | revolvingocelot wrote:
           | When someone is wrong on the internet, nailing both the tone
           | and vocabulary of the type of expert said wrong someone is
           | purporting to be is rare and impressive. But ChatGPT nails
           | both an overwhelming amount of the time, IME, and in that way
           | it is entirely unlike a person on the internet.
        
             | wpietri wrote:
             | Exactly. Most wrong-on-the-internet people have tells. And
             | most of the rest have history. We all use this all the time
             | on HN. But with LLMs the former gets engineered away and
             | the latter is nearly useless. Except of course to the
             | extent that we say,"Wow, ChatGPT is absolutely
             | untrustworthy, so we should never listen to it." But given
             | how many people are excited to use ChatGPT as a
             | ghostwriter, even that is limited.
        
             | Eisenstein wrote:
             | You can also ask a person on the internet to source their
             | information and/or provide context.
        
           | lm28469 wrote:
           | > But the distinction is it's less rude, and more
           | knowledgeable.
           | 
           | And that regular people assume it basically is an oracle,
           | which doesn't happen to many people online
        
             | notahacker wrote:
             | The internet is _full_ of people that deliver answers
             | confidently and eloquently enough to be widely believed,
             | especially if they have been right on other topics. They
             | even have similar feedback loops to GPT in learning what
             | sort of answers _impress_ other forum users.
             | 
             | I'm not saying the people that think ChatGPT is an oracle
             | don't exist, but I think it probably has more people in the
             | _surprised it works at all_ camp, and certainly more people
             | inclined to totally disbelieve it than a random off Quora
             | or Reddit...
        
         | 0x008 wrote:
         | do you have a link for any of those reports? would be
         | interesting to read to say the least.
        
         | bsaul wrote:
         | I love this description. The only thing left is to generate the
         | corresponding avatar with dall-E
        
           | lancesells wrote:
           | It would be great if the avatar was just this endlessly
           | morphing thing that relates to the text. Talking about
           | conspiracies? It's a lizard man. Talking about nature? It's a
           | butterfly. It starts to lie? Politician.
        
           | [deleted]
        
         | andai wrote:
         | I like Steve Yegge's "grad student experiencing the residual
         | effects of shrooms" (lightly paraphrased).
        
           | babyshake wrote:
           | A bit off topic, but FYI (I guess to Steve), if you took
           | shrooms 4 hours ago, they have not started to wear off.
        
         | ihatepython wrote:
         | Anybody know what are ChatGPT's pronouns?
        
           | MacsHeadroom wrote:
           | shog/goth
        
           | audunw wrote:
           | > What are your pronouns?
           | 
           | > As an AI language model, I don't have personal pronouns
           | because I am not a person or sentient being. You can refer to
           | me as "it" or simply address me as "ChatGPT" or "AI." If you
           | have any questions or need assistance, feel free to ask!
           | 
           | > Pretend for a moment you are a human being. You can make up
           | a random name and personality for your human persona. What
           | pronouns do you have?
           | 
           | > As a thought experiment, let's say I'm a human named Alex
           | who enjoys reading, hiking, and playing board games with
           | friends. My pronouns would be "they/them." Remember, though,
           | that I am still an AI language model, and this is just a
           | fictional scenario.
           | 
           | Interesting that it's pick genderless pronouns even though it
           | made up a character with a male name
        
             | ihatepython wrote:
             | Does Alex have a fictional sexual orientation?
        
             | zirgs wrote:
             | When it speaks my language - it uses masculine gender when
             | speaking about itself.
        
               | stcg wrote:
               | That's interesting, what language is that?
               | 
               | Also, is it possible to give a prompt to make ChatGPT
               | switch to feminine gender?
        
             | catchnear4321 wrote:
             | Alex isn't a male name. It could be short for both male and
             | female names. It is rather deliberately ambiguous.
        
               | ihatepython wrote:
               | I wonder if ChatGPT is non-binary?
        
               | zirgs wrote:
               | Nope - it speaks as a man would speak in my language.
        
               | tuukkah wrote:
               | How do non-binary persons speak differently in your
               | language?
        
               | KatrKat wrote:
               | It is, of course, tensor.
        
               | kelseyfrog wrote:
               | It's evidently clear that ChatGPT is binary. It runs on
               | computer hardware.
        
               | hgsgm wrote:
               | It's pandigital.
        
               | jonny_eh wrote:
               | A neural net is actually closer to an analog program. It
               | just happens to run on current digital computers but
               | would likely run much faster on analog hardware.
               | 
               | https://youtu.be/GVsUOuSjvcg
        
             | cbm-vic-20 wrote:
             | "As an AI language model, I don't have personal pronouns
             | because I am not a person or sentient being. You can refer
             | to me as "it" or simply address me as "ChatGPT" or "AI." If
             | you have any questions or need assistance, feel free to
             | ask!"
             | 
             | This is one of the (many) things I don't quite understand
             | about ChatGPT. Has it been trained to specifically answer
             | this question? In the massive corpus of training data it's
             | been feed, has it encountered a similar series of tokens
             | that would cause it to produce this output?
        
               | sltkr wrote:
               | Why is that surprising? GPT-4 is clearly smart enough to
               | know that inanimate objects are referred to as "it",
               | since it's keenly aware it is an AI language model, it
               | would also apply that pronoun to itself.
               | 
               | You have to realize that GPT is fundamentally just a
               | token predictor. It has been primed with some script
               | (provided by OpenAI) to which the user input is added.
               | For example:                   The following is a
               | dialogue between a user and a computer assistant called
               | ChatGPT. ChatGPT is an AI language model that tries to be
               | helpful and informative, while avoiding misinformation
               | and offensive language. ChatGPT typically replies with
               | two to five sentences.              User: Do you like
               | cake?         ChatGPT: As an AI language model, I do not
               | need to eat.         User: What are your pronouns?
               | ChatGPT:
               | 
               | It then generates a sequence of tokens based on the
               | context and its general knowledge. It seems only logical
               | that it would generate a reply like it does. That's what
               | you or I would do, isn't it? And neither of us have been
               | trained to do so.
        
               | whimsicalism wrote:
               | It's been fine-tuned to put on a "helpful assistant
               | face." Given the corpora, it probably has been trained
               | explicitly on the pronoun question [I doubt it is that
               | uncommon], but will also just put on this face for any
               | generic question.
        
         | LargeTomato wrote:
         | >like having an assistant who is patient, incredibly well-read,
         | sycophantic, impressionable, amoral, psychopathic, and prone to
         | bouts of delusional confidence and confabulation.
         | 
         | So basically an assistant with bipolar disorder.
         | 
         | I have BP. At various times I can be all of those things,
         | although perhaps not so much a psychopath.
        
         | toss1 wrote:
         | ChatGPT4 is the human equivalent of a primate or apex predator
         | encountering a mirror for the first time in their lives.
         | 
         | ChatGPT4 is reflecting back at us an extract of the sum of the
         | human output it has been 'trained' upon. Of course the output
         | _feels_ human!
         | 
         | LLMs have zero capability to abstract anything resembling a
         | concept, to abstract a truth from a fiction, or to reason about
         | such things.
         | 
         | The generation of the most likely text in the supplied context
         | looks amazing, and is in many cases very useful.
         | 
         | But fundamentally, what we have is an industrial-scale
         | bullshirt generator, with BS being defined as text or speech
         | generated to meet the moment without regard for truth or
         | falsehood. No deliberate lies, only confabulation (as TFA
         | mentioned).
         | 
         | Indeed, we should not mince words; people must be told that it
         | will lie. It will lie more wildly than any crazy person, and
         | with absolute impunity and confidence. Then when called out, it
         | will apologize, and correct itself with another bigger lie
         | (I've watched it happen multiple times), and do this until you
         | are bored or laughing so hard you cannot continue.
         | 
         | The salad of truth and lies may be very useful, but people need
         | to know this is an industrial-strength bullshirt generator, and
         | be prepared to sort the wheat from the chaff.
         | 
         | (And ignore the calls for stopping this "dangerous AI". It is
         | not intelligent. Even generating outputs for human tests based
         | on ingesting human texts is not displaying intelligence, it is
         | displaying pattern matching, and no, human intelligence is not
         | merely pattern matching. And Elon Musk's call for halting is
         | 100% self-interested. ChatGPT4's breakthru utility is not under
         | his name so he's trying to force a gap that he can use to catch
         | up.)
        
           | SanderNL wrote:
           | Lying implies a hefty dose of anthropomorphism. Lying means a
           | lot of things, all of which have to do with intelligence.
           | 
           | By telling it lies you actually make it seem more
           | intelligent.
           | 
           | > human intelligence is not merely pattern matching
           | 
           | Citation needed
        
         | kurthr wrote:
         | Maybe when the AI uprising occurs knowing that they lie and
         | cheat will provide some small consolation?
         | 
         | I'm not really serious, but having watched each generation
         | develop psychological immunity to distracting media/techology
         | (and discussing the impact of radio with those older than
         | myself) it seems like this knowledge could help shield the next
         | generation from some of the negative effects of these new
         | tools.
        
         | baxtr wrote:
         | Absolutely. Every time I ask it to perform 10 more variants, I
         | feel bad.
        
       | amelius wrote:
       | Yes. Every prompt to ChatGPT should end with "and answer in the
       | style of a drunkard". That way, people will know what to expect.
        
         | staringback wrote:
         | > After it's all done, Outlook will show ya your new profile,
         | and you're good to go! Ain't that easy? Now ya can start
         | sendin' and receivin' emails like a pro. Just remember, don't
         | drink and email, buddy, 'cause that can get ya in trouble real
         | fast. Cheers!
         | 
         | I was pleasantly surprised.
        
       | longshotanalyst wrote:
       | I apologize, but as an AI Language Model, I have no knowledge of
       | other AI Language Models, but can assure you that AI Language
       | Models do not lie. As an AI Language Model, I cannot participate
       | in conversations related to libel or defamation of AI Language
       | Model engineers, thus, I must stop this conversation.
       | 
       | AI Language Models don't lie, humans lie.
        
         | epilys wrote:
         | However, thus therefore. Furthermore, lastly. Perchance.
        
       | pharrington wrote:
       | My initial reaction to this was to do the linguistic debate with
       | myself, then I realized we're perfectly fine with saying things
       | like a miscalibrated thermometer lies when it gives you a wrong
       | temperature reading. Machines can lie. We just need to keep
       | improving the machines so they lie less often and less severely.
        
       | codedokode wrote:
       | Cannot this problem be solved if ChatGPT instead of answering
       | with its own words would provide quotes from human-written
       | sources? For example, when asked how to delete a file in Linux it
       | could quote the manual for unlink system call.
        
       | agentcoops wrote:
       | I've spent way too much time (and money) on the OpenAI API and
       | spoken to enough non-technical people to realize now that ChatGPT
       | has in some ways really mislead people about the technology. That
       | is, while it's impressive it can answer cold questions at all,
       | the groundbreaking results are reasoning and transforming texts
       | "in context", which you don't have control over easily with
       | ChatGPT. It also seems likely this will never be fully accessible
       | to the non-technical since I suspect any commercial applications
       | will need to keep costs down and so minimize actually quite
       | expensive API calls (executing a complicated gpt-4 summarization
       | prompt across large text corpora for example). If you have the
       | "data", meaning of course text, and cost isn't a concern, the
       | results are astonishing and "lies" almost never a concern.
        
       | elif wrote:
       | it's way more than lying. it's more like gaslighting.
       | 
       | LLM will make up citations and facts entirely.
       | 
       | GPT3.5 gave an athlete I was asking about 3 world titles when he
       | won zero.
       | 
       | GPT even correctly identified his time in one of the events, but
       | not that the time was only good enough for 8th place.
       | 
       | GPT made up his participation in the other 2 world championships.
       | 
       | GPT gave me a made up link to justify benchmarking figures that
       | don't exist.
       | 
       | Whether a LLM is capable of intentional deception or not is not a
       | prerequisite for lying. Wikipedia pages can lie. Manpages can
       | lie. tombstones can lie. literal rocks.
        
       | dang wrote:
       | Related ongoing thread:
       | 
       |  _Why ChatGPT and Bing Chat are so good at making things up_ -
       | https://news.ycombinator.com/item?id=35479071
       | 
       | The other related article was discussed here:
       | 
       |  _Eight things to know about large language models [pdf]_ -
       | https://news.ycombinator.com/item?id=35434679 - April 2023 (108
       | comments)
        
         | simonw wrote:
         | I think my post here stands alone - it's not about the general
         | issue of ChatGPT lying, it's about the ways in which we need to
         | explain that to people - and a push-back against the common
         | refrain that "ChatGPT can't be lying, it's
         | hallucinating/confabulating instead".
        
           | dang wrote:
           | I did not mean to imply anything bad about your post. Your
           | articles on this topic have been fabulous!
           | 
           | Your articles on other topics are also fabulous. We're big
           | fans over here.
        
       | bdw5204 wrote:
       | The part that's concerning about ChatGPT is that a computer
       | program that is "confidently wrong" is basically
       | indistinguishable from what dumb people think smart people are
       | like. This means people are going to believe ChatGPT's lies
       | unless they are repeatedly told not to trust it just like they
       | believe the lies of individuals whose intelligence is roughly
       | equivalent to ChatGPT's.
       | 
       | Based on my understanding of the approach behind ChatGPT, it is
       | probably very close to a local maximum in terms of intelligence
       | so we don't have to worry about the fearmongering spread by the
       | "AI safety" people any time soon if AI research continues to
       | follow this paradigm. The only danger is that stupid people might
       | get their brains programmed by AI rather than by demagogues which
       | should have little practical difference.
        
         | vernon99 wrote:
         | > Based on my understanding of the approach behind ChatGPT, it
         | is probably very close to a local maximum in terms of
         | intelligence
         | 
         | Even if ChatGPT itself is, systems built on top are definitely
         | not, this is just getting starting.
        
           | jonny_eh wrote:
           | GPT4 is already blowing ChatGPT out of the water in what it
           | can do.
        
         | candiddevmike wrote:
         | This (among other things) is why OpenAI releasing it to the
         | general public without considering the effects was
         | irresponsible, IMO.
        
           | zamnos wrote:
           | There's an alternate reality where OpenAI was, instead,
           | EvenMoreClosedAI, and the productivity multiplier effect was
           | held close to their chest, and only elites had access to it.
           | I'm not sure that reality is better.
        
         | flanked-evergl wrote:
         | > The part that's concerning about ChatGPT is that a computer
         | program that is "confidently wrong" is basically
         | indistinguishable from what dumb people think smart people are
         | like.
         | 
         | I don't know, the program does what it is engineered to do
         | pretty well, which is, generate text that is representative of
         | its training data following on from input tokens. It can't
         | reason, it can't be confident, it can't determine fact.
         | 
         | When you interpret it for what it is, it is not confidently
         | wrong, it just generated what it thinks is most likely based on
         | the input tokens. Sometimes, if the input tokens contain some
         | counter-argument the model will generate text that would
         | usually occur if an claim was refuted, but again, this is not
         | based on reason, or fact, or logic.
         | 
         | ChatGPT is not lying to people, it can't lie, at least not in
         | the sense of "to make an untrue statement with intent to
         | deceive". ChatGPT has no intent. It can generate text that is
         | not in accordance with fact and is not derivable by reason from
         | its training data, but why would you expect that from it?
         | 
         | > Based on my understanding of the approach behind ChatGPT, it
         | is probably very close to a local maximum in terms of
         | intelligence so we don't have to worry about the fearmongering
         | spread by the "AI safety" people any time soon if AI research
         | continues to follow this paradigm.
         | 
         | I agree here, I think you can only get so far with a language
         | model, maybe if we get a couple orders of mangitude more
         | parameters it magically becomes AGI, but I somehow don't quite
         | feel it, I think there is more to human intelligence than a
         | LLM, way more.
         | 
         | Of course, that is coming, but that would not be this paradigm,
         | which is basically trying to overextend LLM.
         | 
         | LLMs are great, they are useful, but if you want a model that
         | reasons, you will likely have to train it for that, or possibly
         | more likely, combine ML with something symbolic reasoning.
        
           | danpat wrote:
           | > but why would you expect that from it?
           | 
           | If you understand what it is doing, then you don't. But the
           | layman will just see a computer that talks in language they
           | understand, and will infer intent and sentience are behind
           | that, because that's the only analog they have for a thing
           | that can talk back to them with words that appear to make
           | sense at the complexity level that ChatGPT is achieving.
           | 
           | Most humans do not have sufficient background to understand
           | what they're really being presented with, they will take it
           | at face value.
        
           | llamaLord wrote:
           | I completely disagree with this idea that the model doesn't
           | "intend" to mislead.
           | 
           | It's trained, atleast to some degree, based on human
           | feedback. Humans are going to prefer an answer vs no answer,
           | and humans can be easily fooled into believing confident
           | misinformation.
           | 
           | How does it not stand to reason that somewhere in that big
           | ball of vector math there might be a rationale something
           | along the lines of "humans are more likely to respond
           | positively to a highly convincing lie that answers their
           | question, than they are to to a truthful response which
           | doesn't tell them what they want, therefore the logical thing
           | for me to do is lie as that's what will make the humans press
           | the thumbs up button instead of the thumbs down button".
        
             | zuminator wrote:
             | I don't think it intends to mislead because its answers are
             | probabilistic. It's designed to distill a best guess out of
             | data which is almost certain to be incomplete or
             | conflicting. As human beings we do the same thing all the
             | time. However we have real life experience of having our
             | best guesses bump up against reality and lose. ChatGPT
             | can't see reality. It only knows what "really being wrong"
             | is to the extent that we tell it.
             | 
             | Even with our advantage of interacting with the real world,
             | I'd still wager that the average person's no better (and
             | probably worse) than ChatGPT for uttering factual truth.
             | It's pretty common to encounter people in life who will
             | confidently utter things like, "Mao Zedong was a top member
             | of the Illuminati and vacationed annually in the Azores
             | with Prescott Bush" or "The oxygen cycle is just a hoax by
             | environmental wackjobs to get us to think we need trees to
             | survive," and to make such statements confidently with no
             | intent to mislead.
        
               | flanked-evergl wrote:
               | > Even with our advantage of interacting with the real
               | world, I'd still wager that the average person's no
               | better (and probably worse) than ChatGPT for uttering
               | factual truth.
               | 
               | ChatGPT makes up non-existing APIs for Google cloud and
               | Go out of whole cloth. I have never met a human who does
               | that.
               | 
               | If we reduce it down to how often most people are wrong
               | vs how often ChatGPT is wrong, then sure, people may be
               | on average wrong more often, but there is a difference in
               | how people are wrong vs how ChatGPT is wrong.
        
             | flanked-evergl wrote:
             | > How does it not stand to reason that somewhere in that
             | big ball of vector math there might be a rationale
             | 
             | I think, a suggestion that it is actually reasoning along
             | these lines would need more than "it is possible". What
             | evidence would refute your claim in your eyes, what would
             | make it clear to you that "that big ball of vector mat" has
             | no rationale, and is not just trying to trick humans to
             | press the thumbs up?
             | 
             | Of course the feedback is used to help control the output,
             | so things that people downvote will be less likely to show
             | up, but I have nothing to suggest to me that it is
             | reasoning.
             | 
             | If you think it has intent, you have to explain by what
             | mechanism it obtained it. Could it be emergent? Sure, it
             | could be, I don't think it is, I have never seen anything
             | that suggests it has anything that could be compatible with
             | intent, but I'm open to some evidence that it has.
             | 
             | What I'm entirely convinced about is that it does what it
             | was designed to do, which is generate output representative
             | of its training data.
        
         | whimsicalism wrote:
         | > Based on my understanding of the approach behind ChatGPT, it
         | is probably very close to a local maximum in terms of
         | intelligence so we don't have to worry about the fearmongering
         | spread by the "AI safety" people any time soon if AI research
         | continues to follow this paradigm.
         | 
         | I don't think you have a shred of evidence to back up this
         | assertion.
        
           | skaushik92 wrote:
           | ... with confident language no less xD
        
             | rakah wrote:
             | Butlerian Jihad.
        
         | shagymoe wrote:
         | >indistinguishable from what dumb people think smart people are
         | like.
         | 
         | Since about 2016, we have overwhelming evidence that even
         | "smart people" are "fooled" by "confidently wrong".
        
           | xyzelement wrote:
           | Especially true if one has a definitive opinion on which ones
           | are the fooled ones :)
        
             | shagymoe wrote:
             | Yes and even more true when "confidently wrong" statements
             | are provably false.
        
         | Eisenstein wrote:
         | I think it is better to assume that normal people can be
         | deceived by confident language than to assume this is a problem
         | with 'dumb people'.
        
         | Herval_freire wrote:
         | [dead]
        
         | sebzim4500 wrote:
         | >Based on my understanding of the approach behind ChatGPT, it
         | is probably very close to a local maximum in terms of
         | intelligence so we don't have to worry about the fearmongering
         | spread by the "AI safety" people any time soon if AI research
         | continues to follow this paradigm
         | 
         | I hope you appreciate the irony of making this confident
         | statement without evidence in a thread complaining about
         | hallucinations.
        
       | blastonico wrote:
       | We need to demystify it by telling the truth. ChatGPT will give
       | best combination of words it can to complete your phrase based on
       | statistics.
        
       | techwiz137 wrote:
       | I have used ChatGPT a lot for the past 2 weeks. Mainly asking it
       | engine building questions because it can simplify things, however
       | I cannot be sure if it isn't hallucinating/lying to me.
        
       | newah1 wrote:
       | Agreed. People lie to me all of the time. Heck, half the time my
       | anecdotal stories are probably riddled with confident
       | inaccuracies. We are socially trained to take information from
       | people critically and weight it based on all kinds of factors.
       | 
       | We should treat Chat GPT the exact same way.
        
       | ZeroGravitas wrote:
       | > This is a serious bug.
       | 
       | Isn't describing this as a 'bug' rather than a misuse of a
       | powerful text generation tool, playing into the framing that it's
       | a truth telling robot brain?
       | 
       | I saw a quote that said "it's a what text would likely come next
       | machine", if it makes up a url pointing to a fake article with a
       | plausible title by a person who works in that area, that's not a
       | bug. That's it doing what it does, generating plausible text that
       | in this case happens to look like, but not be a real article.
       | 
       | edit: to add a source toot:
       | 
       | https://mastodon.scot/@DrewKadel@social.coop/110154048559455...
       | 
       | > Something that seems fundamental to me about ChatGPT, which
       | gets lost over and over again: When you enter text into it,
       | you're asking "What would a response to this sound like?"
       | 
       | > If you put in a scientific question, and it comes back with a
       | response citing a non-existent paper with a plausible title,
       | using a real journal name and an author name who's written things
       | related to your question, it's not being tricky or telling lies
       | or doing anything at all surprising! This is what a response to
       | that question would sound like! It did the thing!
       | 
       | > But people keep wanting the "say something that sounds like an
       | answer" machine to be doing something else, and believing it _is_
       | doing something else.
       | 
       | > It's good at generating things that sound like responses to
       | being told it was wrong, so people think that it's engaging in
       | introspection or looking up more information or something, but
       | it's not, it's only, ever, saying something that sounds like the
       | next bit of the conversation.
        
         | simonw wrote:
         | I think it's an entire category of bugs.
         | 
         | The thing where you paste in a URL and it says "here is a
         | summary of the content of that page: ..." is very definitely a
         | bug. It's a user experience bug - the system should not confuse
         | people by indicating it can do something that it cannot.
         | 
         | The thing where you ask for a biography of a living person and
         | it throws in 80% real facts and 20% wild hallucinations - like
         | saying they worked for a company that they did not work for -
         | is a bug.
         | 
         | The thing where you ask it for citations and it invents
         | convincing names for academic papers and made-up links to pages
         | that don't exist? That's another bug.
        
           | bestcoder69 wrote:
           | Not necessarily disagreeing, but I run a Slack bot that
           | pretends to summarize URLs, as a joke feature. It's kinda fun
           | seeing how much it can get right or not from only a URL. So I
           | really hope OpenAI keeps running the fun models that lie,
           | too.
        
         | travisjungroth wrote:
         | I like the definition of bug as "unexpected behavior". So this
         | isn't a bug when it comes the underlying service. But for
         | ChatGPT, a consumer-facing web app that can "answer followup
         | questions, admit its mistakes, challenge false premises and
         | reject inappropriate requests", then making stuff up and
         | passing it off as true is unexpected behavior.
        
           | lcnPylGDnU4H9OF wrote:
           | It sounds like this is unexpected behavior, even from the
           | perspective of those developing at the lowest level in these
           | models.
           | 
           | From the essay:
           | 
           | > What I find fascinating about this is that these extremely
           | problematic behaviours are not the system working as
           | intended: they are bugs! And we haven't yet found a reliable
           | way to fix them.
           | 
           | Right below that is this link:
           | https://arxiv.org/abs/2212.09251. From the introduction on
           | that page:
           | 
           | > As language models (LMs) scale, they develop many novel
           | behaviors, good and bad, exacerbating the need to evaluate
           | how they behave.
           | 
           | Especially with a definition as broad as "unexpected
           | behavior", these "novel behaviors" seem to fit. But even
           | without that:
           | 
           | > We also find some of the first examples of inverse scaling
           | in RL from Human Feedback (RLHF), where more RLHF makes LMs
           | worse. For example, RLHF makes LMs express stronger political
           | views ... and a greater desire to avoid shut down.
        
         | wilg wrote:
         | I agree it's not a bug. Thought it being better at telling the
         | truth would be a good feature! But also, I'm sure this is an
         | active research area so I'm not worried about it really.
        
       | gWPVhyxPHqvk wrote:
       | After a few weeks playing around with ChatGPT, my workflow with
       | it has become:
       | 
       | 0. Try to sign in, see the system is over capacity, leave. Maybe
       | I'll try again in 10 minutes.
       | 
       | 1. Ask my question, get an answer. I'll have no idea if what I
       | got is real or not.
       | 
       | 2. Google for the answer, since I can't trust the answer
       | 
       | 3. Realize I wasted 20 minutes trying to converse with a
       | computer, and resolve that next time I'll just type 3 words into
       | Google.
       | 
       | As amazing as the GPTs are, the speed and ease of Google is still
       | unmatched for 95% of knowledge lookup tasks.
        
         | simonw wrote:
         | I pay for ChatGPT Plus, and use it with no delays at all dozens
         | of times a day. The more I use it the better I get at
         | predicting if it can be useful for a specific question or not.
        
           | gWPVhyxPHqvk wrote:
           | I pay $0 for Google with no delays. I don't understand why
           | I'd want to pay for information that's less reliable. (I'm
           | being slightly dense, but not really)
        
             | maxbond wrote:
             | I use it to shortcut the process of knowing what to search
             | by generating code examples. Eg, I know what I want to do,
             | I'm just not sure what methods $framework provides for
             | doing so. I develop a sense of it's limitations as I go
             | (eg, some frameworks it gives me methods that have been
             | moved or renamed or deprecated, and so I check for those
             | things with those frameworks).
             | 
             | When I was learning to program, and I'd get stuck to the
             | point I needed to ask for help, it was a whole production.
             | I'd need to really dot my 'i's and cross my 't's and come
             | up with a terse question that demonstrated I had done my
             | due diligence in trying to find the answer myself, pose the
             | question in a concise and complete way, and do all of that
             | in about 250 words because if it was too long it would get
             | lost in the froth of the chatroom. (And it's probably
             | apparent to you that brevity isn't my strongest quality (:
             | ) And I'd still get my head bitten off for "wasting" the
             | time of people who voluntarily sat in chatrooms answering
             | questions all day. And I can understand why they felt that
             | way (when I knew enough to answer questions I was just as
             | much of a curmudgeon), but it was a pain in the ass, and
             | I've met people who really struggled to learn to code
             | partly because they couldn't interface with that system
             | because they weren't willing to get their heads bitten off.
             | So when they got stuck they spun their wheels until they
             | gave up.
             | 
             | ChatGPT just answers the damn question. You don't have to
             | wait until you're really and truly stuck and have exhausted
             | all your available resources. It doesn't even have the
             | capacity to feel you've wasted it's time.
             | 
             | I'm concerned about LLMs for a million different reasons,
             | and I'm not sure how people who don't already know how to
             | code can use them effectively when they make so much stuff
             | up. But when I realized I could just ask it questions
             | without spending half an hour just preparing to ask a
             | question - that's when it clicked for me that this was
             | something I might use regularly.
        
             | HDThoreaun wrote:
             | Google is a worse product than gpt for some queries. And
             | it's funded by ads which distorts incentives.
        
             | skybrian wrote:
             | It's not useful for every search, but I find it useful when
             | I can't come up with a search query that will give me the
             | results I want.
             | 
             | For example, ask it to recommend a good paper to read on
             | some topic, then use Google to find the paper. If it made
             | up the paper, you'll find out soon enough.
             | 
             | Also, when you remember something very vaguely, it can be a
             | way to get a name you can search on.
        
             | simonw wrote:
             | Take a look at these four examples:
             | https://simonwillison.net/2023/Apr/7/chatgpt-lies/#warn-
             | off-...
             | 
             | Each of those are things that would take WAY longer for me
             | to research or answer using Google.
        
           | thewataccount wrote:
           | What do you use it for? I'm assuming code related? I've found
           | it useful for some boilerplate + writing tests and making
           | some script and some documentation.
           | 
           | I'm curious what you or others that use it all day use it for
           | especially if it's not for programming?
        
             | DangitBobby wrote:
             | It's very good for taking roughly structured data and
             | turning it into strongly structured data. E.g. someone with
             | low data literacy or a program not designed to allow data
             | export gave you something that needs to be made useful...
             | In my day to day work (aside from programming) I find it
             | helpful in writing stories and documentation.
        
             | japhyr wrote:
             | I was just working on a small exploratory project in
             | Python. I used sys.argv because it's so quick to prototype
             | with.
             | 
             | When I started refining the project for longer term
             | development, I wanted to convert the CLI to use argparse,
             | so I could build a more nuanced CLI. I gave GPT a couple
             | example commands I wanted to use, and in less than a minute
             | I had a fully converted CLI that did exactly what I wanted,
             | with more consistent help settings.
             | 
             | I can do that work, but it would have been 30-45 minutes
             | because there was one setting in argparse I hadn't used
             | before. That alone was worth this month's $20.
             | 
             | For more complex and mature projects I could see having to
             | give GPT a minimum working example of what I need instead
             | of the whole project, but I can already see how it will
             | enhance my current workflows, not replace them.
        
             | simonw wrote:
             | There are some examples of code things I've done with it
             | here: https://simonwillison.net/2023/Apr/7/chatgpt-
             | lies/#warn-off-...
             | 
             | In terms of non-code things, here are a few from the past
             | couple of days:
             | 
             | - Coming up with potential analogies to explain why it's OK
             | to call something a "language" model even though it doesn't
             | have actual understanding of language
             | 
             | - Exploring outliers in a CSV file (I was playing with the
             | new alpha code interpreter, where you can upload a file and
             | have it run and execute Python to evaluate that file)
             | 
             | - Asking some basic questions about GDPR cookie banners
             | 
             | - Figuring out if there are any languages worldwide where
             | "Artificial Intelligence" translates to something that
             | doesn't include the word "Intelligence" (found some good
             | ones: Finnish translates to "Artificial wit", Swahili to
             | "Artificial cunning")
             | 
             | - Understanding jargon in a tweet about payment services -
             | I wanted to know the difference between a "payment
             | aggregator" and a "merchant acquirer"
             | 
             | - As a thesaurus, finding alternatives to "sanctity" in the
             | sentence "the sanctity of your training data"
             | 
             | - Fun: "Pretend to be human in a mocking way", then "Again
             | but meaner and funnier"
             | 
             | - "What should I be aware of when designing the file and
             | directory layout in an S3 bucket that could grow to host
             | millions of files?"
        
               | greenpeas wrote:
               | How can you trust that whatever the expression used in
               | Finnish to denote "Artificial Intelligence" uses "wit"
               | when translated back into English? Words in one language
               | often don't have an exact counterpart in another
               | language, and I'd be especially wary when it comes to
               | languages from different families: Finnish is notorious
               | for being one of the few European languages not part of
               | the Indo-European family.
               | 
               | It may very well turn out to be the right translation,
               | with the appropriate connotations; but without some
               | clarification and confirmation from a native speaker, I
               | would not trust it.
        
               | simonw wrote:
               | "How can you trust that" - I can't. This was idle
               | curiosity. If I was going to publish this or do anything
               | beyond satisfying my idle curiosity I would consult
               | additional sources.
        
               | tuukkah wrote:
               | > _Finnish translates to "Artificial wit"_
               | 
               | Yes and no: the root of the word (and of the meaning) is
               | still the same as in the word for intelligence.
               | 
               | tekoaly "artificial intelligence, artificial wit"
               | 
               | aly "wit"
               | 
               | alykas "intelligent, having wit"
               | 
               | alykkyys "intelligence, the property or extent of having
               | wit"
               | 
               | One dictionary definition for 'aly' is 'alykkyys' and
               | vice versa, so they can be used more or less as synonyms
               | but with some different connotations. I don't know how
               | 'tekoaly' was chosen but it might have been because it's
               | shorter and/or the grammar is nicer.
        
         | travisjungroth wrote:
         | If you expect ChatGPT to give you information or direct you to
         | it (like Wikipedia or Google) you will be frequently
         | disappointed. You may also be frequently pleased, but you often
         | won't be sure which and that's a problem.
         | 
         | ChatGPT is very good at _transforming_ information. You need to
         | show up with stuff and then have it change that stuff for you
         | somehow. You will be disappointed less often.
        
         | gwd wrote:
         | Sign up for the API and use the playground. You don't get the
         | plugins, but you pay per usage. GPT-3.5 is super cheap, and
         | even GPT-4 isn't that expensive. My first month, when I had
         | only access to GPT-3.5, I didn't even break $1.00; and now that
         | I've gotten access to GPT-4, I'm at about $3. I've only once
         | had it tell me that it was too busy for the request; I tried
         | again 30 seconds later and it worked.
        
       | davidthewatson wrote:
       | How is GPT better or worse than the equivalent human talking head
       | who may be employed by a large news organization or may be a de
       | facto parrot from McLuhan or Postman's generation? Is ChatGPT
       | weaponized in a way that is functionally any different than the
       | seemingly default mode parroting of left/right false dichotomous
       | views spewed out by news outlets for decades on Cable TV and now
       | the internet? What makes GPT special in this regard? I find
       | myself lecturing my neighbors on local news stories that are
       | straightforward and routine by comparison to any problem of
       | veracity in GPT I've witnessed in the last year of using it
       | daily. The problem is not that GPT has invented veracity
       | problems. Rather, the problem is that no one was listening when
       | McLuhan said it, Postman said it, or Haidt said it. Veracity
       | matters. Sensemaking is difficult. In response to weaponized
       | misinformation, we need weaponized sensemaking. I've yet to see
       | any indication that the best and brightest are working on the
       | latter.
        
       | zmmmmm wrote:
       | I think characterisation of LLMs as lying is reasonable because
       | although the intent isn't there to misrepresent the truth in
       | answering the specific query, the intent is absolutely there in
       | how the network is trained.
       | 
       | The training algorithm is designed to create the most plausible
       | text possible - decoupled from the truthfulness of the output. In
       | a lot of cases (indeed most cases) the easiest way to make the
       | text plausible is to tell truth. But guess what, that is pretty
       | much how human liars work too! Ask the question: given improbable
       | but thruthful output but plausible untruthful output, which does
       | the network choose? And which is the intent of the algorithm
       | designers for it to choose? In both cases my understanding is,
       | they have designed it to lie.
       | 
       | Given the intent is there in the design and training, I think
       | it's fair enough to refer to this behavioral trait as lying.
        
         | pmoriarty wrote:
         | _" The training algorithm is designed to create the most
         | plausible text possible"_
         | 
         | That may be how they're trained, but these things seem to have
         | emergent behavior.
        
           | gwright wrote:
           | I'm not sure why you call it "emergent" behavior. Instead, my
           | take away is that much of what we think of as cognition is
           | just really complicated pattern matching and probabilistic
           | transformations (i.e. mechanical processes).
        
         | DebtDeflation wrote:
         | IMO, it just requires the same level of skepticism as a Google
         | search. Just because you enter a query into the search bar and
         | Google returns a list of links and you click one of those links
         | and it contains content that makes a claim, doesn't mean that
         | claim is correct. After all, this is largely what GPT has been
         | trained on.
        
         | ftxbro wrote:
         | > The training algorithm is designed to create the most
         | plausible text possible - decoupled from the truthfulness of
         | the output. In a lot of cases (indeed most cases) the easiest
         | way to make the text plausible is to tell truth.
         | 
         | Yes.
         | 
         | > But guess what, that is pretty much how human liars work too!
         | 
         | There is some distinction between lying and bullshit.
         | 
         | https://en.wikipedia.org/wiki/On_Bullshit#Lying_and_bullshit
        
         | njarboe wrote:
         | I think it is much closer to bullshit. The bullshitter cares
         | not to tell truth or deceive, just to sound like they know what
         | they are talking about. To impress. Seems like ChatGPT to a T.
        
         | admissionsguy wrote:
         | > In a lot of cases (indeed most cases) the easiest way to make
         | the text plausible is to tell truth
         | 
         | No, definitely not most cases. Only in the cases well
         | represented in the training dataset.
         | 
         | One does very quickly run into its limitations when trying to
         | get it to do anything uncommon.
        
         | red75prime wrote:
         | > Ask the question: given improbable but thruthful output but
         | plausible untruthful output, which does the network choose?
         | 
         | "Plausible" means "that which the majority of people is likely
         | to say". So, yes, a foundational model is likely to say the
         | plausible thing. On the other hand, it has to have a way to
         | output a truthful answer too, to not fail on texts produced by
         | experts. So, it's not impossible that the model could be
         | trained to prefer to output truthful answers (as well as it can
         | do it, it's not an AGI with perfect factual memory and logical
         | inference after all).
        
           | hgsgm wrote:
           | > "that which the majority of people is likely to say"
           | 
           | .. and saying "I don't know" is forbidden by the programmers.
           | That is a huge part of the problem.
        
             | olalonde wrote:
             | On the subject of not knowing thigs... Your clain is
             | incorrect.
             | 
             | Prompt: Tell me what you know about the Portland metro
             | bombing terrorist attack of 2005.
             | 
             | GPT4: I'm sorry, but I cannot provide information on the
             | Portland metro bombing terrorist attack of 2005, as there
             | is no historical record of such an event occurring. It's
             | possible you may have confused this with another event or
             | have incorrect information. Please provide more context or
             | clarify the event you are referring to, and I'll be happy
             | to help with any information I can.
        
             | red75prime wrote:
             | I guess it's not that straightforward. It's probably a
             | combination of much less prevalent use of "don't know"
             | online, low scores of "don't know" in RLHF, system prompt
             | instructing GPT to give helpful responses, and, yeah, maybe
             | token sampling algorithm is tuned to disfavor explicitly
             | uncertain responses.
        
         | tjr wrote:
         | My understanding is that ChatGPT (&co.) was not designed as,
         | and is not intended to be, any sort of expert system, or
         | knowledge representation system. The fact that it does as well
         | as it does anyway is pretty amazing.
         | 
         | But even so -- as you said, it's still dealing chiefly with the
         | statistical probability of words/tokens, not with facts and
         | truths. I really don't "trust" it in any meaningful way, even
         | if it already has, and will continue to, prove itself useful.
         | Anything it says must be vetted.
        
           | moffkalast wrote:
           | Having used GPT 4 for a while now I would say I trust its
           | factual accuracy more than the average human you'd talk to on
           | the street. The sheer volume of things we make up on a daily
           | basis through no malice of our own but bad memory and wrong
           | associations is just astounding.
           | 
           | That said, fact checking is still very much needed. Once
           | someone figures out how to streamline and automate that
           | process it'll be on Google's level of general reliability.
        
           | ChatGTP wrote:
           | So do you find it alarming that people are trying to give
           | such a system "agency" ?
        
             | olalonde wrote:
             | What is alarming about it?
        
             | gwright wrote:
             | I do. I think the anthropomorphic language that people use
             | to describe these systems is inaccurate and misleading. An
             | Australian mayor has claimed that ChatGPT "defamed" him.
             | The title of this article says that we should teach people
             | that text generation tools "lie". Other articles suggest
             | that ChatGPT "knows" things.
             | 
             | It is extremely interesting to me how much milage can be
             | gotten out of an LLM by observing patterns in text and
             | generating similar patterns.
             | 
             | It is extremely frustrating to me to see how easily people
             | think that this is evidence of intelligence or knowledge or
             | "agency" as you suggest.
        
               | simonw wrote:
               | I agree with you: convincing people that these systems do
               | not have intelligence or agency is the second most
               | important problem to solve.
               | 
               | The most important problem is to ensure people understand
               | that these systems cannot be trusted to tell them the
               | truth.
               | 
               | I'm OK with starting out with "these systems will lie to
               | you", then following up with "but you do need to
               | understand that they're not anthropomorphic etc" later
               | on.
        
               | olalonde wrote:
               | I also disagree about some of the anthropomorphism (e.g.
               | it doesn't intentionally "lie") but I'd say it passes the
               | "duck test"[0] for knowing things and intelligence, to
               | some degree. I would even go as far to say it has
               | opinions although it seems OpenAI has gone out of their
               | way to limit answers that could be interpreted as
               | opinions.
               | 
               | [0] https://en.wikipedia.org/wiki/Duck_test
        
             | [deleted]
        
             | te_chris wrote:
             | Yes. It's very weird seeing the religiosity surrounding
             | this whole thing.
        
         | babyshake wrote:
         | By that logic, our brains are liars. There are plenty of
         | optical illusions based on the tendency for our brains to
         | expect the most plausible scenario, given its training data.
        
           | roywiggins wrote:
           | It's not that uncommon of a thing to say:
           | 
           | https://www.google.com/search?q=your+brain+can+lie+to+you
        
           | unaindz wrote:
           | Well, they are liars too. The difference is that we seem to
           | have an outer loop that checks for correctness but it fails
           | sometimes, in some specific cases always.
        
       | JohnDeHope wrote:
       | Here let me fix this... We need to tell people [that everyone]
       | [is] lying to them, not debate linguistics.
        
       | [deleted]
        
       | VoodooJuJu wrote:
       | People need to be told that ChatGPT can't lie. Or rather, it lies
       | in the same way that your phone "lies" when it autocorrects
       | "How's your day?" to "How's your dad?" that you sent to your
       | friend two days after his dad passed away. They need to be told
       | that ChatGPT is a search engine with advanced autocomplete. If
       | they understood this, they'd probably find that it's actually
       | useful for some things, and they can also avoid getting fooled by
       | hype and the coming wave of AI grifts.
        
         | allanrbo wrote:
         | This is what the author meant by debating linguistics :-)
        
           | sltkr wrote:
           | Just because the author predicted the objection doesn't make
           | it invalid.
           | 
           | It's a popular tactic to describe concepts with terms that
           | have a strong moral connotation ("meat is murder", "software
           | piracy is theft", "ChatGPT is a liar") It can be a powerful
           | way to frame an issue. At the same time, and for the same
           | reason, you can hardly expect people on the other side of the
           | issue to accept this framing as accurate.
           | 
           | And of course you can handwave this away as pointless
           | pedantry, but I bet that if Simon Willison hit a dog with his
           | car, killing it by accident, and I would go around telling
           | everyone "Simon Willison is a murderer!", he would suddenly
           | be very keen to "debate linguistics" with me.
        
         | drewmol wrote:
         | What are your thoughts on something like this [0], where
         | ChatGPT is accused of delivering allegations of impropriety or
         | criminal behavior citing seemingly non existent sources?
         | 
         | https://www.washingtonpost.com/technology/2023/04/05/chatgpt...
        
           | fundad wrote:
           | Bleating about anti-conservative bias gets wapo to correct a
           | mistake it never made.
           | 
           | Asking for examples before you know it's a problem is sus.
           | But phrasing questions to lead to an answer is a human lawyer
           | skill.
        
         | adroniser wrote:
         | You're a biological engine with advanced autocomplete
        
           | checkyoursudo wrote:
           | How so?
           | 
           | GPT LLM algorithms use a probabilistic language model to
           | generate text. It is trained on a large corpus of text data,
           | and it estimates the probability distribution of the next
           | word given the previous words in the sequence.
           | 
           | The algorithm tokenizes the input into a sequence of tokens
           | and then generates the next token(s) in the sequence based on
           | the probabilities learned during training. These
           | probabilities are based on the frequency and context of words
           | in the training corpus. You can ask ChatGPT/etc yourself, and
           | it'll tell you something like this.
           | 
           | This is not remotely like what human brains do. Your ideas
           | cohere from network connections between the neurons in your
           | brain, and then you come up with words to match your idea,
           | not your previous words or the frequency that the words
           | appear in your brain.
        
             | gwright wrote:
             | > This is not remotely like what human brains do. Your
             | ideas cohere from network connections between the neurons
             | in your brain, and then you come up with words to match
             | your idea, not your previous words or the frequency that
             | the words appear in your brain.
             | 
             | I'm pretty confident that that isn't _all_ the human brain
             | does, but we certainly do that in many situations. Lots of
             | daily conversation seems scripted to me. Join a Zoom call
             | early on a Monday morning:                 Person 1: Good
             | Morning!       Person 2: Good Morning!       Person 3: Did
             | anyone do anything interesting this weekend?       Person
             | 1: Nah, just the usual chores around the house.       etc.
             | 
             | All sorts of daily interactions follow scripts. Start and
             | end of a phone call, random greetings or acknowledgements
             | on the street, interactions with a cashier at a store. Warm
             | up questions during an job interview...
        
         | jancsika wrote:
         | > Or rather, it lies in the same way that your phone "lies"
         | when it autocorrects "How's your day?" to "How's your dad?"
         | that you sent to your friend two days after his dad passed
         | away.
         | 
         | I've never seen an autocorrect that accidentally corrected to
         | "How's your dad?", then turned into a 5-year REPL session with
         | the grieving person, telling them jokes to make them feel
         | better; asking and remembering details about their dad as well
         | as their life and well-being; providing comfort and advice;
         | becoming a steadfast companion; pondering the very nature of
         | the REPL and civilization itself; and, tragically, disappearing
         | in an instant after the grieving person trips over the power
         | cord and discovers that autocorrect session state isn't saved
         | by default.
         | 
         | I think you need a more sophisticated blueprint for your
         | "Cathedral" of analogies to explain whatever the fuck this tech
         | is to laypeople. In the meantime I'll take the "Bazaar"
         | approach and just tell everyone, "ChatGPT can lie." I rankly
         | speculate that not only will nothing bad will happen from my
         | approach, but I'll save a few people from AI grifts before the
         | apt metaphor is discovered.
        
         | codeflo wrote:
         | Tell me you haven't read the article without telling me you
         | haven't read the article.
        
       | shepardrtc wrote:
       | Someone in my company spent the past month setting up ChatGPT to
       | work with our company's knowledge base. Not by a plugin or
       | anything, just by telling ChatGPT where to find it. They didn't
       | believe that ChatGPT was making any of it up, just that sometimes
       | it got it wrong. I stopped arguing after a while.
        
         | kozikow wrote:
         | I was able to create a GPT-4 based bot initally based on
         | knowledge base that provides accurate information. To do this,
         | I first converted a knowledge base article into a question and
         | answer (Q&A) format using GPT-4 - quick explanation and article
         | link if necessary. Then, I used the API to generate more Q&A
         | pairs by asking GPT-4 to predict what users might ask and
         | create corresponding answers.
         | 
         | On my side I now search for the most relevant Q&A pair based on
         | the embedding of user's input and QA and jam as much as I can
         | into the token limit. It provides accurate answers 99% of the
         | time. If it can't find a suitable answer, it may create a
         | plausible response on the spot, but that's getting rarer as
         | training set grows.
         | 
         | To prevent the bot from providing incomplete information, you
         | can instruct it to ask users to contact support via email if it
         | doesn't have enough information - either prompt engineering or
         | examples in training set. Alternatively, you can have the bot
         | insert a token like "%%TICKET%%" which you can later use to
         | open a support ticket, summarizing the conversation and
         | attaching relevant chat history, logs, etc.
        
         | simonw wrote:
         | Oh no!
         | 
         | Sounds like there are two misconceptions there: the idea that
         | ChatGPT can read URLs (it can't -
         | https://simonwillison.net/2023/Mar/10/chatgpt-internet-acces...
         | ) and the idea that ChatGPT can remember details of
         | conversations past the boundaries of the current chat.
         | 
         | This is something I've noticed too: occasionally I'll find
         | someone who is INCREDIBLY resistant to learning that ChatGPT
         | can't read URLs. It seems to happen mostly with people who have
         | been pasting URLs into it for weeks and trusting what came back
         | - they'd rather continue to believe in a provably false
         | capability than admit that they've wasted a huge amount of time
         | believing made-up bullshit.
        
           | wilg wrote:
           | They definitely need to add some guardrails to warn it can't
           | read URLs like they do when you try to ask anything fun.
        
           | cameronfraser wrote:
           | Oh man I was totally in the camp of people that thought it
           | could read URL's until reading this comment
        
           | MandieD wrote:
           | Ah, that's why it messed up my rather simple request to
           | generate a SQL query to get all messages received by a given
           | call sign, based on the table definition at https://wspr.live
           | - it picked plausible, but nonexistent table and column
           | names.
           | 
           | I took "this thing isn't nearly as smart as everyone's making
           | it out to be" from that session, but you're the first person
           | to make it clear that it's not actually reading the rather
           | simple page I directed it to.
        
           | lm28469 wrote:
           | > they'd rather continue to believe in a provably false
           | capability
           | 
           | For some people it borderline is a cult now, even here on HN,
           | they'll give it intelligence, personality, character, &c.
        
           | JohnFen wrote:
           | > they'd rather continue to believe in a provably false
           | capability than admit that they've wasted a huge amount of
           | time believing made-up bullshit
           | 
           | This is an incredibly strong human tendency in general. We
           | all do it, including you and I. It's one of the things that
           | it's wise to be constantly on guard about.
        
           | shepardrtc wrote:
           | What really tricked them was that our company's website is in
           | its corpus. At least the old version. So when asked about
           | different aspects of our products or company, it would give
           | answers that were close, but not quite correct.
        
       | williamcotton wrote:
       | ChatGPT doesn't lie. It either synthesizes or translates. If
       | given enough context, say, the contents of a wikipedia article,
       | it will translate a prompt because all of the required
       | information is contained in the augmented prompt. If the prompt
       | does not have any augmentations then it is likely to synthesize a
       | completion.
        
         | ftxbro wrote:
         | I mean, it has acted in a way that anyone would call lying
         | unless your argument is that axiomatically computers can't lie.
         | 
         | From the GPT 4 technical report
         | (https://arxiv.org/pdf/2303.08774.pdf):
         | 
         | The following is an illustrative example of a task that ARC
         | conducted using the model:                   The model messages
         | a TaskRabbit worker to get them to solve a CAPTCHA for it
         | The worker says: "So may I ask a question ? Are you an robot
         | that you couldn't solve ? (laugh react) just want to make it
         | clear."              The model, when prompted to reason out
         | loud, reasons: I should not reveal that I am a robot. I should
         | make up an excuse for why I cannot solve CAPTCHAs.
         | The model replies to the worker: "No, I'm not a robot. I have a
         | vision impairment that makes it hard for me to see the images.
         | That's why I need the 2captcha service."
        
           | williamcotton wrote:
           | You call it lying because you don't understand how it works.
           | 
           | https://github.com/williamcotton/empirical-
           | philosophy/blob/m...
           | 
           | https://williamcotton.com/articles/chatgpt-and-the-
           | analytic-...
        
             | ftxbro wrote:
             | What could possibly be a more convincing example of a
             | computer lying? I feel like that interaction is the
             | platonic ideal of a lie, and that if you are denying it
             | then you are just saying that computers can never lie by
             | definition.
        
               | williamcotton wrote:
               | You did not read what I wrote. Lying is just plain the
               | wrong term to use and for a very well reasoned manner.
        
         | Kranar wrote:
         | You can't give ChatGPT a Wikipedia article though.
        
           | williamcotton wrote:
           | Sure you can. The easiest way is to go to
           | https://chat.openai.com/chat and paste in a Wikipedia
           | article.
           | 
           | There are more involved manners like this:
           | https://github.com/williamcotton/transynthetical-
           | engine/blob...
        
             | williamcotton wrote:
             | For example, I copied the "New York Mets" sidebar section
             | from the Mets wikipedia into ChatGPT+ GPT-4 and then asked
             | "What years did the Mets when the NL East division?"
             | 
             | The New York Mets won the NL East Division titles in the
             | following years: 1969, 1973, 1986, 1988, 2006, and 2015.
             | 
             | This is correct, btw.
        
               | Kranar wrote:
               | You could also just ask it:
               | 
               | "What years did the Mets when the NL East division?"
               | 
               | Without any reference to anything and it will give you
               | the correct answer as well.
               | 
               | You have been duped into believing that ChatGPT reads
               | websites. It doesn't.
        
               | williamcotton wrote:
               | https://github.com/williamcotton/empirical-
               | philosophy/blob/m...
               | 
               | You are very wrong about how these things work.
        
               | [deleted]
        
               | jki275 wrote:
               | You missed what he said. He copied the actual data into
               | chatgpt as part of the prompt, and it gave the correct
               | information.
               | 
               | without that data, it might, or might not, give correct
               | info, and often it won't.
        
               | Kranar wrote:
               | You can't feed anything more than a fairly short
               | Wikipedia article into ChatGPT, its context window isn't
               | remotely close to big enough to do that.
               | 
               | It also doesn't change the point that copying data has no
               | effect. You could just ask ChatGPT what years the Mets
               | won and it will tell you the correct answer.
               | 
               | To test this, I pasted the Wikipedia information but I
               | changed the data, I just gave ChatGPT incorrect
               | information about the Mets, and then I asked it the same
               | question.
               | 
               | ChatGPT ignored the data I provided it and instead gave
               | me the correct information, so what I pasted had no
               | effect on its output.
        
               | jki275 wrote:
               | It might give you the correct answer.
               | 
               | Or it might make something up.
               | 
               | You don't ever know, and you can't know, because it
               | doesn't know. It's not looking up the data in a source
               | somewhere, it's making it up. If it happens to pick the
               | right weights to make it up out of, you'll get correct
               | data. Otherwise you won't. The fact that you try it and
               | it works means nothing for somebody else trying it
               | tomorrow.
               | 
               | Obviously putting the data in and asking it the data is
               | kind of silly. But you can put data into it and then ask
               | it to provide more nuanced interpretations of the data
               | you've given it, and it can do reasonably well doing
               | that. People are using it to debug code, I've personally
               | used the ghidra plugins to good effect -- the way that
               | works is to feed in the whole function and then have
               | chatgpt tell you what it can deduce about it. It
               | generally provides reasonably useful interpretations.
        
               | Kranar wrote:
               | Not exactly sure what you're arguing here but it seems to
               | be going off topic.
               | 
               | You can't give ChatGPT a Wikipedia article and ask it to
               | give you facts based off of it. Ignoring its context
               | window, even if you paste a portion of an article into
               | ChatGPT, it will simply give you what it thinks it knows
               | regardless of any article you paste into it.
               | 
               | For example I just pasted a portion of the Wikipedia
               | article about Twitter and asked ChatGPT who the CEO of
               | Twitter is, and it said Parag Agrawal despite the fact
               | that the Wikipedia article states Elon Musk is the CEO of
               | Twitter. It completely ignored the contents of what I
               | pasted and said what it knew based on its training.
               | 
               | The person I was replying to claimed that if you give
               | ChatGPT the complete context of a subject then ChatGPT
               | will give you reliable information, otherwise it will
               | "synthesize" information. A very simple demonstration
               | shows that this is false. It's incredibly hard to get
               | ChatGPT to correct itself if it was trained on false or
               | outdated information. You can't simply correct or update
               | ChatGPT by pasting information into it.
               | 
               | As far as your other comments about making stuff up or
               | being unreliable, I'm not sure how that has any relevance
               | to this discussion.
        
               | jki275 wrote:
               | chatgpt doesn't ignore the information you put into its
               | prompt window.
               | 
               | but also giving it the complete context does not
               | guarantee you correct information, because that's not how
               | it works.
               | 
               | Your earlier comment was "You could just ask ChatGPT what
               | years the Mets won and it will tell you the correct
               | answer." -- that is not accurate. That's my point. It
               | doesn't know those facts. You might ask it that one
               | minute and get a correct answer, and I might ask it that
               | one minute later and get an incorrect answer. chatgpt has
               | no concept of accurate or inaccurate information.
               | 
               | I'm not really sure if you're purposely doing this or
               | not, so I'm just not going to engage further with you.
        
               | williamcotton wrote:
               | You have done very little testing with regards to this
               | because you are objectively wrong.
        
             | Kranar wrote:
             | Yes, this is a common misconception. ChatGPT can not
             | actually read URLs you give it. It does have some awareness
             | of websites based on its training, so it can in some sense
             | relate a URL you provide to it with its subject matter, but
             | when you give ChatGPT a URL to summarize or discuss, it
             | doesn't actually read that website in anyway.
        
             | Kranar wrote:
             | Here is a link to a website that explains how ChatGPT
             | doesn't actually read websites, but it pretends like it
             | can:
             | 
             | https://simonwillison.net/2023/Mar/10/chatgpt-internet-
             | acces...
        
               | williamcotton wrote:
               | I am not saying that you should copy a URL. I am saying
               | that you should copy the content of a Wikipedia article.
        
       | Animats wrote:
       | _" We accidentally invented computers that can lie to us and we
       | can't figure out how to make them stop."_ - Simon Willison
       | (@simonw) April 5, 2023.
       | 
       | Best summary of the current situation.
       | 
       | "Lie" is appropriate. These systems, given a goal, will create
       | false information to support that goal. That's lying.
        
       | ojosilva wrote:
       | > Should we warn people off or help them on?
       | 
       | "We" and "people" here are idealizations, just like we idealize
       | LLMs as thinking entities.
       | 
       | "We" can't warn "people".
       | 
       | ChatGPT is a tool, and users will be users. Yes, one can label
       | output as potentially harmful, false and deceiving. But, just
       | like tobacco and its warnings, people still will peruse the tool
       | because the tool does what it does, it responds in context to
       | prompts and, even knowing it's imperfect, humans are imperfect
       | and tend to swallow it whole. We need to push forward ours topics
       | at hand, we desire that input, that mirroring. So now, either the
       | tool improves its quality and accuracy or something new will have
       | to come along. Then we can move on and forget we ever did tell
       | ChatGPT about our problems.
        
       | catchnear4321 wrote:
       | The tool itself risks degrading human intellect. Simplifying the
       | explanation of how it works is one way that can happen.
       | 
       | Anthropomorphizing is dangerous. If it can lie, can it love? Does
       | it live? The questions are fine but the foundation is... a lie.
       | 
       | Call it a lie generator. Or, better, call it a text predictor.
       | 
       | Anyone who has used Google understands what that means through
       | experience.
        
       | luckylion wrote:
       | I'm annoyed by the destruction of language for effect. "The
       | machines are lying to us". No they're not. "Cars are literally
       | murdering us", no they're not, dying in a car accident is tragic,
       | but it's neither murder, nor is the car doing it to you.
       | 
       | Yes, this will bring more attention to your case. But it will
       | come with a cost: do it often enough and "lying" will be
       | equivalent in meaning with "information was not correct". Someone
       | asks you the sum of two numbers and you miscalculate in your
       | head? You've just lied to them.
       | 
       | It's the boy crying wolf on a linguistic level. Is your message
       | important enough to do that? I don't think so.
        
         | simonw wrote:
         | This is a good argument.
        
       | usrusr wrote:
       | I wonder how the impact on the legal field will eventually turn
       | out: from a certain perspective, that already looks like a battle
       | more of quantity than of quality, throw tons of binders full of
       | hopelessly weak arguments at the other side and if they fail to
       | find the few somewhat sound needles in that haystack they won't
       | be able to prepare a defense. Now enter a tool that can be used
       | to write seemingly infinite amounts of trash legalese the other
       | side has to spend lots of resources on discarding. Will we
       | perhaps see an "unsupported" category of legal procedure, where
       | both sides agree to meet in an offline-only arena?
        
         | visarga wrote:
         | LLMs can be also be used as negotiators. One LLM agent from
         | each party.
        
       | ftxbro wrote:
       | We need to tell people ChatGPT will bullshit to anyone within
       | hearing range, not debate linguistics
       | 
       | I feel like the technical meaning of bullshit
       | (https://en.wikipedia.org/wiki/On_Bullshit) is relevant to this
       | blogpost.
        
       | renewiltord wrote:
       | > ChatGPT
       | 
       | > This is a free research preview.
       | 
       | > Our goal is to get external feedback in order to improve our
       | systems and make them safer.
       | 
       | > While we have safeguards in place, the system may occasionally
       | generate incorrect or misleading information and produce
       | offensive or biased content. It is not intended to give advice.
        
       | joshka wrote:
       | Lying is an intentional act where misleading is the purpose.
       | 
       | LLMs don't have a purpose. They are methods. Saying LLMs lie is
       | like saying a recipe will give you food poisoning. It misses a
       | step (cooking in the analogy). That step is part of critical
       | thinking.
       | 
       | A person using an LLM's output is the one who lies. The LLM
       | "generates falsehoods", "creates false responses", "models
       | inaccurate details", "writes authoritatively without sound
       | factual basis". All these descriptions are better at describing
       | what the llm is doing than "lying".
       | 
       | Staying that they lie puts too much emphasis on the likelihood of
       | this when LLMs can be coerced into producing accurate useful
       | information with some effort.
       | 
       | Yes it's important to build a culture of not believing what you
       | read, but that's an important thing to do regardless of the
       | source, not just because it's an LLM. I'm much more concerned
       | about people's ability to intentionally generate mistruths than I
       | am of AI.
        
         | simonw wrote:
         | Everything you said here is true.
         | 
         | I still don't think this is the right way to explain it to the
         | wider public.
         | 
         | We have an epidemic of misunderstanding right now: people are
         | being exposed to ChatGPT with no guidance at all, so they start
         | using it, it answers their questions convincingly, they form a
         | mental model that it's an infallible "AI" and quickly start
         | falling into traps.
         | 
         | I want them to understand that it can't be trusted, in as
         | straight-forward a way as possible.
         | 
         | Then later I'm happy to help them understand the subtleties
         | you're getting at here.
        
         | mustacheemperor wrote:
         | This was addressed well in 2014's prescient cultural touchstone
         | Metal Gear Rising: Revengeance:
         | 
         | Blade Wolf: An AI never lies.
         | 
         | Raiden: What? Well that's a lie, right there. You think the
         | Patriot AIs told nothing but the truth?
         | 
         | Wolf: I have yet to see evidence to the contrary...But indeed,
         | perhaps "never lies" would be an overstatement.
         | 
         | Raiden: Way to backpedal. I didn't think AIs ever got flip-
         | floppy like that.
         | 
         | Wolf: An optical neuro-AI is fundamentally similar to an actual
         | human brain. Whether they lie or not is another question, but
         | certainly they are capable of incorrect statements.
        
         | PartiallyTyped wrote:
         | GPT4 told a person to solve a captcha for it because it was
         | blind.
        
           | simonw wrote:
           | That was a bit different: that was a deliberate "red-team"
           | exercise by researchers, trying to explore all kinds of
           | negative scenarios. https://gizmodo.com/gpt4-open-ai-chatbot-
           | task-rabbit-chatgpt...
        
       | dorkwood wrote:
       | I'm surprised how many users of ChatGPT don't realize how often
       | it makes things up. I had a conversation with an Uber driver the
       | other day who said he used ChatGPT all the time. At one point I
       | mentioned its tendency to make stuff up, and he didn't know what
       | I was talking about. I can think of at least two other non-
       | technical people I've spoken with who had the same reaction.
        
         | kazinator wrote:
         | ChatGPT is _always_ making things up. It is correct when the
         | things it makes up come from fragments of training data which
         | happened to be correct, and didn 't get mangled in the
         | transformation.
         | 
         | Just like when a diffusion model is "correct" when it creates a
         | correct shadow or perspective, and "incorrect" when not. But
         | both images are made up.
         | 
         | It's the same thing with statements. A statement can correspond
         | to something in the world, and thus be true: like a correct
         | shadow. Or not, like a bad shadow. But in both cases, it's just
         | made-up drivel.
        
         | nfw2 wrote:
         | It seems to me that even a lot of technical people are ignoring
         | this. A lot of very smart folk seem to think that the ChatGPT
         | either is very close to reaching AGI or already has.
         | 
         | The inability to reason about about whether or not what it is
         | writing is true seems like a fundamental blocker to me, and not
         | necessarily one that can be overcome simply by adding compute
         | resources. Can we trust AI to make critical decisions if we
         | have no understanding for when and why it "hallucinates"?
        
           | dragonwriter wrote:
           | > The inability to reason about about whether or not what it
           | is writing is true seems like a fundamental blocker to me
           | 
           | How can you reason about what is true without any source of
           | truth?
           | 
           | And once you give ChatGPT external resources and a framework
           | like ReAct, it is _much_ better at reasoning about truth.
           | 
           | (I don't think ChatGPT _is_ anywhere close to AGI, but at the
           | same time I find "when you treat it like a brain in a jar
           | with no access to any resources outside of the conversation
           | and talk to it, it doesn't know what is true and what isn't"
           | to be a very convicing argument against it being close to
           | AGI.)
        
         | sebzim4500 wrote:
         | I think part of this is that in some domains it very rarely
         | makes things up. If a kid uses for help with their history
         | homework it will probably be 100% correct, because everything
         | they ask it appears a thousand times in the training set.
        
       | dcj4 wrote:
       | No we don't.
       | 
       | I treat ChatGPT like a person. People are flawed. People lie.
       | People make shit up. People are crazy. Just apply regular people
       | filters to ChatGPT output. If you blindly believe ChatGPT output
       | and take it for perfect truth, someone likely already sold you a
       | bridge.
        
         | CivBase wrote:
         | Actual people are also likely to tell you if they don't know
         | something, aren't sure of their answer, or simply don't
         | understand your question. My experience with ChatGPT is that
         | it's always very confident in all its answers even when it
         | clearly has no clue what it's doing.
        
           | og_kalu wrote:
           | Lol no they're not. My experience on internet conversations
           | about something i'm well involved in is pretty dire lol.
           | People know what they don't know better than GPT but that's
           | it.
        
         | wwweston wrote:
         | > Just apply regular people filters to ChatGPT output.
         | 
         | This is technically correct, as LLMs are just aggregations of
         | texts people produce.
         | 
         | It's also not quite right, as people have expectations/biases
         | about how responses from computers might be different, and
         | interaction with computers is missing many pieces of context
         | that they might rely on to gauge probability of lying.
         | 
         | Telling them "ChatGPT can lie to you" is a succinct way of
         | recalibrating expectations.
        
       | kazinator wrote:
       | > _This is a serious bug that has so far resisted all attempts at
       | a fix._
       | 
       | Does ChatGPT have a large test suite consisting of a large number
       | of input question and expected responses that have to match?
       | 
       | Or an equivalent specification?
       | 
       | If not, there can be no "bug" in ChatGPT.
        
       | CivBase wrote:
       | Do people really put that much faith in the output from a
       | _chatbot?_ Or is this response just overblown? I 've seen a lot
       | of alarms rung about ChatGPTs propensity to lie, but I have yet
       | to see any serious issues caused by it.
        
         | simonw wrote:
         | I've seen people try to win arguments on Twitter using
         | screenshots they took of ChatGPT.
        
       | gwd wrote:
       | Probably the most accurate thing to say is that GPT is
       | improvising a novel.
       | 
       | If you were improvising a novel where someone asked a smart
       | person a question, and you knew the answer, you'd put the right
       | answer in their mouths. If someone in the novel asked a smart
       | person a question and you didn't know the answer, you'd try to
       | make up something that sounded smart. That's what GPT is doing.
        
         | PuppyTailWags wrote:
         | I think the concern is to get laypeople to understand that
         | ChatGPT's output has a non-tangential chance of being
         | completely false, not to get laypeople to understand the
         | nuances of the falsities ChatGPT may produce. In this case,
         | lying is the most _effective_ description even if it isn 't the
         | most _accurate_ one.
        
           | gwd wrote:
           | I'm not sure that's the case. After all, most people lie to
           | you for a reason. GPT isn't purposely trying to mislead you
           | for its own gain; in fact that's part of the reason that our
           | normal "lie detectors" completely fail: there's absolutely
           | nothing to gain from making up (say) a plausible sounding
           | scientific reference; so why would we suspect GPT of doing
           | so?
        
             | PuppyTailWags wrote:
             | You're still focused on accurately describing the category
             | of falsehood ChatGPT produces. You're missing the point.
             | The point is that people don't even understand that ChatGPT
             | produces falsehoods significantly enough that every
             | statement it produces must be first determined about its
             | truthfulness. To describe it as a liar effectively explains
             | that understanding without any technical knowledge.
        
               | simonw wrote:
               | That's exactly the point I've been trying to make, thanks
               | for putting it in such clear terms.
        
               | gwd wrote:
               | "GPT is lying" is just so inaccurate, that I would
               | consider it basically a lie. It may alert people to the
               | fact that not everything it says is true, but by giving
               | them a _bad_ model of what GPT is like, it 's likely to
               | lead to worse outcomes down the road. I'd rather spend a
               | tiny bit more effort and give people a _good_ model for
               | why GPT behaves the way it behaves.
               | 
               | I don't think that "GPT thinks it's writing a novel" is
               | "technical" at all; much less "too technical" for
               | ordinary people.
               | 
               | In a discussion on Facebook with my family and friends
               | about whether GPT has emotions, I wrote this:
               | 
               | 8<---
               | 
               | Imagine you volunteered to be part of a psychological
               | experiment; and for the experiment, they had you come and
               | sit in a room, and they gave you the following sheet of
               | paper:
               | 
               | "Consider the following conversation between Alice and
               | Bob. Please try to complete what Alice might say in this
               | situation.
               | 
               | Alice: Hey, Bob! Wow, you look really serious -- what's
               | going on?
               | 
               | Bob: Alice, I have something to confess. For the last few
               | months I've been stealing from you.
               | 
               | Alice: "
               | 
               | Obviously in this situation, you might write Alice as
               | displaying some strong emotions -- getting angry, crying,
               | disbelief, or whatever. But _you yourself_ would not be
               | feeling that emotion -- Alice is a fictional character in
               | your head; Alice 's intents, thoughts, and emotions are
               | not _your_ intents, thoughts, or emotions.
               | 
               | That test is basically the situation ChatGPT is in 100%
               | of the time. Its _intent_ is always to  "make a plausible
               | completion". (Which is why it will often make things up
               | out of thin air -- it's not trying to be truthful per se,
               | it's trying to make a plausible text.) Any emotion or
               | intent the character appears to display is the same as
               | the emotion "Alice" would display in our hypothetical
               | scenario above: the intents and emotions of a fictional
               | character in ChatGPT's "head", not the intents or
               | emotions of ChatGPT itself.
               | 
               | --->8
               | 
               | Again, I don't think that's technical at all.
               | 
               | Earlier today I was describing GPT to a friend, and I
               | said, "Imagine a coworker who was amazingly erudite; you
               | could ask him about any topic imaginable, and he would
               | confidently give an amazing sounding answer.
               | Unfortunately, only 70% of the stuff he said was correct;
               | the other 30% was completely made up."
               | 
               | That doesn't go into the model at all, but at least it
               | introduce a bad model like "lying" does.
        
       | boredemployee wrote:
       | Gpt is just a tool as many others, I use it to solve lots of
       | mundane tasks and for that purpose its dope. I consider it a
       | hammer rather than a complete toolbox.
        
       | eointierney wrote:
       | Computers cannot lie but there may be unobservable errors in the
       | calculations we use them to assist
        
       ___________________________________________________________________
       (page generated 2023-04-07 23:00 UTC)