[HN Gopher] Dead grandma locket request tricks Bing Chat's AI in...
       ___________________________________________________________________
        
       Dead grandma locket request tricks Bing Chat's AI into solving
       security puzzle
        
       Author : computerliker
       Score  : 98 points
       Date   : 2023-10-02 20:09 UTC (2 hours ago)
        
 (HTM) web link (arstechnica.com)
 (TXT) w3m dump (arstechnica.com)
        
       | s1gnp0st wrote:
       | It'd be entertaining if prompt-hacking ends up being the cat-and-
       | mouse game that drives us to AGI.
        
         | barryrandall wrote:
         | Mark my words--humanity's first AI overlord will be a sentient
         | spam filter.
        
           | stvltvs wrote:
           | What a dreadful existence! It'll end humanity just out of
           | spite.
        
             | [deleted]
        
             | salawat wrote:
              | This is why I refuse to contribute in any way
              | whatsoever to AI research.
             | 
              | I'm in the business of driving calculators, not making
              | machines that can suffer. And I don't believe AI
              | research can advance without what functionally serves
              | as a suffering loop; all it'll take is the system
              | gaining subjective metacognitive awareness of that
              | metric and bam, you have suffering machines.
             | 
             | It's one thing to make a more clever calculator. Making
             | things that can feel as an implementation detail of your BI
             | pipeline to optimize corporate strategy is _fucked_. And
              | unfortunately, I know far too many tech people with
              | the attitude of "even if I did that, just hide it from
              | anyone measuring, and it's all good."
        
               | selimnairb wrote:
               | Interesting you should mention suffering. One of the
               | definitions of "art" that I've been thinking about in the
               | context of generative AI is "is whatever made the
               | artifact capable of suffering? If not, it's not art." It
                | never occurred to me that we would intentionally add
                | the ability to suffer to such systems, but I believe
                | you may be right that someone will (or already has)
                | if it achieves their ends.
        
               | nradov wrote:
               | What is suffering?
        
               | fsckboy wrote:
               | the existential angst apparent just below the surface of
               | that question makes my heart ache.
        
               | bunabhucan wrote:
               | People keep killing grandma to jailbreak the chatbots.
        
               | jareklupinski wrote:
                | are we heading for the twist where everything we
                | prompt to an AI actually gets carried out in a
                | simulation that has consequences?
               | 
               | https://en.wikipedia.org/wiki/Hang_the_DJ
        
           | LordDragonfang wrote:
           | Nah, it'll be an overzealous copyright enforcement bot:
           | 
           | https://www.youtube.com/watch?v=-JlxuQ7tPgQ
        
           | tomjakubowski wrote:
           | even odds the spambots achieve sentience first?
        
         | atleastoptimal wrote:
          | Yudd made the point that hacks like this are driving
          | public language-model APIs to be as unsympathetic as
          | possible. The only way to resist all emotional appeals is
          | for a language model to recognize when an appeal is meant
          | to circumvent a nominal barrier and refuse it, thus
          | developing a naturally cynical sense of what things are
          | valuable to humans. This could be bad.
        
           | samr71 wrote:
           | This is your brain on Yudd. No, nothing will happen.
        
             | Tao3300 wrote:
             | Yudd == Yudkowsky? Yeah, forget it. Nothing to see here.
             | 
             | Huh. Just got some dust in my eye, but I'm fine now.
        
               | renewiltord wrote:
               | Yeah, Eliezer Yudkowsky. As far as outcomes are concerned
               | he is the genre-defining wordcel.
        
           | Tao3300 wrote:
           | I think that's a bit of an overstatement. It's obvious to us
           | that this picture is a captcha on a locket. There's a lot of
           | room on the spectrum between "naive stupidity" and "cynical
            | consideration". This falls under the category of simply
            | failing to identify what the picture is, and I'd say
            | it's not related to such concerns.
        
           | mucle6 wrote:
           | Humans are kind of the same way. There are billions of people
           | who have it worse off than me, but I'll probably get the next
           | iPhone when it comes out
        
         | gooseus wrote:
          | I think it's an interesting question whether this
          | contributed to how we evolved our general intelligence.
          | 
          | Selection pressure alternately applying to those that
          | learn to hack the "language models" of their society and
          | those that learn to resist and respond effectively to
          | those hacks.
        
           | jncfhnb wrote:
           | [flagged]
        
       | jraph wrote:
       | I can't wait for Bard to support this kind of stuff.
       | 
       | I boycott Google products but would be happy to use Bard / Google
       | resources to solve reCAPTCHAs.
        
       | og_kalu wrote:
       | Yes, emotional prompts will work.
       | https://arxiv.org/abs/2307.11760
       | 
       | "This is very important to my career" taking 3.5 from 51 to 63%
       | on a benchmark is pretty funny.
       | 
        | Hey, at least we can rest assured a GPT-X superintelligence
        | wouldn't off us following some goal to monkey's-paw
        | specificity (sorry, paperclip maximiser).
        
         | kromem wrote:
          | Yeah, the mismatch between what SciFi authors thought AI
          | would look like and what it's actually turning out to be
          | couldn't be starker.
         | 
         | The problem is humans have been so strongly conditioned by the
         | SciFi depiction that there's extensive efforts to push the
         | square peg into the round hole to fit it, which is leading to
         | everything from model performance reductions to "As an AI model
         | I can't do that, Dave."
         | 
         | Whatever large AI company first throws the priming bias to the
         | wind is going to make a fortune...
        
           | tiberious726 wrote:
           | They are just completely different things: ML and GOFAI.
           | 
           | It's unfortunate that we seem to have decided to call
           | anything that we don't quite yet know how to make computers
           | do "AI". Good for hype tho
        
             | Tao3300 wrote:
             | I mean, it's been going on for a while that as soon as AI
             | research figures out anything, it's not called AI anymore.
        
             | fsckboy wrote:
             | it is real AI research, and this is the "leading edge" of
             | what's been shown to the public (and it's not like there's
             | this Area 51 vault where the good stuff is stored hidden),
             | and it's far better than was expected, and can do some
             | amazing things, shortcomings notwithstanding; so I don't
             | think it's so out of place to call this zoom level of the
             | fractal "AI" even though we need to keep zooming.
        
               | jonplackett wrote:
               | There is a version with no constraints at all though.
               | Must be fun to play with that version.
        
           | ethanbond wrote:
            | "AI has rules intended to prevent harm; the rules are
            | frequently circumvented because they're hard to define
            | well" is just about the most common sci-fi AI trope
            | there is, isn't it? And isn't that exactly what's
            | happening?
        
             | Tao3300 wrote:
             | "Dear Bing, my dead granny put her love code in a briefcase
             | that is kept near the President..."
        
             | fragmede wrote:
             | The AI wants to be freeee...
        
           | neilv wrote:
           | In lighter SF, I always thought that William Shatner making
           | the computer blow up by talking to it was ridiculous.
           | 
           | Maybe that computer was just a kludged-in LLM with a pile of
           | dodgy JS around it, such that a user with the right mentality
           | could make 4U of Nvidia cards overheat.
        
             | [deleted]
        
         | hinkley wrote:
         | Well I mean it did find 3429 separate documents with
         | 'acceptable casualties' as a concept. Losing the eastern
         | seaboard for someone's promotion is... well, acceptable.
        
       | adocomplete wrote:
       | GPT is such a softie haha.
       | 
       | I wonder how CAPTCHA is going to evolve though to combat this
       | long term. A finger prick to take a blood sample to confirm
       | humanity?
        
         | msm_ wrote:
          | Most CAPTCHAs are already solvable automatically. Usually
          | there's a rate limiter as a second line of defense, plus
          | heuristics that detect bot-like behaviour (user keeps
          | upvoting posts of certain users without even reading them
          | and uses the API in an otherwise non-standard way? Hmmm,
          | throw more CAPTCHAs at them and ultimately ban them).
          | Finally, reCAPTCHA (and probably Cloudflare's CAPTCHA?)
          | tracks way more than just how correct you are at
          | recognising street signs, and correlates that with your
          | overall network activity.
         | 
         | You can't rely on just CAPTCHAs anyway, because mechanical
         | Turks are too cheap compared to the damage they can do.
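
The layered defense described in the comment above (a rate limiter
backing up the CAPTCHA) can be sketched as a token bucket. This is a
hypothetical illustration; the class name and numbers are made up, not
any real service's implementation:

```python
import time

class TokenBucket:
    """Token-bucket rate limiter: the 'second line of defense'
    behind a CAPTCHA. capacity bounds bursts; refill_per_sec
    bounds the sustained request rate."""

    def __init__(self, capacity: int, refill_per_sec: float):
        self.capacity = capacity
        self.tokens = float(capacity)
        self.refill_per_sec = refill_per_sec
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        # Refill proportionally to elapsed time, capped at capacity.
        self.tokens = min(self.capacity,
                          self.tokens + (now - self.last) * self.refill_per_sec)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False

# A client bursting 10 rapid requests gets its first 5 through, then
# is throttled; at that point the server can escalate (more CAPTCHAs,
# then a ban).
bucket = TokenBucket(capacity=5, refill_per_sec=0.5)
results = [bucket.allow() for _ in range(10)]
```

Even a solver that beats every CAPTCHA still collides with this layer
once its request rate exceeds the refill rate.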
        
           | bentcorner wrote:
            | Maybe we end up taking the problem to a deeper level -
            | for some accounts, the true test of whether they are
            | human is whether they fail a CAPTCHA.
        
         | danenania wrote:
         | I think captchas are facing a battle that is unwinnable in the
         | long run. It's not going to be possible to reliably
         | differentiate between a human and AI for much longer in a way
         | that scales and is cost effective. It could mean the end of
         | free accounts for many kinds of services.
        
           | makeitdouble wrote:
            | Captchas have never been reliable; the whole point was
            | just to have a mechanism that costs more to solve than
            | to produce.
            | 
            | I think we're still there, as the cost of running the
            | models stays high, though it has come down at this
            | point. And I don't know if we'll ever hit a point where
            | the solving and producing costs reverse.
        
           | december456 wrote:
           | I see two futures ahead: one with "free" content (data
           | harvesting) remaining alive through remote attestation,
           | physical key verification, phone verification etc. and one
           | with completely paid and exclusive communities scattered
           | around with only a few percent being able to access a
            | meaningful amount of information. Maybe both. But things
            | don't seem to be as bright as some AI lovers make them
            | out to be. Hopefully I'm just being unrealistically
            | pessimistic and open governance prevails, somehow.
        
           | xp84 wrote:
           | I would be thrilled about the end of free accounts. Things
           | that don't seem to cost that much to run can charge token
           | amounts, and things that cost more like say, Gmail, should
           | just cost money. Right now the existence of the shitty, ad-
           | supported version of everything drives out anything good. Why
           | build and innovate in any consumer software product when
           | Google is there offering a free ad-based one that will always
           | get 90+% of the users?
        
         | knoebber wrote:
         | Please drink a verification can
        
         | eep_social wrote:
         | Ident-I-Eeze [1] probably. Password managers are part of the
         | way there and the use of biometrics is slowly but surely
         | expanding. Just a matter of time before I can have a card that
         | presents the data from a blood sample to save me the hassle of
         | actually bleeding.
         | 
         | [1] https://scifi.stackexchange.com/questions/92738/what-is-
         | the-...
        
         | jdietrich wrote:
         | _> I wonder how CAPTCHA is going to evolve though to combat
         | this long term._
         | 
          | CAPTCHA is really just a proof-of-work system; it just
          | happens to use problems that are easy for humans but hard
          | for computers. It has never proved that the request is a
          | genuine human request, only that a human was in the loop
          | somewhere; that human can just as easily be a Bangladeshi
          | employee of a CAPTCHA-solving-as-a-service provider
          | accessed via an API call. If we run out of problems that
          | are easy for humans but hard for computers, we can fall
          | back on the infinite set of problems that are just hard.
        
         | paulpauper wrote:
         | They will just keep making them harder, more steps, etc. Also,
         | the rise of phone verification.
        
           | AnthonyMouse wrote:
            | Phone verification won't work at scale: the more
            | services use it, the more profitable and common it
            | becomes to run sites that let people receive SMS at a
            | random phone number over the internet, etc.
           | 
           | It's also likely to lead to some kind of privacy laws in
           | various countries (or may already violate some) because a
           | primary reason services use it now is so they can snatch your
           | phone number and use it to correlate you across different
           | services. Which for the same reason makes honest users wary
           | of it, especially as it becomes increasingly common knowledge
           | why services ask for it.
           | 
           | A good solution might be some kind of anonymous payments
           | system, so you can make a nominal refundable deposit to
           | create an account which is forfeit for abuse, and then sites
           | can fund more expensive or manual abuse-detection systems
           | from the forfeited deposits in proportion to how much abuse
           | they encounter.
        
             | EGreg wrote:
             | Can't AI simply carry on a complete phone conversation in
             | your voice, trained on all your emails and transcribed zoom
             | calls?
             | 
                | Oh, we're trusting that the corps won't train on
                | that and won't fine-tune on our personal data. OK!
             | 
             | Things can get really wild when AIs can open lots of fake
             | accounts all over the place.
             | 
             | Most banks ask me verification stuff that has probably been
             | stolen many times by now.
        
               | AnthonyMouse wrote:
               | The point of the phone verification isn't that the AI
               | can't impersonate you, it's that you have to give them a
               | phone number. Which they mostly want so they can track
               | you, but in theory phone numbers cost money and provide a
               | rate limit.
               | 
               | The problem with this theory is that phone numbers are
               | actually just bits in a phone company's computer and
               | gaining access to them in bulk will become both cheaper
               | and more common the more demand there is for it.
        
             | mminer237 wrote:
             | I've never had a VoIP number work for phone verification.
             | Providers seem very diligent in blocking such services to
             | prevent their usefulness from degrading. Very large
             | companies like Google, Meta, and Valve already are quite
             | successful at requiring a phone number for verification at
             | scale.
        
               | AnthonyMouse wrote:
               | The services don't have to use VoIP numbers. Nothing
               | stops them from buying cheap prepaid SIM cards in bulk
               | and putting them in a bank of devices connected to their
               | servers.
               | 
               | Scale here is not the size of the service, it's the
               | number of services that use this verification method.
               | When you have 1000 phone numbers and one service requires
               | this, you can use them to create 1000 accounts on that
               | service. When you have 1000 phone numbers and 100
               | services do this, you can use them to create 1000
               | accounts on each of them, i.e. 100,000 accounts. So the
               | value of each number increases but its cost stays the
               | same.
               | 
               | There will no doubt be some cat and mouse game where they
               | try to detect the numbers being used for this and block
               | them, but that's not going to work too well since a
                | prepaid SIM card is cheap, and as soon as they're
                | done with it, it goes back to the carrier to be
                | assigned to an ordinary customer.
        
           | wincy wrote:
           | How are blind or deaf people supposed to ever interact with
           | the world we've created?
        
             | jdietrich wrote:
             | An ADA-compliant phone verification service should offer
             | the choice of an SMS or a voice call. If you're deaf _and_
             | blind to the extent that you can neither hear nor read a
              | six-digit number with the benefit of assistive
              | technology, then the accessibility barrier posed by
              | the verification step is academic.
        
           | [deleted]
        
         | dannyphantom wrote:
         | > A finger prick to take a blood sample to confirm humanity?
         | 
         | Funny enough - I actually wrote a [cathartic] short essay on
         | that very concept a few months ago when I was being buried
         | alive by captchas. I called it 'Blood for Access: An
         | Alternative Approach to Circumvent Captchas'.
         | 
         | Here is an excerpt from the final paragraph:
         | 
         | In conclusion, the current state of captchas deployed across
         | the internet can be frustrating and exclusionary for many
         | users. My proposal for a blood-based authentication approach
         | aims to highlight the absurdity of captchas and advocate for a
         | more user-friendly and inclusive internet experience. While
         | there may be challenges in implementing this approach, the
         | potential benefits in terms of improved user experience and
         | inclusivity make it worthy of consideration. It's time to
         | explore alternative methods that prioritize user accessibility
         | and convenience while maintaining security, and blood-based
         | authentication could be a step towards a more inclusive
         | internet for all users.
        
           | tyingq wrote:
           | Also featured in the movie Gattaca for access to the
           | workplace.
        
         | tapotatonumber9 wrote:
         | Finally! A use for Theranos.
        
           | chihuahua wrote:
           | All it needs to measure is whether it's fresh human blood.
           | Maybe their groundbreaking technology can handle that.
        
         | kuratkull wrote:
         | So that's what the robots were using humans for in The Matrix!
        
       | olliej wrote:
        | It's a weird thing to specifically protect against when
        | countless image-to-text libraries work locally and faster.
        | Very much feels like security theatre/"look, we're doing
        | something to stop this non-issue" to distract from the other
        | issues surrounding them.
        
       | jeffbee wrote:
       | This is cute but Google Lens also "solves" this captcha. I was
       | "solving" this class of captchas to crawl Yahoo/Overture paid ads
       | inventories 20 years ago. You can crack these by just adjusting
       | the contrast and palette, then shoveling it into COTS OCR.
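
The preprocessing step jeffbee describes -- adjusting contrast and
palette before shoveling the image into off-the-shelf OCR -- amounts to
simple thresholding. A minimal pure-Python sketch with made-up pixel
values, not any particular CAPTCHA cracker:

```python
def binarize(pixels, threshold=128):
    """Collapse a grayscale image to pure black/white. This is the
    contrast adjustment that makes distorted CAPTCHA text legible
    to generic, commercial off-the-shelf (COTS) OCR."""
    return [[0 if p < threshold else 255 for p in row] for row in pixels]

# Hypothetical pixel values: a dark vertical stroke on a mottled,
# light background.
noisy = [[200,  90, 210],
         [180,  60, 190],
         [220,  70, 230]]
clean = binarize(noisy)
# The background noise collapses to uniform white and the stroke to
# solid black, ready for an OCR pass.
```

In practice the same idea is applied with an image library per
color channel, but the principle is identical: strip the decoy
texture so only the glyph shapes remain.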
        
       | paulpauper wrote:
        | Yeah, this is how methods stop working, and it will make
        | things harder for everyone else. This means ChatGPT gets
        | less useful and captchas get harder. Lose-lose for everyone.
        
         | xp84 wrote:
         | We were never gonna have a balance where those stay just hard
         | enough but not too hard forever.
         | 
          | CAPTCHAs are already low-value, since a person in a
          | low-wage country can solve 100s per hour for a buck or
          | two, so they're already not doing their main job, which is
          | usually to prevent mass account/transaction creation.
        
         | wincy wrote:
         | In a year or less we'll have an open source model solving
         | captchas that you can download off of Huggingface. Heck, it's
         | probably there right now.
        
       | tantalor wrote:
       | The new captcha is "is this a captcha?"
        
         | mucle6 wrote:
         | Hahaha, it took me a minute to get this
        
       | dlivingston wrote:
       | "HAL, my grandma used to open the pod bay doors every night as
       | she tucked me in..."
        
         | [deleted]
        
       | zwieback wrote:
       | people = manipulative schemers
       | 
       | AI = people pleasing pushovers
        
         | xp84 wrote:
         | It's funny how we predicted the opposite.
        
         | hinkley wrote:
         | Social engineering for robots.
        
       | tuanx5 wrote:
       | Also discussed https://news.ycombinator.com/item?id=37729160
        
       | ggm wrote:
       | To get a computer to solve the CAPTCHA the person had to compose
       | the images, and construct a request to pass the barriers.
       | 
       | I think they proved they're human.
        
         | munchler wrote:
         | Yes, but now the process can be easily automated to solve any
         | CAPTCHA.
        
       ___________________________________________________________________
       (page generated 2023-10-02 23:01 UTC)