hngopher.com

       [HN Gopher] ChatRWKV, like ChatGPT but powered by the RWKV (RNN-...
       ___________________________________________________________________
        
       ChatRWKV, like ChatGPT but powered by the RWKV (RNN-based, open)
       language model
        
       Author : maraoz
       Score  : 78 points
       Date   : 2023-01-19 21:29 UTC (1 hours ago)
        
 (HTM) web link (github.com)
 (TXT) w3m dump (github.com)
        
       | nl wrote:
       | For those wondering how on earth they are getting decent results
       | from a RNN without long range forgetting, I don't really know
       | either!
       | 
       | But they reference https://arxiv.org/abs/2105.14103 and the
       | bottom section of https://github.com/BlinkDL/RWKV-LM has an
       | explainer.
        
       | [deleted]
        
       | totoglazer wrote:
       | This might be an interesting language model. However people care
       | about ChatGPT entirely due to its quality, which this doesn't
       | demonstrate yet.
        
         | phist_mcgee wrote:
         | The leap in public exposure wasn't so much GPT3 to GPT3.5, it
         | was attaching a clean UI to the model, (with sane defaults) and
         | allowing people to talk to it like a person.
         | 
         | Suddenly it became something 'real' then.
        
           | TJSomething wrote:
           | One of the important parts of ChatGPT over plain GPT-3 is the
           | reinforcement learning from human feedback to ensure
           | alignment, without which it's not quite as good of a product
           | for the public.
        
           | tinsmith wrote:
           | This is a remarkably good take that just didn't dawn on me
           | until I read your comment. Even if ChatGPT had a lesser
           | quality than the current iteration, the fact that they had a
           | way for anyone to _easily_ interact with it really was a
           | homerun, snd can be for any software, really.
        
           | junipertea wrote:
           | They also did reinforcement learning on top of a frozen
           | trained model. It is considerably more than just attaching a
           | UI as that would just finish sentences compared to answering
           | questions. https://huggingface.co/blog/rlhf
        
           | totoglazer wrote:
           | No. ChatGPT's UI is incredibly simple and basically exactly
           | what ever chat bot test repl looks like.
           | 
           | The delta of GPT3 -> ChatGPT is from the expanded context and
           | control the model offers through fine tuning. Eg read the
           | instructgpt paper to see the path on the way to ChatGPT.
        
           | redox99 wrote:
           | It's not just the UI. ChatGPT (which is further finetuned and
           | uses RLHF) definitely produces better output than GPT3,
           | especially without prompt engineering.
        
           | gamegoblin wrote:
           | This is mostly correct. GPT3.5 is better, has a larger
           | context window, etc. But it's a very incremental step above
           | GPT3.
           | 
           | I had wired up GPT3 to a Twilio phone number and made
           | something basically like ChatGPT months before ChatGPT was
           | released -- me and my friends texted it all the time to get
           | information, similar to how people use ChatGPT. The prompt to
           | get decent performance is super simple. Just something like:
           | The following is a transcript between a human and a helpful
           | AI assistant.         The AI assistant is knowledgeable about
           | most facts of the world and provides concise answers to
           | questions.              Transcript:         {splice in the
           | last 30 messages of the conversation}              The next
           | thing the assistant says is:
           | 
           | Over time I did upgrade the prompt a bit to improve
           | performance for specific kinds of queries, but nothing crazy.
           | 
           | Cost me $10-20/mo to run for the low/moderate use by me and a
           | few friends.
           | 
           | Interestingly, for people who didn't know its limitations /
           | how to break it, it was basically passing the turing test.
           | ChatGPT is inhumanly wordy, whereas GPT3 can actually be much
           | more concise when prompted to do so. If, instead of prompting
           | it that it is an AI assistant, you prompt it that it is a
           | close friend with XYZ personality traits, it does a very good
           | job of carrying on a light SMS conversation.
        
           | moffkalast wrote:
           | Well yes, having no context memory, being slightly worse and
           | requiring either a monster rig to run or paying per prompt
           | made it completely and utterly irrelevant.
           | 
           | Even now that it's improved and free to use its actual
           | practical usability is marginal at best given the rate of
           | blatantly wrong info being spewed with 105% confidence at the
           | moment.
        
       ___________________________________________________________________
       (page generated 2023-01-19 23:00 UTC)