Posts by jedfox@mastodon.social
 (DIR) Post #AU7JWu7G0roKKSAngW by jedfox@mastodon.social
       2023-03-29T19:05:26Z
       
       0 likes, 0 repeats
       
       @simon @pjbrunet I tried using that pattern with alpaca and it kept on guessing instead of trying to run external code :(I think someone needs to train a model specifically to use these sorts of external resources.
       
 (DIR) Post #AVYBf2meBg36nZTfPc by jedfox@mastodon.social
       2023-05-11T16:05:47Z
       
       0 likes, 0 repeats
       
       @simon Do you think it would be possible to train the model on dedicated “delimiter” tokens (maybe “instruction”/“input”/“output”) with no textual representation? Assuming the model was instruction trained using only these tokens, it might be more difficult to convince it that the input has ended, right?