[HN Gopher] AudioGen: Textually Guided Audio Generation
       ___________________________________________________________________
        
       AudioGen: Textually Guided Audio Generation
        
       Author : pierre
       Score  : 64 points
       Date   : 2022-09-30 19:02 UTC (3 hours ago)
        
 (HTM) web link (felixkreuk.github.io)
 (TXT) w3m dump (felixkreuk.github.io)
        
       | karmasimida wrote:
       | It will be more useful if it can narrate text along with those
       | background effects.
        
         | simonw wrote:
         | You can already achieve that by combining models - use a
         | dedicated speech synthesis model for the narration, then layer
         | that over background effects from AudioGen.
         | 
         | Given that, I don't think AudioGen particularly needs to add
         | full narration. That seems like a very different problem to me,
         | likely requiring a completely different architecture.
        
       | fuzzythinker wrote:
       | [code] redirects to the same page
        
         | ggerganov wrote:
         | According to one of the authors, the code and the models will
         | be available soon [0]
         | 
         | [0] - https://twitter.com/FelixKreuk/status/1575846953333579776
        
       | kevmo314 wrote:
       | The speech samples are really funny. Very Sims-esque.
        
       | nudpiedo wrote:
       | That could be another missing piece to videogame generational
       | art, sfx sounds and soon soundtracks.
        
       ___________________________________________________________________
       (page generated 2022-09-30 23:00 UTC)