https://mixed-news.com/en/open-source-rival-for-openais-dall-e-runs-on-your-graphics-card/

Open-source rival for OpenAI's DALL-E runs on your graphics card

Aug 14 2022, Maximilian Schreiner

OpenAI's DALL-E 2 is getting free competition. Behind it are an open-source AI movement and the startup Stability AI.

Artificial intelligence that can generate images from text descriptions has been making rapid progress since early 2021. At that time, OpenAI showed impressive results with DALL-E 1 and CLIP. The open-source community used CLIP for numerous alternative projects throughout the year. Then in 2022, OpenAI released the impressive DALL-E 2, Google showed Imagen and Parti, Midjourney reached millions, and Craiyon flooded social media with AI images.

Startup Stability AI has now announced the release of Stable Diffusion, another DALL-E 2-like system that will initially be rolled out gradually to new researchers and other groups via a Discord server.
After a testing phase, Stable Diffusion will be released for free: the code and a trained model will be published as open source. There will also be a hosted version with a web interface for users to test the system.

Stability AI funds free DALL-E 2 competitor

Stable Diffusion is the result of a collaboration between researchers at Stability AI, RunwayML, LMU Munich, EleutherAI and LAION. The research collective EleutherAI is known for its open-source language models GPT-J-6B and GPT-NeoX-20B, among others, and is also conducting research on multimodal models.

The non-profit LAION (Large-scale Artificial Intelligence Open Network) provided the training data with the open-source LAION 5B dataset, which the team filtered with human feedback in an initial testing phase to create the final LAION-Aesthetics training dataset.

Patrick Esser of Runway and Robin Rombach of LMU Munich led the project, building on their work in the CompVis group at Heidelberg University. There, they created the widely used VQGAN and Latent Diffusion. The latter served as the basis for Stable Diffusion, together with research from OpenAI and Google Brain.

"Jazz robots." by TheRealBissy #StableDiffusion #AIArt #AIArtwork @StableDiffusion pic.twitter.com/V6hBWZUuM9 - Stable Diffusion Pics (@DiffusionPics) August 14, 2022

Stability AI, founded in 2020, is backed by mathematician and computer scientist Emad Mostaque. He worked as an analyst for various hedge funds for a few years before turning to public work. In 2019, he helped found Symmitree, a project that aims to lower the cost of smartphones and Internet access for disadvantaged populations.

With Stability AI and his private fortune, Mostaque aims to foster the open-source AI research community. His startup previously supported the creation of the "LAION 5B" dataset, for example.
For training the Stable Diffusion model, Stability AI provided servers with 4,000 Nvidia A100 GPUs.

"Nobody has any voting rights except our 75 employees -- no billionaires, big funds, governments, or anyone else with control of the company or the communities we support. We're completely independent," Mostaque told TechCrunch. "We plan to use our compute to accelerate open source, foundational AI."

Stable Diffusion is an open-source milestone

A test of Stable Diffusion is currently underway, with new access being granted in waves. The results, which can be seen on Twitter, for example, show that a real DALL-E 2 competitor is emerging here.

Stable Diffusion is more versatile than Midjourney, but has a lower resolution than DALL-E 2. | Image: GitHub

Unlike DALL-E 2, Stable Diffusion can generate images of prominent people and other subjects that OpenAI prohibits in DALL-E 2. Other systems like Midjourney or Pixelz.ai can do this as well, but do not achieve comparable quality with the high diversity seen in Stable Diffusion, and none of the other systems are open source.

Turns out #stablediffusion can do really awesome interpolations between text prompts if you fix the initialization noise and slerp between the prompt conditioning vectors: pic.twitter.com/lWOoETYVZ3 - Xander Steenbrugge (@xsteenbrugge) August 7, 2022

Stable Diffusion is already expected to run on a single graphics card with 5.1 gigabytes of VRAM, bringing to the edge an AI technology that until now has only been available through cloud services. Stable Diffusion thus offers researchers and interested parties without access to GPU servers the opportunity to experiment with a modern generative AI model. The model is also supposed to run on MacBooks with Apple's M1 chip. There, however, image generation takes several minutes instead of seconds.
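The prompt interpolation described in Xander Steenbrugge's tweet above can be illustrated with a small sketch. It is not Stable Diffusion's code or API: the short lists below are hypothetical stand-ins for the conditioning vectors a text encoder would produce and for the initial latent noise, and the names `slerp`, `cond_a`, `cond_b`, `init_noise` and `frames` are assumptions made for illustration. The idea is only to show spherical linear interpolation (slerp) between two vectors while the same noise is reused for every frame.

```python
import math
import random


def slerp(v0, v1, t):
    """Spherical linear interpolation between two equal-length vectors."""
    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    # Cosine of the angle between the vectors, clamped against rounding error.
    cos_omega = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_omega = max(-1.0, min(1.0, cos_omega))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if sin_omega < 1e-6:
        # Nearly parallel (or antiparallel) vectors: fall back to linear interpolation.
        return [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]
    w0 = math.sin((1.0 - t) * omega) / sin_omega
    w1 = math.sin(t * omega) / sin_omega
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


# Hypothetical stand-ins: real conditioning vectors would come from the model's
# text encoder, and the noise would have the shape of the latent image.
rng = random.Random(0)  # fixed seed, so the noise is reproducible
init_noise = [rng.gauss(0.0, 1.0) for _ in range(4)]
cond_a = [1.0, 0.0, 0.0]  # stands in for the first prompt
cond_b = [0.0, 1.0, 0.0]  # stands in for the second prompt

# Each frame pairs the SAME noise with an interpolated conditioning vector.
steps = 5
frames = [(init_noise, slerp(cond_a, cond_b, i / (steps - 1))) for i in range(steps)]
```

In a real pipeline, each `(noise, conditioning)` pair would be passed to the sampler to render one image; holding the noise fixed is what keeps the composition stable while the prompt changes.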
OpenAI's DALL-E 2 gets open-source competition, led by an open-source community and the startup Stability AI. | Image: GitHub

Stability AI itself also wants to enable companies to train their own variant of Stable Diffusion. Multimodal models are thus following the path previously taken by large language models: away from a single provider and toward the broad availability of numerous alternatives through open source.

Runway is already researching text-to-video editing enabled by Stable Diffusion.

#stablediffusion text-to-image checkpoints are now available for research purposes upon request at https://t.co/7SFUVKoUdl Working on a more permissive release & inpainting checkpoints. Soon(tm) coming to @runwayml for text-to-video-editing pic.twitter.com/7XVKydxTeD - Patrick Esser (@pess_r) August 11, 2022

Stable Diffusion: Pandora's box and net benefits

Of course, with open access and the ability to run the model on a widely available GPU, the opportunity for abuse increases dramatically.

"A percentage of people are simply unpleasant and weird, but that's humanity," Mostaque said. "Indeed, it is our belief this technology will be prevalent, and the paternalistic and somewhat condescending attitude of many AI aficionados is misguided in not trusting society."

Mostaque stresses, however, that free availability allows the community to develop countermeasures.

"We are taking significant safety measures including formulating cutting-edge tools to help mitigate potential harms across release and our own services. With hundreds of thousands developing on this model, we are confident the net benefit will be immensely positive and as billions use this tech harms will be negated."

More information is available on the Stable Diffusion GitHub. You can find many examples of Stable Diffusion's image generation capabilities in the Stable Diffusion subreddit. Go here for the beta signup for Stable Diffusion.
Sources: Stability AI