Nodes for Stable Diffusion

krezrock · December 13, 2023, 4:11pm

Question. Is Vuo in a good position with its front end to implement Stable Diffusion? Similar to Comfy UI. If it is. That may be a good FR.

jstrecker · December 30, 2023, 5:43pm

So, at the most basic level, I think this feature request would be for a node that inputs a text phrase and outputs an image created by generative AI. Is that what you had in mind, @krezrock?

Currently Apple’s implementation of Stable Diffusion takes about 20-46 seconds to generate a 1024x1024 image on recent Mac hardware. Would you still find this a potentially useful Vuo feature at that speed, or would it be better to wait until the technology is to the point where you could use it more effectively for real-time generation of visuals?

Do you know if there are any alternatives to Stable Diffusion that are trained on images that the artists granted permission to use?

krezrock · January 2, 2024, 3:03am

That is correct, at a basic level.

Great point in regard to Macs processing these models. Maybe the best approach would be nodes to access API’s. I believe there are ways to use the API’s to access external GPU’s instead of using locally.

You bring up a very big issue with Stable Diffusion (and image diffusion in general). The only alternative that is supposed to be ethically trained is Adobe’s Firefly. They have not release API access.