Generative art with language+diffusion models
September 16, 2022 — March 5, 2024
Generative art using modern diffusion-backed image generators. The name-brand models are DALL-E 2, stable diffusion, Midjourney etc, which are diffusion + transformer models.
CLIP presumably goes somewhere near here.
For audio stuff see music diffusion.
1 DIY models
- That Pokemon diffusion post: Adventures in Finetuning Stable Diffusion.
Civitai is a labor of love from a small team. After being inspired daily by the incredible progress of the Stable Diffusion community and the explosion of custom fine-tuned models, textual inversions, and more, we wanted to see if we could create something that would continue to help the community grow and thrive.
After seeing a gap around sharing the custom models that were being made by the community, we decided to try our hand a putting together a tool that would make it easy for anyone to share, find, and review models. While there were existing services like HuggingFace that allowed users to expose their models as repositories, we felt that it was missing a few key features that would really allow it to serve as a home for the growing community and use case:
A way for creators to tag models with things that make sense to the SD community
A good way for people interested in the model to review and share their creations
A simpler upload and download interface (how many of us are really familiar with code repos)
An indexed and visual browsing experience of all the models available
An API that can be used by SD tools to tap into the growing library of models, embeds, aesthetic gradients, and hyper networks available
2 Punditry
3 Mechanics
4 UIs
DiffusionBee - Stable Diffusion App for AI Art /divamgupta/diffusionbee-stable-diffusion-ui: Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
invoke-ai/InvokeAI Linux, Windows and macOS
InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multiple commercial products.
AUTOMATIC1111/stable-diffusion-webui: Stable Diffusion web UI
A browser interface based on Gradio library for Stable Diffusion.
NMKD Stable Diffusion GUI - AI Image Generator by N00MKRAD
A handy GUI to run Stable Diffusion, a machine learning toolkit to generate images from text, locally on your own hardware.
It is completely uncensored and unfiltered - I am not responsibly for any of the content generated with it. No data is shared/collected by me or any third party.
-
Stable Diffusion, DALL-E 2, CLIP-Guided Diffusion, VQGAN+CLIP and Neural Style Transfer are all available on NightCafe.
5 Incoming
- StableDiffusion
- Stable Diffusion Public Release — Stability.Ai
- Stable Diffusion launch announcement — Stability.Ai
- Stable Diffusion with 🧨 Diffusers
- huggingface/diffusers: 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
- The Annotated Diffusion Model
- Stability-AI/stablediffusion: High-Resolution Image Synthesis with Latent Diffusion Models
- Stable Diffusion 2.0 Release — Stability.Ai
- Google AI Blog: High Fidelity Image Generation Using Diffusion Models
- Reddit for AI-generated and manipulated content
- Denoising Diffusion Restoration Models
- The text-to-image revolution, explained
- Stable Diffusion with Core ML on Apple Silicon
- Geometry in Text-to-Image Diffusion Models