FLUX.2 Klein 9B Schematic LoRA - Depth, Normal, Pose, and Segmentation

r/StableDiffusion
Generative AI

There have already been several projects that try to use the prior knowledge of image generation models for CV tasks, such as Marigold and SDPose. Now that image editing models have become common, there is a very simple idea: maybe these CV tasks can also be treated as image editing tasks. That is the idea behind Google's Vision Banana. When I saw it, I felt that a similar approach might also work with a local model like FLUX.2 Klein, so I trained a set of LoRAs for it. To avoid setting expectations too high: unfortunately, the quality is not good enough for practical use yet.