Skip to content

管道

管道提供了一种简单的方法,通过将所有必要的组件(多个独立训练的模型、调度器和处理器)捆绑到一个端到端的类中,来在推理过程中运行最先进的扩散模型。管道是灵活的,可以适应使用不同的调度器甚至模型组件。

所有管道都基于 [DiffusionPipeline] 类构建,该类提供了加载、下载和保存所有组件的基本功能。特定的管道类型(例如 [StableDiffusionPipeline])使用 [~DiffusionPipeline.from_pretrained] 加载时,会自动检测并加载管道组件,并传递给管道的 __init__ 函数。

下表列出了 🤗 Diffusers 中目前可用的所有管道及其支持的任务。点击一个管道以查看其摘要和已发表的论文。

PipelineTasks
aMUSEdtext2image
AnimateDifftext2video
Attend-and-Excitetext2image
AudioLDMtext2audio
AudioLDM2text2audio
AuraFlowtext2image
BLIP Diffusiontext2image
CogVideoXtext2video
Consistency Modelsunconditional image generation
ControlNettext2image, image2image, inpainting
ControlNet with Flux.1text2image
ControlNet with Hunyuan-DiTtext2image
ControlNet with Stable Diffusion 3text2image
ControlNet with Stable Diffusion XLtext2image
ControlNet-XStext2image
ControlNet-XS with Stable Diffusion XLtext2image
Dance Diffusionunconditional audio generation
DDIMunconditional image generation
DDPMunconditional image generation
DeepFloyd IFtext2image, image2image, inpainting, super-resolution
DiffEditinpainting
DiTtext2image
Fluxtext2image
Hunyuan-DiTtext2image
I2VGen-XLtext2video
InstructPix2Piximage editing
Kandinsky 2.1text2image, image2image, inpainting, interpolation
Kandinsky 2.2text2image, image2image, inpainting
Kandinsky 3text2image, image2image
Kolorstext2image
Latent Consistency Modelstext2image
Latent Diffusiontext2image, super-resolution
Lattetext2image
LEDITS++image editing
Lumina-T2Xtext2image
Marigolddepth
MultiDiffusiontext2image
MusicLDMtext2audio
PAGtext2image
Paint by Exampleinpainting
PIAimage2video
PixArt-αtext2image
PixArt-Σtext2image
Self-Attention Guidancetext2image
Semantic Guidancetext2image
Shap-Etext-to-3D, image-to-3D
Stable Audiotext2audio
Stable Cascadetext2image
Stable Diffusiontext2image, image2image, depth2image, inpainting, image variation, latent upscaler, super-resolution
Stable Diffusion XLtext2image, image2image, inpainting
Stable Diffusion XL Turbotext2image, image2image, inpainting
Stable unCLIPtext2image, image variation
T2I-Adaptertext2image
Text2Videotext2video, video2video
Text2Video-Zerotext2video
unCLIPtext2image, image variation
UniDiffusertext2image, image2text, image variation, text variation, unconditional image generation, unconditional audio generation
Value-guided planningvalue guided sampling
Wuerstchentext2image

DiffusionPipeline

[[autodoc]] DiffusionPipeline - all - call - device - to - components

[[autodoc]] pipelines.StableDiffusionMixin.enable_freeu

[[autodoc]] pipelines.StableDiffusionMixin.disable_freeu

FlaxDiffusionPipeline

[[autodoc]] pipelines.pipeline_flax_utils.FlaxDiffusionPipeline

PushToHubMixin

[[autodoc]] utils.PushToHubMixin