MAE offers stable estimates of the masked region, yet
falls short in texture detail.
GAN-based inpainting suffers from low fidelity, e.g.,
omitting the white horizontal lines.
SD is powerful but unstable, often
introducing random elements and suffering from
color inconsistency between masked and unmasked regions.
ASUKA ensures consistency between masked and unmasked regions during both the diffusion and decoding processes, achieving context-stable and visually consistent inpainting.