devbnamdar/MM-DiT-From-Scratch: A high-fidelity Multimodal Diffusion Transformer (MM-DiT) built from scratch for human face synthesis, optimized for single-GPU training.

A high-fidelity Multimodal Diffusion Transformer (MM-DiT) built from scratch for human face synthesis, optimized for single-GPU training.

Read Original

Related