🧠 Introduction In the realm of generative modeling, the ability to synthesize high-quality images across diverse resolutions and aspect ratios has been a significant challenge. Traditional models often rely on fixed-resolution inputs, limiting their flexibility and scalability. Enter the Native-resolution Diffusion Transformer (NiT), a novel architecture designed to explicitly handle varying resolutions and aspect ratios