Estimate GPU memory usage for Megatron models
Demo fo Dream 7B, an open diffusion large language model