Used for movies, TV shows, and music videos purchased or rented through the iTunes Store .
Introduces the Multi-Modal Diffusion Mamba (MM-DiM) block, which allows for more efficient integration of spatiotemporal modeling in video generation. 576274.m4v
M4V: Multi-Modal Mamba for Text-to-Video Generation Used for movies, TV shows, and music videos