Nvidia creates faster tech for long video making
Researchers introduced Radial Attention, a new attention mechanism that improves the efficiency of long video generation by lowering computational costs without sacrificing video quality.
MIT, NVIDIA, Princeton, UC Berkeley, Stanford, First Intelligence
Xingyang Li et al.
The study identified Spatiotemporal Energy Decay in video diffusion models, where attention scores decline as spatial and temporal distances between tokens increase. Radial Attention translates this into a sparse attention mechanism with a computational complexity of O(n log n), improving on the standard O(n²) approach.
Dense attention methods in video generation are often computationally heavy and inefficient for long sequences. This work presents a more scalable solution, enabling faster and more cost-effective video generation across various domains.
The method assumes an exponential decay in attention scores, which simplifies spatiotemporal relationships. This assumption may limit performance in more complex or dynamic video scenarios.
Radial Attention offers a more efficient method for generating long videos, potentially lowering the barrier to producing high-quality video content.
📄 Read the full paper: Radial Attention: $O(nlog n)$ Sparse Attention with Energy Decay for Long Video Generation
……Read full article on Tech in Asia
Comments
Leave a comment in Nestia App