MUMBAI, India, June 13 -- Intellectual Property India has published a patent application (202417070822 A) filed by Google Llc, Mountain View, U.S.A., on Sept. 19, 2024, for 'multi-axis vision transformer.'
Inventor(s) include Li, Yinxiao; Tu, Zhengzhong; Talebi, Hossein; Zhang, Han; Yang, Feng; and Milanfar, Peyman.
The application for the patent was published on June 13, under issue no. 24/2025.
According to the abstract released by the Intellectual Property India: "Provided is an efficient and scalable attention model that can be referred to as multi-axis attention. Example implementations can include two aspects: blocked local and dilated global attention. These design choices allow global-local spatial interactions on arbitrary input resolutions with only linear complexity. The present disclosure also presents a new architectural element by effectively blending the proposed multi-axis attention model with convolutions. In addition, the present disclosure proposes a simple hierarchical vision backbone, example implementations of which can be referred to as MaxViT, by simply repeating the basic building block over multiple stages. Notably, MaxViT is able to "see" globally throughout the entire network, even in earlier, high-resolution stages."
The patent application was internationally filed on Mar. 30, 2023, under International application No.PCT/US2023/016952.
Disclaimer: Curated by HT Syndication.