MUMBAI, India, June 19 -- Intellectual Property India has published a patent application (202647065024 A) filed by Qualcomm Incorporated on May 22, 2026, titled "Efficient Speculative Decoding In Autoregressive Generative Artificial Intelligence Models."

Inventors include Jeon, Wonseok; Gagrani, Mukul; Lee, Mingu; Goel, Raghavv; Park, Junyoung; and Lott, Christopher.

The patent application was published on June 5, 2026, under issue no. 23/2026.

Abstract: Certain aspects of the present disclosure provide techniques and apparatus for efficiently generating a response to a query input in a generative artificial intelligence model. An example method generally includes generating, based on an input prompt and using a first machine learning model, a set of tokens including one or more subsets of tokens. Each respective subset of the one or more subsets corresponds to a respective portion of a response to the input prompt and includes a fixed number of tokens corresponding to a beam width for a beam search through the set of tokens. The set of tokens is output to a second machine learning model for verification, and information identifying a selected sequence of tokens from the generated set of tokens is received from the second machine learning model. The selected sequence of tokens is output as the response to the input prompt.
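
For readers unfamiliar with the technique, the flow described in the abstract can be illustrated with a toy sketch. This is not the method claimed in the application. It is a minimal illustration under stated assumptions: both "models" are hypothetical bigram lookup tables, the draft model keeps a fixed number of beams (the beam width) at each step, and the verifier uses a simple greedy rule that accepts a drafted token only if it matches its own top choice. All names below (draft_token_set, verify, DRAFT_TABLE, TARGET_TABLE) are invented for illustration.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Token = str
# A "model" here is any function mapping a token prefix to next-token probabilities.
Model = Callable[[Sequence[Token]], Dict[Token, float]]


def make_bigram_model(table: Dict[Token, Dict[Token, float]]) -> Model:
    """Hypothetical toy model: the next-token distribution depends only on the last token."""
    def model(prefix: Sequence[Token]) -> Dict[Token, float]:
        return table[prefix[-1]]
    return model


def draft_token_set(draft: Model, prompt: Sequence[Token],
                    beam_width: int, depth: int) -> List[List[Tuple[Token, ...]]]:
    """Run beam search with the draft model.

    Returns one subset per drafted position; each subset holds exactly
    `beam_width` candidate continuations (assumes vocabulary size >= beam_width).
    """
    beams: List[Tuple[Tuple[Token, ...], float]] = [((), 1.0)]
    subsets: List[List[Tuple[Token, ...]]] = []
    for _ in range(depth):
        expanded = []
        for cont, prob in beams:
            for tok, q in draft(list(prompt) + list(cont)).items():
                expanded.append((cont + (tok,), prob * q))
        # Keep only the `beam_width` most probable continuations.
        expanded.sort(key=lambda b: b[1], reverse=True)
        beams = expanded[:beam_width]
        subsets.append([cont for cont, _ in beams])
    return subsets


def verify(target: Model, prompt: Sequence[Token],
           subsets: List[List[Tuple[Token, ...]]]) -> List[Token]:
    """Greedy verification (an assumption of this sketch).

    Walk the drafted positions in order. At each one, accept the drafted
    continuation that ends in the target model's top token. If none matches,
    emit the target's own token as a correction and stop.
    """
    accepted: Tuple[Token, ...] = ()
    for subset in subsets:
        probs = target(list(prompt) + list(accepted))
        best = max(probs, key=probs.get)
        candidate = accepted + (best,)
        if candidate not in subset:
            return list(candidate)
        accepted = candidate
    return list(accepted)


# Hypothetical probability tables over a three-token vocabulary.
DRAFT_TABLE = {
    "a": {"b": 0.5, "c": 0.4, "a": 0.1},
    "b": {"a": 0.5, "c": 0.4, "b": 0.1},
    "c": {"a": 0.6, "b": 0.3, "c": 0.1},
}
TARGET_TABLE = {
    "a": {"b": 0.7, "c": 0.2, "a": 0.1},
    "b": {"c": 0.6, "a": 0.3, "b": 0.1},
    "c": {"a": 0.8, "b": 0.1, "c": 0.1},
}

if __name__ == "__main__":
    draft = make_bigram_model(DRAFT_TABLE)
    target = make_bigram_model(TARGET_TABLE)
    token_set = draft_token_set(draft, ["a"], beam_width=2, depth=3)
    print(token_set)
    print(verify(target, ["a"], token_set))
```

In this toy run, the verifier accepts the first drafted token, finds no drafted continuation matching its preferred second token, and substitutes its own choice. A production system would typically score all drafted candidates in a single batched forward pass of the larger model, which is where the efficiency gain of speculative decoding comes from.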

Disclaimer: Curated by HT Syndication.