MUMBAI, India, Jan. 2 -- Intellectual Property India has published a patent application (202541123274 A) filed by Vellore Institute of Technology, Vellore, Tamil Nadu, on Dec. 6, 2025, for 'comprehensive pdf query system with semantic search and flashcard generation.'
The inventors include Dr. Mohana Sundaril and Swarnava Banerjee.
The patent application was published on Jan. 2 under issue no. 01/2026.
According to the abstract released by Intellectual Property India: "The present disclosure provides a comprehensive PDF query system that includes a document processing module configured to extract text from PDF documents including scanned image-based PDFs using optical character recognition, a text preprocessing module configured to segment the extracted text into processable chunks, an embedding module configured to convert the text chunks into vector embeddings using natural language processing models, a vector database configured to store the vector embeddings and enable similarity searches, a query processing module configured to receive natural language queries from users, convert the queries into query vectors, and perform similarity searches against the stored vector embeddings to retrieve contextually relevant document segments, a flashcard generation module configured to analyze the preprocessed text to identify key concepts and generate question-answer pairs, and a user interface module (10) configured to provide document upload and query submission functionality."
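For readers unfamiliar with the retrieval steps the abstract names (chunking, embedding, vector storage and similarity search), the following is a minimal illustrative sketch in Python and not the applicant's implementation. Every name in it (chunk_text, embed, cosine, VectorStore) is hypothetical, and the word-count "embedding" is a deliberately simple stand-in for the natural language processing models mentioned in the abstract. OCR, flashcard generation and the user interface are not covered.

```python
import math
import re
from collections import Counter


def chunk_text(text, max_words=40):
    """Segment text into chunks of at most max_words words (assumed strategy)."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def embed(text):
    """Toy embedding: lowercase word counts, standing in for a learned model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


class VectorStore:
    """In-memory stand-in for the vector database described in the abstract."""

    def __init__(self):
        self.items = []  # list of (chunk, vector) pairs

    def add(self, chunk):
        self.items.append((chunk, embed(chunk)))

    def search(self, query, k=1):
        """Return the k stored chunks most similar to the query."""
        q = embed(query)
        ranked = sorted(self.items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [chunk for chunk, _ in ranked[:k]]


# Usage: index two short chunks, then run a natural language query.
document = ("Photosynthesis converts light into chemical energy. "
            "Mitochondria produce ATP through cellular respiration.")
store = VectorStore()
for chunk in chunk_text(document, max_words=6):
    store.add(chunk)
top = store.search("What makes ATP in the cell?", k=1)
print(top)
```

A production system of the kind described would presumably replace the word-count vectors with dense embeddings from a trained language model, which is what allows matches on meaning rather than on shared words.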
Disclaimer: Curated by HT Syndication.