Abstract: The Cognitive Query System using Generative AI is a cutting-edge tool designed to transform the way we retrieve and analyse data from a wide range of sources, including text, images, and documents. Powered by the Google Gemini API and deployed through Streamlit, this system allows users to easily ask questions, process images, and explore document contents via an intuitive, interactive interface. The system comprises three core modules: the Question-Answering Module, which uses advanced natural language processing (NLP) techniques to provide contextually accurate, conversational responses to user queries, facilitating seamless, real-time interactions. The Image Processing and Querying Module enables users to upload images and ask questions related to their content, leveraging computer vision algorithms and AI to interpret and analyse visual data, making it easier to extract valuable insights. The Document Exploration Module supports various formats, including PDFs, Word documents, and spreadsheets, allowing users to quickly ask questions and navigate through sections without manual reading. Deployed online via Streamlit, the Cognitive Query System offers a smooth, real-time experience that does not require installation, making it ideal for remote and collaborative work environments. This innovative solution is designed to streamline workflows and effective outputs in the model.
Read full abstract