By Haziqa Sajid
In recent years, generative AI has shown promising results on complex tasks. Models such as ChatGPT, Bard, LLaMA, DALL-E 3, and SAM have demonstrated remarkable capabilities across multidisciplinary problems, including visual question answering, segmentation, reasoning, and content generation. Moreover, multimodal AI techniques have emerged that can process multiple data modalities, such as text, images, audio, and




