Shamanthak Hegde
I am a final-year Computer Science Master's student at Arizona State University, where I am advised by Yezhou Yang. My current research interests lie in combining information from different sources, such as text, images, and video, to help machines develop more robust, reliable, and safe commonsense reasoning.
I received my Bachelor's in Computer Science Engineering from KLE Technological University in 2023, where I was advised by Shankar Gangisetty on various projects in the VQA domain.
Email / CV / Google Scholar / X / GitHub
News
- June, 2025 - New preprint ChartQA-X: Generating Explanations for Charts is out!
- June, 2025 - Nominated as a reviewer for NeurIPS 2025!
- April, 2025 - Nominated as a reviewer for ACM MM 2025!
- March, 2025 - Nominated as a reviewer for ACL 2025!
- February, 2025 - New preprint Dual Caption Preference Optimization for Diffusion Models is out!
- October, 2024 - Nominated as a reviewer for RBFM Workshop NeurIPS 2024!
- June, 2024 - 1 paper accepted to EvGenFM Workshop CVPR 2024!
- August, 2023 - Started Master's in Computer Science at ASU!
- July, 2023 - Graduated with a Bachelor's degree in Computer Science Engineering from KLE Technological University!
- June, 2023 - 2 papers accepted to O-DRUM Workshop CVPR 2023!
- January, 2023 - Joined Bosch Global Software Technologies as a Software Development Intern.
Research
I'm interested in computer vision, machine learning, generative AI, and natural language processing. Representative papers are highlighted.
ChartQA-X: Generating Explanations for Charts
Shamanthak Hegde,
Pooyan Fazli,
Hasti Seifi
Under Review
Paper |
bibtex
|
|
Dual Caption Preference Optimization for Diffusion Models
Amir Saeidi*,
Yiran Luo*,
Agneet Chatterjee,
Shamanthak Hegde,
Bimsara Pathiraja,
Yezhou Yang,
Chitta Baral
Under Review
Paper |
Project |
Code |
bibtex
|
|
Evaluating Multimodal Large Language Models Across Distribution Shifts and Augmentations
Aayush Atul Verma*,
Amir Saeidi*,
Shamanthak Hegde*,
Ajay Therala*,
Fenil Denish Bardoliya*,
Nagaraju Machavarapu*,
Shri Ajay Kumar Ravindhiran*,
Srija Malyala*,
Agneet Chatterjee*,
Yezhou Yang,
Chitta Baral
CVPR EvGenFM Workshop, 2024
Paper |
bibtex
|
|
Making the V in Text-VQA Matter
Shamanthak Hegde,
Soumya Jahagirdar,
Shankar Gangisetty
CVPR O-DRUM Workshop, 2023
Paper |
bibtex
|
|
Weakly Supervised Visual Question Answer Generation
Charani Alampalle,
Shamanthak Hegde,
Soumya Jahagirdar,
Shankar Gangisetty
CVPR O-DRUM Workshop, 2023
Paper |
bibtex