Pratham Singla

AI/ML Researcher & Engineer | IIT Roorkee
Roorkee, IN.

About

Highly accomplished AI/ML researcher and engineer with a 9.195/10 CGPA from IIT Roorkee, specializing in Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Computer Vision. Proven ability to develop and evaluate advanced AI systems, evidenced by multiple research internships, publications in top-tier workshops (NeurIPS, AAAI), and innovative projects. Seeking to leverage deep technical expertise and research acumen to drive impactful advancements in AI development.

Work

Lossfunk
|

Research Intern

Remote, India

Summary

Pioneered the development of advanced AI benchmarks and contributed to reinforcement learning frameworks for vision-language models at a leading research firm.

Highlights

Developed a novel benchmark, inspired by ZeroBench, for complex visual reasoning, integrating an automated question-generation pipeline to train and evaluate vision-language models on GPT-level reasoning tasks.

Contributed to a reinforcement learning-based training framework, specifically designed to enhance models' ability to ground reasoning in visual inputs, reducing reliance on textual cues.

Pioneered the creation of a sophisticated evaluation system for next-generation AI models, setting a new standard for visual-language understanding.

Omega Intelligence
|

Gen AI Intern

Remote, India

Summary

Explored and optimized Retrieval-Augmented Generation (RAG) architectures, establishing robust local benchmarking setups for Large Language Models (LLMs).

Highlights

Explored and analyzed cutting-edge agentic Retrieval-Augmented Generation (RAG) architectures to optimize information retrieval and synthesis for complex queries.

Established a robust local benchmarking setup using tools like Ollama, enabling efficient deployment and performance evaluation of Large Language Models (LLMs).

Boston University
|

Research Intern

Boston, MA, US

Summary

Investigated self-awareness in Large Language Models (LLMs) and evaluated fine-tuning methods to understand model behavior and generalization.

Highlights

Investigated self-awareness in Large Language Models (LLMs) through in-depth analysis of alignment between internal reasoning and external outputs.

Evaluated the performance of SFT, DPO, and GRPO tuned models on tasks involving bias, risk, and reward hacking, providing critical insights into model vulnerabilities.

Designed and executed comprehensive experiments to rigorously test the awareness of learned behaviors and generalization of reasoning across diverse domains.

Volunteer

ACM, IIT Roorkee
|

Core Member

Roorkee, Uttarakhand, India

Summary

Actively participated in ACM chapter activities, contributing to initiatives that advance computer science education and community engagement at IIT Roorkee.

Highlights

Collaborated with fellow members to promote computer science and foster a vibrant academic community.

Contributed to the development and dissemination of technical knowledge through participation in group projects and discussions.

Supported the chapter's mission to provide resources and opportunities for students interested in computing.

Vision and Language Group, IIT Roorkee
|

Head of Projects

Roorkee, Uttarakhand, India

Summary

Led project planning and execution, fostered an active Deep Learning community, and moderated discussions within the Vision and Language Group at IIT Roorkee.

Highlights

Orchestrated the overall planning and moderation of research projects, contributing to strategic direction and successful project outcomes.

Organized and moderated paper discussions, enhancing knowledge sharing and collaborative research initiatives within the group.

Actively contributed to the development of an active Deep Learning community on campus, fostering growth and engagement among students.

Education

Indian Institute of Technology Roorkee
Roorkee, Uttarakhand, India

Bachelor of Technology

Mechanical Engineering

Grade: 9.195/10 CGPA

Publications

Reflective Self-Awareness and Reasoning Alignment in LLMs

Published by

AAAI'26

Summary

Analyzed self-awareness and reasoning alignment in Large Language Models (LLMs) across various scenarios, evaluating the impact of SFT, DPO, and GRPO fine-tuning on model behavior, including bias, risk, and reward hacking.

Adaptive Urban Planning

Published by

AAAI'25 AI4UP Workshop

Summary

Developed a Multi-Agent urban planning framework using LLMs and Genetic Algorithms for optimization and regional customization, resulting in significant improvements in livability and accessibility.

StegaVision: Enhancing Steganography with Attention Mechanism

Published by

AAAI-25 Student abstract

Summary

Developed a novel image steganography model using a neural network-based approach, integrating attention mechanisms (channel and spatial attention) into an autoencoder architecture.

Give me a hint: Can LLMs take hint to solve math problems?

Published by

NeurIPS’24 Math-AI Workshop

Summary

Conducted a study on LLMs' ability to utilize hints in mathematical problem-solving on the MATH dataset, benchmarking zero-shot, few-shot, chain-of-thought, and adversarial hinting against traditional prompting.

Projects

DE -VTL: A Retrieval Framework for RAG

Summary

Developed an innovative Retrieval-Augmented Generation (RAG) framework focused on enhancing retrieval quality through active learning and efficient fine-tuning.

Evaluating AI Agent frameworks

Summary

Evaluated AI agent frameworks on QA tasks to assess their reasoning capabilities, response accuracy, and efficiency.

AI Mock Interview Chatbot

Summary

Engineered a voice-interactive AI interview coach utilizing Gemini LLM and Vapi AI to provide real-time feedback and structured scoring for personalized interview preparation.

Low Light Image Enhancement

Summary

Developed a neural network-based solution for enhancing low-light images, achieving significant PSNR improvements through advanced image processing techniques.

References

Paras Chopra

Founder, Lossfunk, Wingify