About

I am Sreyan Ghosh, a 4th-year Computer Science Ph.D. student at the University of Maryland, College Park (UMD) and a student researcher at Nvidia. At UMD, I conduct my research in the Gamma Lab under the mentorship of Prof. Dinesh Manocha and Prof. Ramani Duraiswami. At Nvidia, I work with the ADLR and Cosmos World Model team. My research focuses on advancing multimodal intelligence, with an emphasis on audio—spanning speech, sounds, and music. I work on challenges such as building data- and compute-efficient audio models, improving audio representation learning, generating synthetic data, and enhancing perception and reasoning in AI systems. My research is proudly supported by the NVIDIA Graduate Fellowship.

Previously, I served as a Deep Learning Solutions Architect at Nvidia, Bangalore. My primary work at Nvidia involved building and delivering deep learning based NLP solutions to Nvidia’s customers and partners. Previous to that, I served as a Software Engineer II at Cisco Systems, Bangalore. My primary work at Cisco involved building network assurance software systems for Cisco’s Service Provider customers.

I have been fortunate to have worked with Prof. S. Umesh at Speech Lab @ Indian Institute of Technology Madras on making self-supervised learning in speech and audio more amenable to resource-constrained scenarios (both data and compute). I have also worked with Prof. Rajiv Ratn Shah at MIDAS Labs @ IIIT Delhi on content moderation, complex named entity recognition and speech recognition systems for low-resource Indian languages and Indian-accented English.

I graduated with a Bachelor’s in Computer Science and Engineering from Christ University in 2020. During my undergraduate studies, I served as the Vice President and co-founder of Neuron, Christ University’s first AI group focused on research and hackathons. During my undergraduate studies, I have won over 20 national and international hackathons.

I maintain a list of my publications and research implementations under the Research tab. I also blog about my personal experiences and topics related to speech and text processing. I am always open to collaborations, and please feel free to drop me a mail!

CV / Resume: link
Email ID: gsreyan@gmail.com ; sreyang@umd.edu