Shruti Palaskar

Graduate Student
Carnegie Mellon University
Pittsburgh, PA.

Email Twitter LinkedIn GitHub

Research

I am interested in enabling machines to learn from multiple modalities of data, such as text, audio, video, and semantics, as humans naturally do.

Ongoing Research

  • Multimodal Video Understanding
    Exploring multimodal summarization, rationalization, and understanding of videos. This work relies largely on the How2 dataset.

  • End-to-End and Multimodal Speech Recognition
    Building robust direct acoustic-to-word models for audio-only and audio-visual data.

Summer Internships

Past Projects

Undergrad Internships

During my undergrad, I was fortunate to work on computer vision problems with Dr. Hyunsung Park and Prof. Ramesh Raskar at the MIT Media Lab-mentored REDX Innovation Labs, on machine translation with Prof. Ganesh Ramakrishnan at IIT Bombay, and on recommender systems with Harshad Saykhedkar at Sokrati Technologies, a digital marketing startup.