Shruti Palaskar

Graduate Student
Carnegie Mellon University
Pittsburgh, PA.

Email LinkedIn GitHub

Publications

Google Scholar Page

2019

  1. Learned in Speech Recognition: Contextual Acoustic Word Embeddings
    Shruti Palaskar*, Vikas Raunak*, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  2. Learning from Multiview Correlations in Open-Domain Videos
    Nils Holzenberger*, Shruti Palaskar*, Pranava Madhyastha, Florian Metze, Raman Arora
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  3. Multimodal Grounding for Sequence-to-Sequence Speech Recognition
    Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  4. CMU Sinbad’s Submission to the DSTC7 AVSD Track
    Ramon Sanabria*, Shruti Palaskar*, Florian Metze
    Oral Presentation
    7th Dialog State Tracking Challenge (DSTC) at AAAI, 2019

2018

  1. Acoustic to Word Recognition with Sequence to Sequence Models
    Shruti Palaskar and Florian Metze
    IEEE Workshop on Spoken Language Technology (SLT), 2018

  2. How2: A Large-scale Dataset for Multimodal Language Understanding
    Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze
    NeurIPS workshop on Visually Grounded Interaction and Language (ViGIL), 2018

  3. Multimodal Abstractive Summarization for Open-Domain Videos
    Jindrich Libovicky, Shruti Palaskar, Spandana Gella, Florian Metze
    Spotlight Presentation
    NeurIPS workshop on Visually Grounded Interaction and Language (ViGIL), 2018

  4. End-to-End Multimodal Speech Recognition
    Shruti Palaskar*, Ramon Sanabria*, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

  5. Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking Rosetta” JSALT 2017 Workshop
    Odette Scharenborg, Laurent Besacier, Alan Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

2017

  1. Building an ASR system for a low-resource language through the adaptation of a high-resource language ASR system: Preliminary results
    Odette Scharenborg, Francesco Ciannella, Shruti Palaskar, Alan Black, Florian Metze, Lucas Ondel, Mark Hasegawa-Johnson
    ICNLSSP, 2017

  2. Combining LSTM and Latent Topic Modeling for Mortality Prediction
    Yohan Jo, Lisa Lee, Shruti Palaskar
    Preprint, 2017

* - Equal contribution