Shruti Palaskar bio photo

Shruti Palaskar

Graduate Student
Carnegie Mellon University
Pittsburgh, PA.

Email Twitter LinkedIn Github

Publications

Google Scholar Page

2020

  1. Hierarchical Semantic Concepts for Multimodal Video Understanding
    Shruti Palaskar, Ruslan Salakhutdinov, Alan W. Black, Florian Metze
    Under Review

  2. Transfer Learning for Multimodal Dialog
    Shruti Palaskar*, Ramon Sanabria*, Florian Metze
    Elsevier Computer Speech and Language

  3. Grounded Sequence-to-Sequence Transduction
    Specia, Arora, Barrault, Caglayan, Duarte, Elliott, Gella, Holzenberger, Lala, Lee, Libovick' y, Madhyastha, Metze, Mulligan, Ostapenko, Palaskar, Sanabria, Wang
    IEEE Journal for Select Topics in Signal Processing

  4. Speech Technology for Unwritten Languages
    Scharenborg, Besacier, Black, Hasegawa-Johnson, Metze, Neubig, Stüker, Godard, Müller, Ondel, Palaskar, Arthur, Ciannella, Du, Larsen, Merkx, Riad, Wang, Dupoux
    IEEE Transactions for Audio, Speech and Language

  5. ASR Error Correction and Domain Adaptation using Machine Translation
    Anirudh Mani*, Shruti Palaskar*, Nimshi Venkat Meripo, Sandeep Konam and Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020

2019

  1. Multimodal Abstractive Summarization for How2 Videos
    Shruti Palaskar, Jindrich Libovický, Spandana Gella, Florian Metze
    Association for Computational Linguistics (ACL), 2019

  2. Learned in Speech Recognition: Contextual Acoustic Word Embeddigs
    Shruti Palaskar*, Vikas Raunak*, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  3. Learning from Multiview Correlations in Open-Domain Videos
    Nils Holzenberger*, Shruti Palaskar*, Pranava Madhyastha, Florian Metze, Raman Arora
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  4. Multimodal Grounding for Sequence-to-Sequence Speech Recognition
    Ozan Caglayan, Ramon Sanabria, Shruti Palaskar, Loïc Barrault, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2019

  5. CMU Sinbad’s Submission to the DSTC7 AVSD Track
    Ramon Sanabria*, Shruti Palaskar*, Florian Metze
    Oral Presentation
    7th Dialog State Tracking Challenge (DSTC) at AAAI, 2019

2018

  1. Acoustic to Word Recognition with Sequence to Sequence Models
    Shruti Palaskar and Florian Metze
    IEEE Workshop on Spoken Language Technology (SLT), 2018

  2. How2: A Large-scale Dataset for Multimodal Language Understanding
    Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze
    NeurIPS workshop on Visually Grounded Interaction and Language (ViGIL), 2018

  3. Multimodal Abstractive Summarization for Open-Domain Videos
    Jindrich Libovicky, Shruti Palaskar, Spandana Gella, Florian Metze
    Spotlight Presentation
    NeurIPS workshop on Visually Grounded Interaction and Language (ViGIL), 2018

  4. End-to-End Multimodal Speech Recognition
    Shruti Palaskar*, Ramon Sanabria*, Florian Metze
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

  5. Linguistic unit discovery from multi-modal inputs in unwritten languages: Summary of the “Speaking Rosetta” JSALT 2017 Workshop
    Odette Scharenborg, Laurent Besacier, Alan Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux
    International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2018

2017

  1. Building an asr system for a low-resource language through the adaptation of a high-resource language asr system: Preliminary results
    Odette Scharenborg, Francesco Ciannella, Shruti Palaskar, Alan Black, Florian Metze, Lucas Ondel, Mark Hasegawa-Johnson
    ICNLSSP, 2017

  2. Combining LSTM and Latent Topic Modeling for Mortality Prediction
    Yohan Jo, Lisa Lee, Shruti Palaskar
    Preprint, 2017

* - Equal contribution