A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

Fast facts

  • Internal authorship

  • Further authors

    • Tongzheng Liu
    • Zhihua Lu
    • João Paulo J. da Costa
  • Publication year

    • 2023
  • Volume

    31

  • Journal

    IEEE/ACM Transactions on Audio, Speech, and Language Processing

  • Organizational unit

  • Subjects

    • Communication and information technology
  • Publication format

    Journal article (Article)

Citation

Liu, Tongzheng et al. 2023. A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation. IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 3000-3014.

Content

This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
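
The whitening step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the prior spatial coherence matrix follows the common spherically isotropic (diffuse-field) model, in which the coherence between two microphones at distance d and frequency f is sinc(2fd/c), and it whitens the observations with a Cholesky factor of that matrix. The function names, the diagonal loading value, and the array geometry are hypothetical and chosen only for illustration.

```python
import numpy as np


def diffuse_coherence(mic_positions, freq_hz, speed_of_sound=343.0):
    """Spatial coherence matrix of a spherically isotropic diffuse field.

    mic_positions: array of shape (n_mics, 3), in metres.
    Assumption: the prior coherence follows the diffuse-field sinc model.
    """
    diffs = mic_positions[:, None, :] - mic_positions[None, :, :]
    dists = np.linalg.norm(diffs, axis=-1)
    # np.sinc(x) computes sin(pi * x) / (pi * x), so this equals
    # sin(2 * pi * f * d / c) / (2 * pi * f * d / c).
    return np.sinc(2.0 * freq_hz * dists / speed_of_sound)


def whiten(observations, coherence, loading=1e-6):
    """Whiten multichannel observations of shape (n_mics, n_frames).

    A signal whose spatial covariance is proportional to `coherence`
    has (approximately) identity-proportional covariance after whitening.
    The small diagonal loading keeps the Cholesky factorization stable.
    """
    n_mics = coherence.shape[0]
    chol = np.linalg.cholesky(coherence + loading * np.eye(n_mics))
    # Solve chol @ y = observations instead of forming an explicit inverse.
    return np.linalg.solve(chol, observations)


if __name__ == "__main__":
    # Hypothetical 4-microphone linear array with 5 cm spacing.
    positions = np.array([[0.05 * m, 0.0, 0.0] for m in range(4)])
    gamma = diffuse_coherence(positions, freq_hz=1000.0)

    rng = np.random.default_rng(0)
    frames = rng.standard_normal((4, 200)) + 1j * rng.standard_normal((4, 200))
    whitened = whiten(frames, gamma)
    print(whitened.shape)
```

In a frequency-domain pipeline the same operation would be repeated for every frequency bin, since the coherence matrix depends on frequency; how the whitened observations are then passed to the source model is specific to the paper and is not reproduced here.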
