Publications

You can also find my articles on Google Scholar.

Attentive Recurrent Network for Low-Latency Active Noise Control

Published in INTERSPEECH, 2022

Cite as: H. Zhang, A. Pandey and D. L. Wang, "Attentive Recurrent Network for Low-Latency Active Noise Control," in proceedings of INTERSPEECH, 2022, pp. 956-960.

Attentive Training: A New Training Framework for Talker-independent Speaker Extraction

Published in INTERSPEECH, 2022

Download

Cite as: A. Pandey and D. L. Wang, "Attentive Training: A New Training Framework for Talker-independent Speaker Extraction," in proceedings of INTERSPEECH, 2022, pp. 201-205.

Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network

Published in INTERSPEECH, 2022

Download

Cite as: A. Pandey, B. Xu, A. Kumar, J. Donley, P. Calamia, and D. L. Wang, "Time-domain Ad-hoc Array Speech Enhancement Using a Triple-path Network," in proceedings of INTERSPEECH, 2022, pp. 729-733.

Multichannel Speech Enhancement Without Beamforming

Published in ICASSP, 2022

Download

Cite as: A. Pandey, B. Xu, A. Kumar, J. Donley, P.Calamia, and D. L. Wang, "Multichannel Speech Enhancement Without Beamforming," in proceedings of ICASSP, 2022, pp. 6502-6506.

TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement

Published in ICASSP, 2022

Download

Cite as: A. Pandey, B. Xu, A. Kumar, J. Donley, P.Calamia, and D. L. Wang, "TPARN: Triple-Path Attentive Recurrent Network for Time-Domain Multichannel Speech Enhancement," in proceedings of ICASSP, 2022, pp. 6497-6501.

Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization

Published in IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021

Download

Cite as: A. Pandey and D. L. Wang, "Self-attending RNN for Speech Enhancement to Improve Cross-corpus Generalization," in IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 30, pp. 1374-1385, 2022.

Dense CNN with Self-Attention for Time-Domain Speech Enhancement

Published in IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021

Download

Cite as: A. Pandey and D. L. Wang, "Dense CNN with Self-Attention for Time-Domain Speech Enhancement," in IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 29, pp. 1270-1279, 2021.

Dual Application of Speech Enhancement for Automatic Speech Recognition

Published in Workshop on Spoken Language Technology, 2021

Download

Cite as: A. Pandey, C. Liu, Y. Wang and Y. Saraf, "Dual Application of Speech Enhancement for Automatic Speech Recognition," in Workshop on Spoken Language Technology, 2021, pp. 223-228.

Dual-path Self-Attention RNN for Real-Time Speech Enhancement

Published in arXiv, 2020

Download

Cite as: A. Pandey and D. L. Wang, "Dual-path Self-Attention RNN for Real-Time Speech Enhancement," arXiv:2010.12713, 2020.

On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement

Published in IEEE/ACM Transactions on Audio Speech and Language Processing, 2020

Download

Cite as: A. Pandey and D. L. Wang, "On Cross-Corpus Generalization of Deep Learning Based Speech Enhancement," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, pp. 2489-2499, 2020.

Learning Complex Spectral Mapping for Speech Enhancement with Improved Cross-corpus Generalization

Published in INTERSPEECH, 2020

Download

Cite as: A. Pandey and D. L. Wang, "Learning Complex Spectral Mapping for Speech Enhancement with Improved Cross-Corpus Generalization,", in proceedings of INTERSPEECH, 2020, pp. 4511-4515.

Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in the Time Domain

Published in ICASSP, 2020

Download

Cite as: A. Pandey and D. L. Wang, "Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in the Time Domain,", in proceedings of ICASSP, 2020, pp. 6629-6633.

A New Framework for CNN Based Speech Enhancement in the Time Domain

Published in IEEE/ACM Transactions on Audio Speech and Language Processing, 2019

Download

Cite as: A. Pandey and D. L. Wang, "A New Framework for CNN-Based Speech Enhancement in the Time Domain," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, pp. 1179-1188, 2019.

Exploring Deep Complex Networks for Complex Spectrogram Enhancement

Published in ICASSP, 2019

Download

Cite as: A. Pandey and D. L. Wang, "Exploring Deep Complex Networks for Complex Spectrogram Enhancement," in proceedings of ICASSP, 2019, pp. 6885-6889.

TCNN: Temporal Convolutional Neural Network for Real-Time Speech Enhancement in the Time Domain

Published in ICASSP, 2019

Download

Cite as: A. Pandey and D. L. Wang, "TCNN: Temporal Convolutional Neural Network for Real-time Speech Enhancement in the Time Domain," in proceedings of ICASSP, 2019, pp. 6875-6879.

A New Framework for Supervised Speech Enhancement in the Time Domain

Published in INTERSPEECH, 2018

Download

Cite as: A. Pandey and D. L. Wang, "A New Framework for Supervised Speech Enhancement in the Time Domain," in proceedings of INTERSPEECH, 2018, pp. 1136-1140.

On Adversarial Training and Loss Functions for Speech Enhancement

Published in ICASSP, 2018

Download

Cite as: A. Pandey and D. L. Wang, "On Adversarial Training and Loss Functions for Speech Enhancement", in proceedings of ICASSP, 2018, pp. 5414-5418.

Significance of Glottal Activity Detection for Speaker Verification in Degraded and Limited Data Condition

Published in TENCON, 2015

Download

Cite as: A. Pandey, R. K. Das, N. Adiga, N. Gupta and S. R. M. Prasanna, "Significance of Glottal Activity Detection for Speaker Verification in Degraded and Limited Data Condition," in proceedings of TENCON, 2015, pp. 1-6.

Ashutosh Pandey

Publications