Shutong Niu scite author profile

This system description describes our submission system to the Third DIHARD Speech Diarization Challenge. Besides the traditional clustering based system, the innovation of our system lies in the combination of various front-end techniques to solve the diarization problem, including speech separation and target-speaker based voice activity detection (TS-VAD), combined with iterative data purification. We also adopted audio domain classification to design domain-dependent processing. Finally, we performed post processing to do system fusion and selection. Our best system achieved DERs of 11.30% in track 1 and 16.78% in track 2 on evaluation set, respectively.

show abstract

A Two-Stage Approach to Device-Robust Acoustic Scene Classification

Yang

Xia

et al. 2021

View full text Add to dashboard Cite

To improve device robustness, a highly desirable key feature of a competitive data-driven acoustic scene classification (ASC) system, a novel two-stage system based on fully convolutional neural networks (CNNs) is proposed. Our two-stage system leverages on an ad-hoc score combination based on two CNN classifiers: (i) the first CNN classifies acoustic inputs into one of three broad classes, and (ii) the second CNN classifies the same inputs into one of ten finergrained classes. Three different CNN architectures are explored to implement the two-stage classifiers, and a frequency sub-sampling scheme is investigated. Moreover, novel data augmentation schemes for ASC are also investigated. Evaluated on DCASE 2020 Task 1a, our results show that the proposed ASC system attains a state-of-theart accuracy on the development set, where our best system, a twostage fusion of CNN ensembles, delivers a 81.9% average accuracy among multi-device test data, and it obtains a significant improvement on unseen devices. Finally, neural saliency analysis with class activation mapping (CAM) gives new insights on the patterns learnt by our models.

show abstract

A smart city used low-latency seamless positioning system based on inverse global navigation satellite system technology

Zou

Niu

Chen

et al. 2019

International Journal of Distributed Sensor Networks

View full text Add to dashboard Cite

People have to move between indoor and outdoor frequently in city scenarios. The global navigation satellite system signal cannot provide reliable indoor positioning services. To solve the problem, this article proposes a seamless positioning system based on an inverse global navigation satellite system signal, which can extend the global navigation satellite system service into the indoor scenario. In this method, a signal source is arranged at a key position in the room, and the inverse global navigation satellite system signal is transmitted to the global navigation satellite system receiver to obtain a preset positioning result. The indoor positioning service is continued with the inertial navigation system after leaving the key position. The inverse global navigation satellite system seamless positioning system proposed in this article can unify indoor and outdoor positioning using the same receiver. The receiver does not need to re-receive navigation information when the scene changes, which avoids the switching process. Through the design of signal layer coverage, the receiver is in a warm start state, and the users can quickly fix the position when the scenario changes, realizing quick access in a true sense. This enables the ordinary commercial global navigation satellite system receiver to obtain indoor positioning capability without modification, and the algorithm can perform accurate positioning indoors and outdoors without switching.

show abstract

Scenario-Dependent Speaker Diarization for DIHARD-III Challenge

Wang

et al. 2021

View full text Add to dashboard Cite

QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization

Niu

Sun

et al. 2023

IEEE/ACM Trans. Audio Speech Lang. Process.

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Shutong Niu

USTC-NELSLIP System Description for DIHARD-III Challenge

A Two-Stage Approach to Device-Robust Acoustic Scene Classification

A smart city used low-latency seamless positioning system based on inverse global navigation satellite system technology

Scenario-Dependent Speaker Diarization for DIHARD-III Challenge

QDM-SSD: Quality-Aware Dynamic Masking for Separation-Based Speaker Diarization

Contact Info

Product

Resources

About