This system description describes our submission system to the Third DIHARD Speech Diarization Challenge. Besides the traditional clustering based system, the innovation of our system lies in the combination of various front-end techniques to solve the diarization problem, including speech separation and target-speaker based voice activity detection (TS-VAD), combined with iterative data purification. We also adopted audio domain classification to design domain-dependent processing. Finally, we performed post processing to do system fusion and selection. Our best system achieved DERs of 11.30% in track 1 and 16.78% in track 2 on evaluation set, respectively.
To improve device robustness, a highly desirable key feature of a competitive data-driven acoustic scene classification (ASC) system, a novel two-stage system based on fully convolutional neural networks (CNNs) is proposed. Our two-stage system leverages on an ad-hoc score combination based on two CNN classifiers: (i) the first CNN classifies acoustic inputs into one of three broad classes, and (ii) the second CNN classifies the same inputs into one of ten finergrained classes. Three different CNN architectures are explored to implement the two-stage classifiers, and a frequency sub-sampling scheme is investigated. Moreover, novel data augmentation schemes for ASC are also investigated. Evaluated on DCASE 2020 Task 1a, our results show that the proposed ASC system attains a state-of-theart accuracy on the development set, where our best system, a twostage fusion of CNN ensembles, delivers a 81.9% average accuracy among multi-device test data, and it obtains a significant improvement on unseen devices. Finally, neural saliency analysis with class activation mapping (CAM) gives new insights on the patterns learnt by our models.
People have to move between indoor and outdoor frequently in city scenarios. The global navigation satellite system signal cannot provide reliable indoor positioning services. To solve the problem, this article proposes a seamless positioning system based on an inverse global navigation satellite system signal, which can extend the global navigation satellite system service into the indoor scenario. In this method, a signal source is arranged at a key position in the room, and the inverse global navigation satellite system signal is transmitted to the global navigation satellite system receiver to obtain a preset positioning result. The indoor positioning service is continued with the inertial navigation system after leaving the key position. The inverse global navigation satellite system seamless positioning system proposed in this article can unify indoor and outdoor positioning using the same receiver. The receiver does not need to re-receive navigation information when the scene changes, which avoids the switching process. Through the design of signal layer coverage, the receiver is in a warm start state, and the users can quickly fix the position when the scenario changes, realizing quick access in a true sense. This enables the ordinary commercial global navigation satellite system receiver to obtain indoor positioning capability without modification, and the algorithm can perform accurate positioning indoors and outdoors without switching.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.