Random Projections for Non-linear Dimensionality Reduction

Cheng, Long; You, Chenyu; Guan, Yani

doi:10.18178/ijmlc.2016.6.4.601

Cited by 16 publications

(2 citation statements)

References 23 publications

“…In other words, we can distill the knowledge from one model (massive or teacher model) to another (small or student model). Previous work has shown that KD can significantly boost prediction accuracy in natural language processing and speech processing (Kim and Rush, 2016; Hu et al, 2018; Huang et al, 2018b; Hahn and Choi, 2019; Liu et al, 2021b,a; Cheng et al, 2016b; Cheng and You, 2016; Cheng et al, 2016a; You et al, 2020b, 2021e, 2022b, 2019a; Lyu et al, 2018, 2019; Guha et al, 2020; Yang et al, 2020; Ma et al, 2021a,b), while adopting KD-based methods for SQA tasks has been less explored. In this work, our goal is to handle the SCQA tasks.…”

Section: Spoken Question Answering (mentioning)

confidence: 99%

End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

You, Liu, Ge et al. 2022

Preprint

Self Cite

In spoken question answering, systems are designed to answer questions from contiguous text spans within the related speech transcripts. However, the most natural way that humans seek or test their knowledge is via conversation. Therefore, we propose a new Spoken Conversational Question Answering task (SCQA), aiming at enabling systems to model complex dialogue flows given the speech documents. In this task, our main objective is to build a system that handles conversational questions based on the audio recordings, and to explore the plausibility of providing systems with more cues from different modalities during information gathering. To this end, instead of directly adopting automatically generated speech transcripts with highly noisy data, we propose a novel unified data distillation approach, DDNET, which effectively ingests cross-modal information to achieve fine-grained representations of the speech and language modalities. Moreover, we propose a simple and novel mechanism, termed Dual Attention, which encourages better alignment between audio and text to ease the process of knowledge transfer. To evaluate the capacity of SCQA systems in a dialogue-style interaction, we assemble a Spoken Conversational Question Answering (Spoken-CoQA) dataset with more than 40k question-answer pairs from 4k conversations. The performance of existing state-of-the-art methods degrades significantly on our dataset, demonstrating the necessity of cross-modal information integration. Our experimental results demonstrate that our proposed method achieves superior performance in spoken conversational question answering tasks.

“…- Non-linear Random Projections: RP-based techniques have been used to capture non-linear features in a compact representation. Approaches range from RP-based preprocessing for existing non-linear dimensionality reduction methods [22] to ad-hoc variants for non-linear kernel functions [5,23]. - Structured Johnson-Lindenstrauss: Following the work of [24], structured JL methods try to approximate the result of a traditional RP by decomposing the projection matrix into a set of low-memory matrices [25,26].…”

Section: Random Projection Variants (mentioning)

confidence: 99%

Tuning Database-Friendly Random Projection Matrices for Improved Distance Preservation on Specific Data

López-Sánchez, Bodt, Lee et al. 2021

Appl Intell

Random Projection is one of the most popular and successful dimensionality reduction algorithms for large volumes of data. However, given its stochastic nature, different initializations of the projection matrix can lead to very different levels of performance. This paper presents a guided random search algorithm to mitigate this problem. The proposed method uses a small number of training data samples to iteratively adjust a projection matrix, improving its performance on similarly distributed data. Experimental results show that projection matrices generated with the proposed method result in a better preservation of distances between data samples. Conveniently, this is achieved while preserving the database-friendliness of the projection matrix, as it remains sparse and comprised exclusively of integers after being tuned with our algorithm. Moreover, running the proposed algorithm on a consumer-grade CPU requires only a few seconds.
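
The idea in the abstract above can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not specify the search procedure, so the code below substitutes a plain hill-climbing search that resamples one matrix entry at a time and keeps the change only if the distance distortion on a small training sample does not get worse. The sparse integer matrix follows the well-known database-friendly construction (entries -1, 0, +1 with probabilities 1/6, 2/3, 1/6, scaled by sqrt(3/k)). All function names (`sparse_projection_matrix`, `project`, `distortion`, `tune`) and the distortion measure are assumptions made for illustration.

```python
import math
import random


def sparse_projection_matrix(d, k, rng):
    """Return a k x d matrix with entries -1, 0, +1 drawn with probabilities 1/6, 2/3, 1/6."""
    return [
        [rng.choices((-1, 0, 1), weights=(1, 4, 1))[0] for _ in range(d)]
        for _ in range(k)
    ]


def project(R, x):
    """Project vector x with the integer matrix R, then apply the sqrt(3/k) scaling."""
    scale = math.sqrt(3.0 / len(R))
    return [scale * sum(r * v for r, v in zip(row, x)) for row in R]


def distortion(R, samples):
    """Mean relative error of pairwise squared distances after projection (illustrative metric)."""
    projected = [project(R, x) for x in samples]
    total, pairs = 0.0, 0
    for i in range(len(samples)):
        for j in range(i + 1, len(samples)):
            orig = sum((a - b) ** 2 for a, b in zip(samples[i], samples[j]))
            new = sum((a - b) ** 2 for a, b in zip(projected[i], projected[j]))
            if orig > 0:
                total += abs(new - orig) / orig
                pairs += 1
    return total / pairs if pairs else 0.0


def tune(R, samples, steps, rng):
    """Hill-climbing stand-in for a guided search: resample one entry, keep it if distortion does not increase."""
    best = distortion(R, samples)
    for _ in range(steps):
        i, j = rng.randrange(len(R)), rng.randrange(len(R[0]))
        old = R[i][j]
        # Resample from the same sparse distribution so the matrix stays sparse and integer-valued.
        R[i][j] = rng.choices((-1, 0, 1), weights=(1, 4, 1))[0]
        score = distortion(R, samples)
        if score <= best:
            best = score
        else:
            R[i][j] = old  # revert a change that made distance preservation worse
    return R, best


if __name__ == "__main__":
    rng = random.Random(0)
    train = [[rng.gauss(0, 1) for _ in range(20)] for _ in range(6)]
    R = sparse_projection_matrix(d=20, k=5, rng=rng)
    before = distortion(R, train)
    R, after = tune(R, train, steps=50, rng=rng)
    print(f"distortion before: {before:.3f}, after: {after:.3f}")
```

Because every accepted change draws from the same three-valued distribution, the tuned matrix stays sparse and integer-valued, which is the property the abstract describes as database-friendliness. Whether the improvement carries over to unseen data depends on how closely that data matches the training sample.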
