“…The dysarthria information, as a speaker characteristic, should be extracted to speaker variable $z_2^{i,n}$ by the FHVAE. However, our previous work [16] found that the FHVAE does not separate the dysarthria and content information and speech impairment is identifiable from $z_1^{i,n}$. To obtain dysarthria-invariant features $z_2^{i,n}$, inspired by [17], we introduce adversarial training into the FHVAE model.…”