We present MooseNet, a trainable speech metrics that predict listeners' Mean Opinion Score (MOS). We report improvements to the challenge baselines using easy-to-use modeling techniques which also scales for larger self-supervised learning (SSL) model. We present two models. The first model is a Neural Network (NN). As a second model, we propose a PLDA generative model on top layers of the first NN model, which improves the pure NN model. Ensembles from our two models achieve the top 3 or 4 VoiceMOS leaderboard place on all system and utterance level metrics for both main and OOD tracks. 1