“…Whereas, the second model aims at accommodating the desired task such as question answer, document classification or ranking. However, multiple recent researches showed that BERT architecture has non-outstanding performance on AES task compared to techniques ( [16]; [30]; [43]). Although BERT showed magnificent performance in problems like question answering, its architecture failed to give an accurate scoring for an answer.…”