“…Major handwritten datasets include IAM [286], NIST [271], MNIST [277], CEDAR [261], RIMES [313], [314], UNIPEN , CENPARMI-Arabic [340] PE92 [410], etc. The datasets developed are mostly in languages like English like IAM, CEDAR, NIST, MNIST, IAM-OnDB, etc., Arabic AHDB, ARABASE, CENPARMI-A, LMCA, KHATT, CENPARMI-F etc., Chinese HCL2000, CASIA, SCUT-COUCH, etc., Indian languages like Bangla: BN-HTRd, Numerals DB, Devanagari DB, Multiscript Indian DB (Bangla, Devanagari, Tamil, Telugu) [217], Multiscript DB 11 scripts (Roman, Devanagari, Urdu, Kannada, Oriya, Gujarati, Bangla, Gurumukhi, Tamil, Telugu, Malayalam ) [404] The traditional tasks for DAR, supported by most datasets, are pre-processing, segmentation and recognition. Other tasks like DLA, word spotting, and forensic document analysis (WI and verification) have very few datasets concerning them.…”