“…Different from a traditional entity-typing task that typically classifies entities into coarse-grained types (e.g., person, location, organization ) [ 3 , 4 ], FET aims to assign an entity with more specific types [ 5 , 6 ], which usually follow a hierarchical structure that can provide more semantic information about the entity [ 7 , 8 ], such as /person/politician , /book/author , etc. FET is a significant subtask of named-entity recognition (NER) [ 9 ] for downstream natural language processing (NLP) applications, such as relation extraction [ 10 , 11 ], question answering [ 12 , 13 ], knowledge base population [ 14 ], and recommendation [ 15 , 16 ].…”