“…Nevertheless the adaptation of NER methods to conversational speech remains challenging due to, for example, case insensitivity, lack of punctuations, un-grammatical structure, repetition, and presence of disfluencies inherent to conversations. In addition, there is not much spoken data annotated with named entities to cover the huge variety of named entity instances likely occurring in speech, and simply increasing the amount of manual annotation is not realistic for reasons of cost, evolution of new spoken terms and diversity.Several works on NER from spoken contents have already explored the use of external resources like online gazetteers[73] and Wikipedia[74] to overcome the lack of annotations. Gazetteers, for instance, have successfully boosted NER performance for given entities (e.g., Location), but do not convey the information related to the context words surrounding the entity names that are also important for NER.…”