“…We used the following features as a baseline. These features were used in Named Entities Recognition task during our previous research Marcińczuk and Kocoń (2013); Marcińczuk et al (2013). - Morphosyntactic – lemma, grammatical class, case, number, gender.
- Orthographic – word, word shape (pattern), prefix, suffix, starts with upper case, starts with lower case, starts with symbol, starts with digit, has upper case, has symbol, has digit.
- Semantic – word synonym, hypernym.
- Dictionary – person first name, person last name, country name, city name, road name, person prefix, country prefix, person noun, person suffix, road prefix, specific triggers (country, district, geographic name, organisation name, person name, region, settlement).
…”