“…They include HTML structural analysis, natural language processing, machine learning, data modeling, and ontology. For citation domain, numerous works reported in the literature, e.g., [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23] use similar concepts to extract metadata from citations. The approaches can be roughly classified into two categories: learning-based and knowledge-based approaches.…”