“…Providing top-down awareness on a corpus level (i.e., distant reading techniques) enables not only to inform corpus trends and high-level patterns but also to provide entry points for further exploration [3,22]. Common approaches include topics [3,12,14,22,26], extracted entities [14,22,33,38], relevant keywords [13,15,27], and aggregate statistics over terms and metadata [15], sometimes organized over time [13,14,27]. These elements are often interactive and serve as content filters, linking back to particular text mentions.…”