Share valuations are known to adjust to new information entering the market, such as regulatory disclosures. We study whether the language of such news items can improve short-term and especially long-term (24 months) forecasts of stock indices.For this purpose, this work utilizes predictive models suited to high-dimensional data and specifically compares techniques for data-driven and knowledge-driven dimensionality reduction in order to avoid overfitting. Our experiments, based on 75,927 ad hoc announcements from 1996-2016, reveal the following results: in the long run, text-based models succeed in reducing forecast errors below baseline predictions from historic lags at a statistically significant level. Our research provides implications to business applications of decision-support in financial markets, especially given the growing prevalence of index ETFs (exchange traded funds). reflect publicly-available information. Based on this premise, one can expect price changes whenever new information enters the market. In practice, regulations ensure that stock-relevant information is revealed primarily via regulatory disclosures in order to provide equal access for all market participants. Such materials disclose, for instance, quarterly earnings, but also management changes, legal risks and other events deemed important [2]. Accordingly, financial disclosures present an alluring and potentially financially-rewarding means of forecasting changes in stock valuations [3].In this respect, corporate news conveys a broad spectrum of information concerning the past performance and current challenges of the business [4], as well as frequently hinting at the future outlook. Research has followed this reasoning and empirically quantified the impact of the narrative content on the subsequent stock market responses [cf. 5, 6, 7]. Moreover, researchers have also demonstrated the prognostic capability of financial disclosures with respect to individual stock market returns in the short term [e. g. 8, 9, 10]. Accordingly, news-based forecasting has received considerable traction and, as a result, various publications have evaluated different news datasets, forecasted indicator/markets, preprocessing operations from the field of natural language processing and forecasting algorithms.Here we refer to the literature, which provides a thorough overview [3].Forecasting the development of stock indices is highly demanded by multiple stakeholders in financial markets. The underlying reason is that households are investing their money no only in individual stocks, government bonds or savings accounts; rather, they increasingly prefer exchange-traded funds (ETFs). These ETFs replicate the movements of marketable securities, with stock indices being the most prominent example. As part of their benefits, ETFs are traded on stock exchanges but often with higher liquidity and lower fees. Hence, private investors demand for decision support in better understanding the development of markets, as well as for obtaining prognostic support. For ...