2020
DOI: 10.3233/sji-200627

Detecting innovative companies via their website

Abstract: Producing an overview of innovative companies in a country is a challenging task. Traditionally, this is done by sending a questionnaire to a sample of companies. This approach, however, usually only focuses on large companies. We therefore investigated an alternative approach: determining if a company is innovative by studying the text on its website. For this task a model was developed based on the texts of the websites of companies included in the Community Innovation Survey of the Netherlands. The latter i…

Cited by 21 publications (27 citation statements)
References 16 publications
“…Many of the projects presented by Beck et al (2018) that use machine learning for classification, were intended to measure variables that could only be measured by employing machine learning algorithms. A typical example is identifying small innovative companies (too small to be included in traditional surveys), using data from their websites (Daas and Van der Doef 2020). Therefore, we aim for the best of Breiman's two cultures, complementing statistical quality with information quality.…”
Section: A Paradigm Shift In Official Statistics (mentioning)
confidence: 99%
“…Although this type of statistical output is rather uncomplicated, we stress that it occurs in a wide variety of applications, also outside official statistics. Examples include counting solar panels (Curier et al 2018), estimating the relative occurrence of small innovative companies (Daas and Van der Doef 2020), measuring deforestation and other applications of land cover mapping (Costa et al 2018), and predicting election outcomes based on sentiment analysis (O'Connor et al 2010). Moreover, results for estimating the base rate generalise to more complicated statistical output.…”
Section: Statistical Output Quality and Misclassification Bias (mentioning)
confidence: 99%
“…By adding the possibility to analyze also unstructured data without a predefined search realm, however, the advancements in big data and AI create unprecedented opportunities to develop data‐supported and meaningful insights in future technology and innovation trends and their potential societal impact. That is, due to the rapid development of computing capacity, algorithms to analyze patterns in unstructured data sources—such as natural language processing (NLP), topic‐modeling such as latent Dirichlet allocation (LDA), and deep learning methods based on artificial neural networks—are increasingly available (Daas & van der Doef, 2020; LeCun et al, 2015; Mühlroth & Grottke, 2020; Porter, 2019). Recently, a number of proposals have been made on how to mobilize this potential in foresight, especially for identifying early signals of emerging changes (e.g., Krigsholm & Riekkinen, 2019; Lee & Park, 2018; Mühlroth & Grottke, 2018; 2020).…”
Section: New Perspectives On Data‐supported Foresight—why It Is Neede... (mentioning)
confidence: 99%
“…Another example is detecting small innovative companies using text data from their websites [2]. Statistics Netherlands sends out surveys to collect information on innovation in companies but these do not include the smaller innovative companies.…”
Section: Innovation and Big Data At Cbs (mentioning)
confidence: 99%