2017
DOI: 10.1145/3012004
|View full text |Cite
|
Sign up to set email alerts
|

The Challenge of Test Data Quality in Data Processing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
3
2
1

Relationship

2
4

Authors

Journals

citations
Cited by 8 publications
(7 citation statements)
references
References 13 publications
0
7
0
Order By: Relevance
“…e development of high quality test datasets is a key concern for any kind of data processing evaluation [9] [5]. is has been a well recognized challenge for years in domains such as digital preservation [22].…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations
“…e development of high quality test datasets is a key concern for any kind of data processing evaluation [9] [5]. is has been a well recognized challenge for years in domains such as digital preservation [22].…”
Section: Discussionmentioning
confidence: 99%
“…Other datasets with ground truth include CleanEval 4 and L3S-GN1 5 . Unfortunately, these datasets cover only web page documents and are tailored for narrow needs of distinguishing real web text content from boilerplate text.…”
Section: Background 21 Text Extraction Evaluation and Test Datasetsmentioning
confidence: 99%
See 2 more Smart Citations
“…In order to establish whether a piece of DF software functions correctly, it must be validated, where Guo et al's., (2009, p.13) definition states 'validation is the confirmation by examination and the provision of objective evidence that a tool, technique or procedure functions correctly and as intended'. In essence, a level of 'functional correctness' needs to be determined and assessed before a tool should be utilised in a live investigation (SWGDE, 2014;SWGDE, 2017;Becker et al, 2017), yet in practice it can be difficult to do this. The Forensic Science Regulator (2015) provides direction in the form of guidelines designed to support the the development of methods for validating forensic software with acknowledgement of the need to embed validation into laboratory practices in order to adhere to the ISO 17025 standard.…”
Section: Digital Forensic Softwarementioning
confidence: 99%