2018
DOI: 10.1007/978-3-319-76941-7_65
|View full text |Cite
|
Sign up to set email alerts
|

Towards Maximising Openness in Digital Sensitivity Review Using Reviewing Time Predictions

Abstract: Abstract. The adoption of born-digital documents, such as email, by governments, such as in the UK and USA, has resulted in a large backlog of born-digital documents that must be sensitivity reviewed before they can be opened to the public, to ensure that no sensitive information is released, e.g. personal or confidential information. However, it is not practical to review all of the backlog with the available reviewing resources and, therefore, there is a need for automatic techniques to increase the number o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
2
1

Relationship

4
1

Authors

Journals

citations
Cited by 5 publications
(5 citation statements)
references
References 9 publications
0
5
0
Order By: Relevance
“…We note that the output of our proposed prioritisation strategies that rank clusters by their sensitivity classification probabilities can be dependent on the effectiveness of the deployed sensitivity classifier. Moreover, reviewing shorter documents first may not always lead to a faster review of documents, as is described by McDonald et al [14]. In this study, we control these variables as constant across all the aforementioned review prioritisation strategies to isolate the effectiveness of our proposed Cluster+Metadata approach.…”
Section: User Study#2: Review Opennessmentioning
confidence: 99%
See 1 more Smart Citation
“…We note that the output of our proposed prioritisation strategies that rank clusters by their sensitivity classification probabilities can be dependent on the effectiveness of the deployed sensitivity classifier. Moreover, reviewing shorter documents first may not always lead to a faster review of documents, as is described by McDonald et al [14]. In this study, we control these variables as constant across all the aforementioned review prioritisation strategies to isolate the effectiveness of our proposed Cluster+Metadata approach.…”
Section: User Study#2: Review Opennessmentioning
confidence: 99%
“…Differently from the work of Hutchison, in this work, we study document clustering for sensitivity review in an interactive setting to evaluate the impact of reviewing documents in semantic clusters, on the efficiency, accuracy, and openness of sensitivity review. McDonald et al [14] proposed a method of predicting the amount of time that a reviewer would need to review a collection of documents and show that their proposed method can notably improve openness by prioritising the documents that are predicted to take less time to review. In contrast, in this work, we present a method of prioritising groups of documents for review to maximise openness.…”
Section: Related Workmentioning
confidence: 99%
“…For example, the details of an employee's salary are more likely to be sensitive in documents about business discussions than mentions of salaries in documents about political discussions, since politicians' salaries are usually in the public domain. Prioritising particular groups of related documents for review can also help to increase the number of documents that can be opened to the public when there are limited reviewing resources [9] (i.e., openness [6]). However, in large unstructured document collections, it is not practical for reviewers to manually identify such groups of related documents.…”
Section: Document Collec�on Document Collec�onmentioning
confidence: 99%
“…For example, by prioritising not-sensitive documents to increase the number of non-sensitive documents that can be released to the public with limited reviewing resources [McDonald et al 2018]. In this work, we focus on evaluating if providing the reviewers with sensitivity classification predictions can reduce the time that it takes for a reviewer to review a collection of documents, while maintaining (or increasing) the reviewing accuracy.…”
Section: Technology-assisted Sensitivity Reviewmentioning
confidence: 99%