2018
DOI: 10.36001/phmconf.2018.v10i1.541
|View full text |Cite
|
Sign up to set email alerts
|

Benchmarking for Keyword Extraction Methodologies in Maintenance Work Orders

Abstract: Maintenance has largely remained a human-knowledge centered activity, with the primary records of activity being textbased maintenance work orders (MWOs). However, the bulk of maintenance research does not currently attempt to quantify human knowledge, though this knowledge can be rich with useful contextual and system-level information. The underlying quality of data in MWOs often suffers from misspellings, domain-specific (or even workforce specific) jargon, and abbreviations, that prevent its immediate use … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
28
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 41 publications
(29 citation statements)
references
References 10 publications
0
28
0
1
Order By: Relevance
“…A number of different named entity recognition classes are being used for MWO annotation: item-activity-state 10 and item-problem/symptom-solution/action. 23,24 The familiar assumptions of NLP often mislead in the analysis of maintenance text. For example, although the overall number of maintenance records can be similar to the number of documents in an NLP corpora, the MWO text tends to be much smaller (Table 2).…”
Section: Maintenance Text Challengesmentioning
confidence: 99%
See 2 more Smart Citations
“…A number of different named entity recognition classes are being used for MWO annotation: item-activity-state 10 and item-problem/symptom-solution/action. 23,24 The familiar assumptions of NLP often mislead in the analysis of maintenance text. For example, although the overall number of maintenance records can be similar to the number of documents in an NLP corpora, the MWO text tends to be much smaller (Table 2).…”
Section: Maintenance Text Challengesmentioning
confidence: 99%
“…Although for many use cases, a rules-based approach can handle the presence of zero, one, or multiple labels on a single MWO, this can challenge supervised learning approaches 23,30 and performance depends on the handling of class imbalance. Seale 44 handled the challenge of 1200 different component classes by injecting additional information relevant to the physical systems into the model training systems through "privileged information" which is a form of fortuitous data.…”
Section: Making Use Of Fortuitous Datamentioning
confidence: 99%
See 1 more Smart Citation
“…Keyword spotting, as one of the document image analysis techniques, includes a systematic methodology and framework to facilitate this transformation and help to create computable knowledge. This technique searches for a known vocabulary in a document image dataset and maps them to higher‐level concepts created by indexing the document images and creating a dictionary of domain‐specific terms and the knowledge they represent (Sexton et al, 2018). As an important category of approaches for knowledge discovery and mining, and with the increase in generating textual information in the forms of the scene and video images, word spotting from the scene, and video images has grown in recent years.…”
Section: Motivation For Text Mining In Natural Scene and Video Imagesmentioning
confidence: 99%
“…The objective of Nestor is to help analysts make their natural language data, which is often unstructured, filled with technical content, jargon, mispellings, and abbreviations, computable to improve analysis. An example of natural language data that could be input to Nestor and the subsequent output data and the corresponding output is shown in Table 1.The annotated datasets generated by Nestor (as either a CSV or .h5 file) can be used for different analysis techniques, such as failure prediction, problem hot spot identification, and maintenance technician expertise assessment, as shown in [2][3][4][5][6][7][8][9][10]. Currently, the majority of use cases involve maintenance in the engineering domain (manufacturing, mining, heating ventilation and air conditioning (HVAC)), however, any natural language CSV file with UTF-8 encoding can be input to Nestor.…”
mentioning
confidence: 99%