2011 IEEE International Conference on Granular Computing 2011
DOI: 10.1109/grc.2011.6122664
|View full text |Cite
|
Sign up to set email alerts
|

Automatic approaches to clustering occupational description data for prediction of probability of workplace exposure to beryllium

Abstract: We investigated automatic approaches for clustering data that describes occupations related to hazardous airborne exposure (beryllium). The regulatory compliance data from Occupational Safety and Health Administration includes records containing short free text job descriptions and associated numerical exposure levels. Researchers in public health domain need to map job descriptions to Standard Occupational Classification (SOC) nomenclature for estimating occupational health risks. Previous manual process was … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2012
2012
2018
2018

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 16 publications
0
3
0
Order By: Relevance
“…Some authors have manually recoded this variable when their dataset was restricted to few industries (Teschke et al, 1999;Hamm and Burstyn, 2011). More recently, Slutsky et al developed an algorithm to automatically create standard occupations across all industries in IMIS from the text description (Slutsky et al, 2011). The analysis of variables internal to IMIS, while informative, cannot evaluate adequately the relationship between exposure levels in IMIS and those occurring in US workplaces.…”
Section: Comparison Of Imis and The Cehd Datasetmentioning
confidence: 99%
“…Some authors have manually recoded this variable when their dataset was restricted to few industries (Teschke et al, 1999;Hamm and Burstyn, 2011). More recently, Slutsky et al developed an algorithm to automatically create standard occupations across all industries in IMIS from the text description (Slutsky et al, 2011). The analysis of variables internal to IMIS, while informative, cannot evaluate adequately the relationship between exposure levels in IMIS and those occurring in US workplaces.…”
Section: Comparison Of Imis and The Cehd Datasetmentioning
confidence: 99%
“…Industries were classified using the North American Industry Classification System 2007 (Canadian edition) and occupations were classified using the Standard Occupation Classification 2010 (US edition). Industrial classification was done manually, while occupational classification was completed using an automated approach,24 which has previously been assessed for its reliability,25 and then manually reviewed by the authors (DGL, IB) to ensure accuracy. The JEM estimates the probability of job-specific exposure (τ) exceeding the permissible exposure level (PEL=0.2 mg/m 3 ) for PAHs (θ=Pr(τ>PEL)).…”
Section: Methodsmentioning
confidence: 99%
“…We previously developed an automated method to cluster free-text descriptions of occupations into groups with similar meaning and applied the method to identify jobs with different probability of exposure to beryllium, in the context of exposure measurements that were collected by Occupational Safety and Health Administration (OSHA) in the Integrated Management Information System (IMIS) (Slutsky et al, 2011). Compared to the crosswalk created in manual coding of free-text descriptions of occupations (Hamm and Burstyn, 2011), the automated coding produced clusters with a similar ability to discriminate among jobs with different exposures, leading us to conclude that the automated coding approach is promising and produces results that are no worse than those by human coders at the fraction of cost and effort (Slutsky et al, 2011). However, such findings have limited transferability to other settings because the clustered occupations did not correspond to widely used systems for coding occupations, such as the major occupation groups used by the SOC (Bureau of Labour and Statistics, 2013).…”
mentioning
confidence: 99%