This paper demonstrates a novel and efficient unsupervised clustering method with the combination of a Self-Organising Map (SOM) and a convolutional autoencoder. The rapidly increasing volume of radio-astronomical data has increased demand for machine learning methods as solutions to classification and outlier detection. Major astronomical discoveries are unplanned and found in the unexpected, making unsupervised machine learning highly desirable by operating without assumptions and labelled training data. Our approach shows SOM training time is drastically reduced and high-level features can be clustered by training on auto-encoded feature vectors instead of raw images. Our results demonstrate this method is capable of accurately separating outliers on a SOM with neighborhood similarity and K-means clustering of radio-astronomical features complexity. We present this method as a powerful new approach to data exploration by providing a detailed understanding of the morphology and relationships of Radio Galaxy Zoo (RGZ) dataset image features which can be applied to new radio survey data.
Information retrieval systems are evaluated against test collections of topics, documents, and assessments of which documents are relevant to which topics. Documents are chosen for relevance assessment by pooling runs from a set of existing systems. New systems can return unassessed documents, leading to an evaluation bias against them. In this paper, we propose to estimate the degree of bias against an unpooled system, and to adjust the system's score accordingly. Bias estimation can be done via leave-one-out experiments on the existing, pooled systems, but this requires the problematic assumption that the new system is similar to the existing ones. Instead, we propose that all systems, new and pooled, be fully assessed against a common set of topics, and the bias observed against the new system on the common topics be used to adjust scores on the existing topics. We demonstrate using resampling experiments on TREC test sets that our method leads to a marked reduction in error, even with only a relatively small number of common topics, and that the error decreases as the number of topics increases.
Retinal arteriovenous (AV) nicking is one of the prominent and significant microvascular abnormalities. It is characterized by the decrease in the venular caliber at both sides of an artery-vein crossing. Recent research suggests that retinal AV nicking is a strong predictor of eye diseases such as branch retinal vein occlusion and cardiovascular diseases such as stroke. In this study, we present a novel method for objective and quantitative AV nicking assessment. From the input retinal image, the vascular network is first extracted using the multiscale line detection method. The crossover point detection method is then performed to localize all AV crossing locations. At each detected crossover point, the four vessel segments, two associated with the artery and two associated with the vein, are identified and two venular segments are then recognized through the artery-vein classification method. The vessel widths along the two venular segments are measured and analyzed to compute the AV nicking severity of that crossover. The proposed method was validated on 47 high-resolution retinal images obtained from two population-based studies. The experimental results indicate a strong correlation between the computed AV nicking values and the expert grading with a Spearman correlation coefficient of 0.70. Sensitivity was 77% and specificity was 92% (Kappa κ = 0.70) when comparing AV nicking detected using the proposed method to that detected using a manual grading method, performed by trained photographic graders.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.