Post-processing pipelines for image analysis in reverse engineering modelling, such as photogrammetry applications, still require manual intervention, mainly to correct shadows and reflections and, often, to remove the background. Convolutional Neural Networks (CNNs) can conveniently assist in recognition and background removal. This paper presents a CNN-based approach to background removal and assesses its efficiency. Its relevance lies in comparing CNN approaches with manual processing, in terms of accuracy versus automation, with reference to cultural heritage targets. Through a bronze statue test case, pros and cons are discussed with respect to the final model accuracy. The adopted CNN is based on the U-Net-MobileNetV2 architecture, a combination of two deep networks that converges faster and achieves higher efficiency with small datasets. The dataset consists of over 700 RGB images, from which the CNN extracts features to distinguish the pixels of the statue from those of the background. To extend the CNN's capabilities, training sets with and without dataset integration are investigated. The Dice coefficient is applied to evaluate CNN efficiency. The results are used for the photogrammetric reconstruction of the Principe Ellenistico model, which is then compared with a model obtained through a 3D scanner. Performance is also evaluated through a comparison with a photogrammetric 3D model obtained without CNN background removal. Despite a few errors caused by poor lighting conditions, the advantages in terms of process automation are substantial (over 50% time reduction).
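As an illustrative sketch (not taken from the paper), the Dice coefficient mentioned above can be computed on a pair of binary segmentation masks as follows; the function name and the toy masks are hypothetical, chosen only to show the metric:

```python
import numpy as np

def dice_coefficient(pred, target, eps=1e-7):
    """Dice coefficient between two binary masks (1 = statue, 0 = background).

    Dice = 2 * |pred AND target| / (|pred| + |target|); eps avoids
    division by zero when both masks are empty.
    """
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    intersection = np.logical_and(pred, target).sum()
    return (2.0 * intersection + eps) / (pred.sum() + target.sum() + eps)

# Toy example: two 2x2 masks agreeing on one foreground pixel
pred = [[1, 1], [0, 0]]     # predicted statue pixels
target = [[1, 0], [0, 0]]   # ground-truth statue pixels
print(round(dice_coefficient(pred, target), 3))  # → 0.667
```

A value of 1 indicates perfect overlap between the predicted and ground-truth statue masks, while 0 indicates no overlap at all.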