Recently, machine learning (ML) methods have been used to create powerful language models for a broad range of natural language processing tasks. An important subset of this field is that of generating code of programming languages for automatic software development. This review provides a broad and detailed overview of studies for code generation using ML. We selected 37 publications indexed in arXiv and IEEE Xplore databases that train ML models on programming language data to generate code. The three paradigms of code generation we identified in these studies are description-to-code, code-todescription, and code-to-code. The most popular applications that work in these paradigms were found to be code generation from natural language descriptions, documentation generation, and automatic program repair, respectively. The most frequently used ML models in these studies include recurrent neural networks, transformers, and convolutional neural networks. Other neural network architectures, as well as non-neural techniques, were also observed. In this review, we have summarized the applications, models, datasets, results, limitations, and future work of 37 publications. Additionally, we include discussions on topics general to the literature reviewed. This includes comparing different model types, comparing tokenizers, the volume and quality of data used, and methods for evaluating synthesized code. Furthermore, we provide three suggestions for future work for code generation using ML.
In this research work, we have demonstrated the application of Mask-RCNN (Regional Convolutional Neural Network), a deep-learning algorithm for computer vision and specifically object detection, to semiconductor defect inspection domain. Stochastic defect detection and classification during semiconductor manufacturing has grown to be a challenging task as we continuously shrink circuit pattern dimensions (e.g., for pitches less than 32 nm). Defect inspection and analysis by state-of-the-art optical and e-beam inspection tools is generally driven by some rule-based techniques, which in turn often causes to misclassification and thereby necessitating human expert intervention. In this work, we have revisited and extended our previous deep learning-based defect classification and detection method towards improved defect instance segmentation in SEM images with precise extent of defect as well as generating a mask for each defect category/instance. This also enables to extract and calibrate each segmented mask and quantify the pixels that make up each mask, which in turn enables us to count each categorical defect instances as well as to calculate the surface area in terms of pixels. We are aiming at detecting and segmenting different types of inter-class stochastic defect patterns such as bridge, break, and line collapse as well as to differentiate accurately between intra-class multi-categorical defect bridge scenarios (as thin/single/multi-line/horizontal/non-horizontal) for aggressive pitches as well as thin resists (High NA applications). Our proposed approach demonstrates its effectiveness both quantitatively and qualitatively.
With the progression of deep learning algorithms in computer vision, a lot of research is taking place in the semiconductor industry towards improving real-time defect detection and classification analysis. An Automated Defect Classification and Detection (ADCD) framework not only enables rapid measurement of dimensions and classification of defects, but also helps minimize production costs, engineering time as well as tool cycle time associated with the defect inspection process. As we continue to shrink the pitch (below 36nm), defect characterization at wafer scale becomes a key issue as it demands rapid measurement but without losing accuracy and repeatability. Also, in the context of high NA lithography (thin resist), accurate metrology becomes difficult with very noisy as well as low contrast images (No BKM exists till now). Human eyes generally demonstrate close to the Bayesian Error limit in detecting smaller objects (for example, extracting contextual information instantaneously from nanoscale defects in SEM images). However, for most One-stage and Two-stage object detectors, this is still a very challenging task due to variable image resolution and SEM (scanning electron microscope) image quality (low SNR). In this research work, we have experimented with different modified YOLOv5 object detectors to improve challenging stochastic defect detection precision. In this work, we have proposed an ensemble strategy by empirically combining multiple custom-trained models (YOLOv5n, YOLOv5s, YOLOv5m, YOLOv5l, and YOLOv5x) together at the test and inference time. We have noticed four YOLOv5 architecture variants are outperforming against our previous Ensemble ResNets model with improvements of the average precision metric (AP) of the most difficult defect classes as p gap and microbridges as well as overall mAP accuracy. With Ensemble YOLOv5, the p gap AP and microbridge AP metrices have been improved by 35% and 25.33%, respectively, whereas the overall mAP has been improved by 6.25%. The proposed Automated Defect Classification and Detection (ADCD) framework can also be used for high resolution and high-speed metrology, providing rapid identification of defects with improved certainty and further root cause investigation.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.