James C. Davis scite author profile

James C. Davis

4 Publications

31 Citation Statements Received

202 Citation Statements Given

How they've been cited

How they cite others

145

191

Affiliations

Loyola University Chicago, Purdue University System

Publications


Why aren’t regular expressions a lingua franca? an empirical study on the re-use and portability of regular expressions

Davis, Michael, Coghlan, et al. 2019


This paper explores the extent to which regular expressions (regexes) are portable across programming languages. Many languages offer similar regex syntaxes, and it would be natural to assume that regexes can be ported across language boundaries. But can regexes be copy/pasted across language boundaries while retaining their semantic and performance characteristics? In our survey of 158 professional software developers, most indicated that they re-use regexes across language boundaries and about half reported that they believe regexes are a universal language. We experimentally evaluated the riskiness of this practice using a novel regex corpus: 537,806 regexes from 193,524 projects written in JavaScript, Java, PHP, Python, Ruby, Go, Perl, and Rust. Using our polyglot regex corpus, we explored the hitherto-unstudied regex portability problems: logic errors due to semantic differences, and security vulnerabilities due to performance differences. We report that developers' belief in a regex lingua franca is understandable but unfounded. Though most regexes compile across language boundaries, 15% exhibit semantic differences across languages and 10% exhibit performance differences across languages. We explain these differences using regex documentation, and further illuminate our findings by investigating regex engine implementations. Along the way we found bugs in the regex engines of JavaScript-V8, Python, Ruby, and Rust, and potential semantic and performance regex bugs in thousands of modules.

CCS Concepts: Software and its engineering → Reusability; Social and professional topics → Software selection and adaptation.


Fast and accurate incremental feedback for students’ software tests using selective mutation analysis

Kazerouni, Davis, Basak, et al. 2021

Journal of Systems and Software


As incorporating software testing into programming assignments becomes routine, educators have begun to assess not only the correctness of students' software, but also the adequacy of their tests. In practice, educators rely on code coverage measures, though their shortcomings are widely known. Mutation analysis is a stronger measure of test adequacy, but it is too costly to be applied beyond the small programs developed in introductory programming courses. We demonstrate how to adapt mutation analysis to provide rapid automated feedback on software tests for complex projects in large programming courses. We study a dataset of 1389 student software projects ranging from trivial to complex. We begin by showing that although the state of the art in mutation analysis is practical for providing rapid feedback on projects in introductory courses, it is prohibitively expensive for the more complex projects in subsequent courses. To reduce this cost, we use a statistical procedure to select a subset of mutation operators that maintains accuracy while minimizing cost. We show that with only 2 operators, costs can be reduced by a factor of 2-3 with negligible loss in accuracy. Finally, we evaluate our approach on open-source software and report that our findings may generalize beyond our educational context.


Snapshot Metrics Are Not Enough: Analyzing Software Repositories with Longitudinal Metrics

Synovic, Hyatt, Thota, et al. 2022


A replication of ‘DeepBugs: a learning approach to name-based bug detection’

Winkler, Agarwal, Tung, et al. 2021

