Ali Mesbah scite author profile

Using JavaScript and dynamic DOM manipulation on the client-side of web applications is becoming a widespread approach for achieving rich interactivity and responsiveness in modern web applications. At the same time, such techniques, collectively known as Ajax, shatter the metaphor of web 'pages' with unique URLs, on which traditional web crawlers are based. This paper describes a novel technique for crawling Ajax-based applications through automatic dynamic analysis of user interface state changes in web browsers. Our algorithm scans the DOM-tree, spots candidate elements that are capable of changing the state, fires events on those candidate elements, and incrementally infers a state machine modelling the various navigational paths and states within an Ajax application. This inferred model can be used, for instance, in program comprehension, analysis and testing of dynamic web states, or for generating a static version of the application. In this paper, we discuss our sequential and concurrent Ajax crawling algorithms.We present our open source tool called Crawljax, which implements the concepts and algorithms discussed in this paper. Additionally, we report a number of empirical studies in which we apply our approach to a number of open-source and industrial web applications and elaborate on the obtained results.

show abstract

Real Challenges in Mobile App Development

Joorabchi

2013

View full text Add to dashboard Cite

Building houses with local materials: means to drastically reduce the environmental impact of construction

Morel

Mesbah

Oggero³

et al. 2001

Building and Environment

489

180

View full text Add to dashboard Cite

Stochastic nonlinear model predictive control with probabilistic constraints

Mesbah

Streif

Findeisen³

et al. 2014

180

159

View full text Add to dashboard Cite

Stochastic model predictive control with active uncertainty learning: A Survey on dual control

Mesbah

2018

Annual Reviews in Control

152

136

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

customersupport@researchsolutions.com

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ali Mesbah

Crawling Ajax-Based Web Applications through Dynamic Analysis of User Interface State Changes

Real Challenges in Mobile App Development

Building houses with local materials: means to drastically reduce the environmental impact of construction

Stochastic nonlinear model predictive control with probabilistic constraints

Stochastic model predictive control with active uncertainty learning: A Survey on dual control

Contact Info

Product

Resources

About