GTB (the Grammar Tool Box) is the tool that underpins our investigations into generalised parsing. Our goal is to produce a system that supports systematic investigation of various styles of generalised parsing in a way that allows meaningful comparisons between them in a repeatable and easily accessible fashion whilst also allowing: (i) new theoretical ideas to be generated and explored; (ii) production quality parsers to be generated and (iii) humane pedagogy. GTB comprises a language (LC) with various kinds of built-in grammar and automata related objects, and a set of black-box methods written in C++ that provide implementations of grammar transforms, automata construction algorithms, parsing and recognition algorithms, and a variety of visualisation aids. In this paper we focus on the overall rationale for the GTB framework; the GTB design goals; and some detailed operational flows that are supported by GTB.The late Roger Needham famously said that research should be done with a shovel, not tweezers [20], and the computing community both academic and industrial has certainly taken this maxim to heart. However, as archaeologists know, once past the surface layers, using a shovel is dangerous because important details may be obscured; in which case only partial or even incorrect theories may be achievable. In reality, the devil hides in the details.Computing as an academic subject has three main aspects: a purely mathematical one in which machines are seen as the physical realisation of formal results; a scientific one in which the emphasis is on controlled, repeatable experimentation in the pursuit of observable truth; and an engineering facet in which the delivery of cheap, fast and reliable systems is paramount. These three are often in tension with each other, but in all cases the degree to which we document our motivation and design decisions can be critical to wider acceptance of a technology. As in any scientific domain, proofs, scientific observations and engineered artefacts all need their broader context (as well as their core features) explained. Proofs and pedagogy; science and systems: this is what we should be doing.Sadly, computing is not like the established sciences in this respect. We see very few genuinely repeatable studies in the literature, and thus very few repeated experiments, making it hard to distinguish a well-founded consensus view from mere fashion. To be fair, computing does not always lend itself easily to controlled experimentation. The field sits at a nexus between mathematics, engineering, psychology and perhaps sociology. Some of our phenomena are * Corresponding author. Tel.: +44 (0) 1784 443425; fax: +44 (0) 1784 439786.