If you scan a page from a soil report, this is called digitization. If you deploy digital technologies, both software such as building information modeling and machine learning and hardware such as autonomous drones and additive manufacturing, to support new and more collaborative forms of project delivery, this is called digitalization. Data lies at the heart of this transformation that is targeted at re-valuing infrastructure from a "brick and mortar" asset to a service for the interests of the end-users. There is a need to view the value of data completely differently from how they are routinely used in current practice. In particular, there is a need to treat data as assets in themselves, over and above their conventional roles as inputs to a physical model or as monitoring data to trigger interventions. This paper explores the availability and nature of geotechnical data and presents two recent advances made in this direction for a specific but important task of estimating soil/rock properties (compressive sampling and Bayesian machine learning). Data-driven decision making does not imply taking the engineer out of the entire life cycle management chain. It is intended to support rather than to replace human judgment.