Abstract. Spatial soil databases can help model complex phenomena in which soils are decisive, for example, evaluating agricultural potential or estimating carbon storage capacity. The Soil Information System for Latin America and the Caribbean, SISLAC, is a regional initiative promoted by the FAO's South American Soil Partnership to contribute to the sustainable management of soil. SISLAC includes data coming from 49,084 soil profiles distributed unevenly across the continent, making it the region's largest soil database. However, some problems hinder its usages, such as the quality of the data and its high dimensionality. The objective of this research is twofold. First, to evaluate the quality of SISLAC and its data values and generate a new, improved version that meets the minimum quality requirements to be used by different interests or practical applications. Second, to demonstrate the potential of improved soil profile databases to generate more accurate information on soil properties, by conducting a case study to estimate the spatial variability of the percentage of soil organic carbon using 192 profiles in a 1473 km2 region located in the department of Valle del Cauca, Colombia. The findings show that 15 percent of the existing soil profiles had an inaccurate description of the diagnostic horizons. Further correction of an 4.5 additional percent of existing inconsistencies improved overall data quality. The improved database consists of 41,691 profiles and is available for public use at https://doi.org/10.5281/zenodo.6540710 (Díaz-Guadarrama, S. & Guevara, M., 2022). The updated profiles were segmented using algorithms for quantitative pedology to estimate the spatial variability. We generated segments one centimeter thick along with each soil profile data, then the values of these segments were adjusted using a spline-type function to enhance vertical continuity and reliability. Vertical variability was estimated up to 150 cm in-depth, while ordinary kriging predicts horizontal variability at three depth intervals, 0 to 5, 5 to 15, and 15 to 30 cm, at 250 m-spatial resolution, following the standards of the GlobalSoilMap project. Finally, the leave-one-out cross-validation provides information for evaluating the kriging model performance, obtaining values for the RMSE index between 1.77 % and 1.79 % and the R2 index greater than 0.5. The results show the usability of SISLAC database to generate spatial information on soil properties and suggest further efforts to collect a more significant amount of data to guide sustainable soil management.
Abstract. One of the largest challenges with soil information around the world is how to harmonize archived soil data from different sources and how to make it accessible to soil scientist. In Ecuador, there have been two major projects that have provided soil information, but the methodology of these projects, although comparable, did not coincide, especially with respect to how information was reported. Here, we present a new soil database for Ecuador, comprising 13 542 soil profiles with 51 713 measured soil horizons, including 92 different edaphic variables. The original data were in a non-editable format (i.e., PDF), which made it difficult to access and process the information. Our study provides an integrated framework that combines multiple analytic tools for automatically converting legacy soil information from an analog format into usable digital soil mapping inputs across Ecuador. This framework allowed us to incorporate quantitative information on a broad set of soil properties and retrieve qualitative information on soil morphological properties collected in the profile description phase, which is rarely included in soil databases. We present a new harmonized national soil database using a specific methodology to preserve relevant information. The national representativeness of soil information has been enhanced compared with other international databases, and this new database contributes to filling the gaps in publicly available soil information across the country. The database is freely available at https://doi.org/10.6073/pasta/1560e803953c839e7aedef78ff7d3f6c (Armas et al., 2022).
<p>One of the biggest challenges for digital soil mapping is the limited of&#160;field&#160;soil information (e.g., soil profile descriptions, soil sample analysis) for representing soil variability across scales. Global initiatives such as the Global Soil&#160;Partnership&#160;(GSP) and the development of a <strong>Global Soil Information System</strong>&#160;(GloSIS), World Soil Information Service (WoSis) or SoilGrids250m for global pedometric mapping highlight new opportunities but the crescent need of new and better soil datasets across the world. Soil datasets are increasingly required for the development of soil monitoring baselines, soil protection and sustainable land use strategies, and to better understand the response of soils to global environmental change.&#160; However, soil surveys are a very challenging task due to their high acquisition costs such data and operational complexity. The use of legacy soil data can reduce these sampling efforts.</p><p>The main objective of this research was the rescue, synthesis and harmonization of legacy&#160;soil profile information&#160;collected between 2009 and 2015 for different purposes (e.g., soil or natural resources inventory) across Ecuador. This project will support the creation of a soil information system at the national scale following international standards for archiving and sharing soil information (e.g., GPS or the GlobalSoilMap.net project). This new information could be useful to increase the accuracy of current digital soil information across the country and the future development of digital soil properties maps.</p><p>We provided an integrated framework combining multiple data analytic tools (e.g., python libraries, pandas, openpyxl or pdftools) for the automatic conversion of text in paper format (e.g., pdf, jpg) legacy soil information, as much the qualitative soil description as analytical data, &#160;to usable digital soil mapping inputs (e.g., spatial datasets) across Ecuador. For the conversion, we used text data mining techniques to automatically extract the information. We based on regular expressions using consecutive sequences algorithms of common patterns not only to search for terms, but also relationships between terms. Following this approach, we rescued information of 13.696 profiles in .pdf, .jpg format and compiled a database consisting of 10 soil-related variables.</p><p>The new database includes historical soil information that automatically converted a generic tabular database form (e.g., .csv) information.</p><p>As a result, we substantially improved the representation of soil information in Ecuador that can be used to support current soil information initiatives such as the WoSis, Batjes et al. 2019, with only 94 pedons available for Ecuador, the Latin American Soil Information System (SISLAC, http://54.229.242.119/sislac/es),&#160; and the United Nations goals&#160; towards increasing soil carbon sequestration areas or decreasing land desertification trends.&#160; In our database there are almost 13.696 soil profiles at the national scale, with soil-related (e.g., depth, organic carbon, salinity, texture) with positive implications for digital soil properties mapping.&#160;</p><p>With this work we increased opportunities for digital soil mapping across Ecuador. This contribution could be used to generate spatial indicators of land degradation at a national scale (e.g., salinity, erosion).</p><p>This dataset could support new knowledge for more accurate environmental modelling and to support land use management decisions at the national scale.</p><p>&#160;</p>
Abstract. One of the largest challenges with soil information around the world is how to harmonize archived soil data from different sources and how make it usable to extract knowledge. In Ecuador there have been two major projects that provided soil information, whose methodology, although comparable, did not coincide, especially regarding the structure of how information was reported. Here, we present a new soil database for Ecuador, comprising 13 542 soil profiles with over 51 713 measured soil horizons, including 92 different edaphic variables. Original data was in a non-editable format (i.e., PDF) making it difficult to access and process the information. Our study provides an integrated framework combining multiple data analytic tools for the automatic conversion of legacy soil information from analog format to usable digital soil mapping inputs across Ecuador. This framework allowed to incorporate quantitative information of a broad set of soil properties and retrieve qualitative information on soil morphological properties collected in the profile description phase, which is rarely included in soil databases. A new harmonized national database was generated using specific methodology to rescue relevant information. National representativeness of soil information has been enhanced compared to other international databases, and this new database contributes to filling the gaps of publicly available soil information across the country. The database is freely available to registered users at https://doi.org/doi:10.6073/pasta/1560e803953c839e7aedef78ff7d3f6c (Armas, et al., 2022).
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.