Whilst it is established that almost all cultivated coffee (Coffea arabica L.) varieties originated in Yemen after some coffee seeds were introduced into Yemen from neighboring Ethiopia, the actual coffee genetic diversity in Yemen and its significance to the coffee world had never been explored. We observed five genetic clusters. The first cluster, which we named the Ethiopian-Only (EO) cluster, was made up exclusively of the Ethiopian accessions. This cluster was clearly separated from the Yemen and cultivated varieties clusters, hence confirming the genetic distance between wild Ethiopian accessions and coffee cultivated varieties around the world. The second cluster, which we named the SL-17 cluster, was a small cluster of cultivated worldwide varieties and included no Yemen samples. Two other clusters were made up of worldwide varieties and Yemen samples. We named these the Yemen Typica-Bourbon cluster and the Yemen SL-34 cluster. Finally, we observed one cluster that was unique to Yemen and was not related to any known cultivated varieties and not even to any known Ethiopian accession: we name this cluster the New-Yemen cluster. We discuss the consequences of these findings and their potential to pave the way for further comprehensive genetic improvement projects for the identification of major resilience/adaptation and cup quality genes that have been shaped through the domestication process of C. arabica.
While Ethiopia and South Sudan are the native habitats for Coffea arabica, Yemen is considered an important domestication center for this coffee species as most Arabica coffee grown around the world can be traced back to Yemen. Furthermore, climatic conditions in Yemen are hot and extremely dry. As such, Yemeni coffee trees likely have genetic merits with respect to climate resilience. However, until recently, very little was known about the genetic landscape of Yemeni coffee. The Yemeni coffee sector identifies coffee trees according to numerous vernacular names such as Udaini, Tufahi or Dawairi. However, the geographical landscape of these names and their correlation with the genetic background of the coffee trees have never been explored. In this study, we investigated the geographic occurrence of vernacular names in 148 coffee farms across the main coffee areas of Yemen. Then, we used microsatellite markers to genotype 88 coffee trees whose vernacular name was ascertained by farmers. We find a clear geographical pattern for the use of vernacular coffee names. However, the vernacular names showed no significant association with genetics. Our results support the need for a robust description of different coffee types in Yemen based on their genetic background for the benefit of Yemeni farmers.
The coffee species Coffea arabica is facing numerous challenges regarding climate change, pests and disease pressure. Improved varieties will be part of the solution. Making optimal use of the scarce genetic diversity of the species is hence essential. In this paper, we present the first study of C. arabica genetic diversity covering its complete native habitat in Ethiopia together with its main domestication centers: Yemen and Hararghe region in Ethiopia. All in all, 555 samples were analyzed with a set of Single Sequence Repeat markers. Through admixture genetic analysis, six clusters were identified. A total of two “Core Ethiopian” clusters did not participate in the domestication of the species. There were four clusters that were part of the “Domestication Pathway” of C. arabica. The first one was named “Ethiopian Legacy” as it represents the genetic link between “Core Ethiopia” and the “Domestication Pathway” in Yemen and Hararghe. The geographic origin of this cluster in Ethiopia was the south of Ethiopia, namely Gedio, Guji and Sidama, which hence appears as the source of coffee seeds that led to the domestication of C. arabica. In Yemen, in addition to the “Ethiopian Legacy” cluster, we confirmed the “Typica/Bourbon” and “New-Yemen” clusters. In Hararghe, the “Harrar” cluster, never described before, likely originates from a re-introduction of domesticated coffee from Yemen into this region of Ethiopia. Cultivated varieties around the world today originate from the “Ethiopian Legacy” and “Typica/Bourbon” clusters and but none are related to the “new-Yemen” and “Harrar” clusters. Implications for breeding strategies are discussed.
Yemeni smallholder coffee farmers face several challenges, including the ongoing civil conflict, limited rainfall levels for irrigation, and a lack of post-harvest processing infrastructure. Decades of political instability have also affected the quality, accessibility, and reputation of Yemeni coffee beans. Despite these challenges, Yemeni coffee is highly valued for its unique flavor profile and is considered one of the most valuable coffees in the world. Due to its exclusive nature and perceived value, it is also a prime target for food fraud and adulteration. This is the first study to identify the potential of Near Infrared Spectroscopy (NIRS) and chemometrics – more specifically, the discriminant analysis (PCA-LDA) – as a promising, fast, and cost-effective tool for the traceability of Yemeni coffee and sustainability of the Yemeni coffee sector. The NIR spectral signatures of whole green coffee beans from Yemeni regions (Al Mahwit, Dhamar, Ibb, Saada, and Sana'a), and other origins (n = 221) were discriminated and predicted with accuracy, sensitivity, and specificity ≥ 98% using PCA-LDA models. These results show that the chemical composition of green coffee and other factors captured on the spectral signatures can influence the discrimination of the geographical origin, a crucial component of coffee valuation in the international markets.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
customersupport@researchsolutions.com
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.
Copyright © 2025 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.