Challenges in Bringing Pangenome Research Into Breeding: A Case Study in Rice.
Nie S, Li F, Li R, Wang J, Ma Y
Summary
8.0/10Researchers are creating detailed genetic maps of rice that reveal genes for higher yields and disease resistance. But turning this scientific discovery into practical breeding tools requires developing better computer systems and AI interfaces specifically designed for farmers and breeders.
Key Findings
Rice pangenome research has identified extensive structural variations, presence/absence variants, and novel genes linked to yield, disease resistance, and stress tolerance traits
Pangenomic data enables new molecular breeding strategies and trait discovery that outperform traditional single-reference genome approaches
Adoption barriers include complexity of graph-based genetic data structures, shortage of breeder-friendly tools, and lack of agricultural-oriented AI/ML interfaces despite promising ML potential
Original Abstract
Crop breeding has entered the pangenomics era, unlocking a far more comprehensive view of genetic diversity than a single reference genome can capture. In rice (Oryza sativa), a staple crop critical to global food security, the construction of pangenome resources has uncovered extensive structural variations (SVs), presence/absence variations (PAVs) and novel genes that underpin key agronomic traits. As the rice pangenome matures from a research resource into a practical breeding tool, it promises to accelerate the development of higher-yielding, stress-resilient and disease-resistant varieties. This transition represents a pivotal advance toward sustainable agriculture and enhanced global food security, while also establishing a model for applying pangenomics to other crops. Here, we review how rice pangenome research, encompassing both cultivated and wild species, has advanced trait discovery from yield improvement and disease resistance to stress tolerance and enabled new molecular breeding strategies. Despite these advances, several challenges remain before pangenomic data can be routinely integrated into breeding pipelines. The complexity of graph-based data structures, difficulties in detecting multiallelic variants from population-wide resequencing data and the lack of breeder-friendly genotyping tools are significant barriers. Additionally, while artificial intelligence (AI) and machine learning (ML) approaches show great promise for interpreting complex pangenomic data and accelerating trait discovery by genomic selection, their practical adoption is hindered by the absence of breeder-oriented interfaces, integration challenges with multi-omics data and high computational demands. Overcoming these issues will require interdisciplinary collaboration, robust infrastructure and innovations focused on practical breeding needs across diverse crop species.
Species Mentioned
Rice is a cereal grain and in its domesticated form is the staple food of over half of the world's population, particularly in Asia and Africa. Rice is the seed of the grass species Oryza sativa —or, much less commonly, Oryza glaberrima. Asian rice was domesticated in China some 13,500 to 8,200 y...
open_in_new Wikipedia