TY - JOUR
T1 - Unraveling the genome of a high yielding Colombian sugarcane hybrid
AU - Trujillo-Montenegro, Jhon Henry
AU - Rodríguez Cubillos, María Juliana
AU - Loaiza, Cristian Darío
AU - Quintero, Manuel
AU - Espitia-Navarro, Héctor Fabio
AU - Salazar Villareal, Fredy Antonio
AU - Viveros Valens, Carlos Arturo
AU - González Barrios, Andrés Fernando
AU - De Vega, José
AU - Duitama, Jorge
AU - Riascos, John J.
N1 - Data Availability Statement: The original contributions presented in the study are publicly available. This data can be found here: NCBI repository, accession number: PRJNA713858 (https://www.ncbi.nlm.nih.gov/bioproject/713858).
Funding: This work was supported with funding from CENICAÑA provided by the sugarcane mills and producers from the Cauca river valley in Colombia. JT was awarded a COLCIENCIAS scholarship from the Colombian Administrative Department of Science, Technology and Innovation. The work of JD was supported by internal funds of Universidad de los Andes through the FAPA initiative led by the Vice-presidency of Research and Knowledge Creation.
PY - 2021/8/13
Y1 - 2021/8/13
N2 - Recent developments in High Throughput Sequencing (HTS) technologies and bioinformatics, including improved read lengths and genome assemblers allow the reconstruction of complex genomes with unprecedented quality and contiguity. Sugarcane has one of the most complicated genomes among grassess with a haploid length of 1Gbp and a ploidies between 8 and 12. In this work, we present a genome assembly of the Colombian sugarcane hybrid CC 01-1940. Three types of sequencing technologies were combined for this assembly: PacBio long reads, Illumina paired short reads, and Hi-C reads. We achieved a median contig length of 34.94 Mbp and a total genome assembly of 903.2 Mbp. We annotated a total of 63,724 protein coding genes and performed a reconstruction and comparative analysis of the sucrose metabolism pathway. Nucleotide evolution measurements between orthologs with close species suggest that divergence between Saccharum officinarum and Saccharum spontaneum occurred <2 million years ago. Synteny analysis between CC 01-1940 and the S. spontaneum genome confirms the presence of translocation events between the species and a random contribution throughout the entire genome in current sugarcane hybrids. Analysis of RNA-Seq data from leaf and root tissue of contrasting sugarcane genotypes subjected to water stress treatments revealed 17,490 differentially expressed genes, from which 3,633 correspond to genes expressed exclusively in tolerant genotypes. We expect the resources presented here to serve as a source of information to improve the selection processes of new varieties of the breeding programs of sugarcane.
AB - Recent developments in High Throughput Sequencing (HTS) technologies and bioinformatics, including improved read lengths and genome assemblers allow the reconstruction of complex genomes with unprecedented quality and contiguity. Sugarcane has one of the most complicated genomes among grassess with a haploid length of 1Gbp and a ploidies between 8 and 12. In this work, we present a genome assembly of the Colombian sugarcane hybrid CC 01-1940. Three types of sequencing technologies were combined for this assembly: PacBio long reads, Illumina paired short reads, and Hi-C reads. We achieved a median contig length of 34.94 Mbp and a total genome assembly of 903.2 Mbp. We annotated a total of 63,724 protein coding genes and performed a reconstruction and comparative analysis of the sucrose metabolism pathway. Nucleotide evolution measurements between orthologs with close species suggest that divergence between Saccharum officinarum and Saccharum spontaneum occurred <2 million years ago. Synteny analysis between CC 01-1940 and the S. spontaneum genome confirms the presence of translocation events between the species and a random contribution throughout the entire genome in current sugarcane hybrids. Analysis of RNA-Seq data from leaf and root tissue of contrasting sugarcane genotypes subjected to water stress treatments revealed 17,490 differentially expressed genes, from which 3,633 correspond to genes expressed exclusively in tolerant genotypes. We expect the resources presented here to serve as a source of information to improve the selection processes of new varieties of the breeding programs of sugarcane.
KW - assembly
KW - CC 01-1940
KW - CENICAÑA
KW - drought
KW - genome
KW - RNASeq
KW - sugarcane
UR - http://www.scopus.com/inward/record.url?scp=85114316398&partnerID=8YFLogxK
U2 - 10.3389/fpls.2021.694859
DO - 10.3389/fpls.2021.694859
M3 - Article
AN - SCOPUS:85114316398
SN - 1664-462X
VL - 12
JO - Frontiers in Plant Science
JF - Frontiers in Plant Science
M1 - 694859
ER -