1. Hevea brasiliensis, the Pará rubber tree, sharinga tree, seringueira, or most commonly, rubber tree or rubber plant, is a flowering plant belonging to the spurge family Euphorbiaceae originally native to the Amazon basin, but is now pantropical in distribution due to introductions
2. Hevea brasiliensis is a tall deciduous tree growing to a height of up to 43 m (141 ft) in the wild. Cultivated trees are usually much smaller because drawing off the latex restricts their growth.
3. In the wild the tree can reach a height of up to 140 feet (43 m). The white or yellow latex occurs in latex vessels in the bark, mostly outside the phloem.
4. The rubber tree takes between seven and ten years to deliver the first harvest.
5. The South American rubber tree grew only in the Amazon rainforest, and increasing demand and the discovery of the vulcanization procedure in 1839 led to the rubber boom in that region, enriching the cities of Belém, Santarém, and Manaus in Brazil and Iquitos, Peru, from 1840 to 1913. In Brazil, before the name was changed to 'Seringueira' the initial name of the plant was 'pará rubber tree', derived from the name of the province of Grão-Pará. In Peru, the tree was called 'árbol del caucho', and the latex extracted from it was called 'caucho'.
The rubber tree (Hevea brasiliensis) is grown in tropical regions and is the major source of natural rubber. Using traditional breeding approaches, the latex yield has increased by sixfold in the last century. However, the underlying genetic basis of rubber yield improvement is largely unknown. Here, we present a high-quality, chromosome-level genome sequence of the wild rubber tree, the first report on selection signatures and a genome-wide association study (GWAS) of its yield traits. Population genomic analysis revealed a moderate population divergence between the Wickham clones and wild accessions. Interestingly, it is suggestive that H. brasiliensis and six relatives of the Hevea genus might belong to the same species. The selective sweep analysis found 361 obvious signatures in the domesticated clones associated with 245 genes. In a 15-year field trial, GWAS identified 155 marker–trait associations with latex yield, in which 326 candidate genes were found. Notably, six genes related to sugar transport and metabolism, and four genes related to ethylene biosynthesis and signalling are associated with latex yield. The homozygote frequencies of the causal nonsynonymous SNPs have been greatly increased under selection, which may have contributed to the fast latex yield improvement during the short domestication history. Our study provides insights into the genetic basis of the latex yield trait and has implications for genomic-assisted breeding by offering valuable resources in this new domesticated crop.
Genome size | 1.9 Gb |
Total ungapped length | 1.7 Gb |
Number of chromosomes | 18 |
Number of organelles | 0 |
Number of scaffolds | 754 |
Scaffold N50 | 101.7 Mb |
Scaffold L50 | 9 |
Number of contigs | 2,181 |
Contig N50 | 3.2Mb |
Contig L50 | 147 |
GC percent | 34.5 |
Genome coverage | 300.0x |
The rubber tree wild germplasm was used for de novo genome sequencing. The accession was collected from the Vila Bela district, Moto Crosso States, Brazil. The accession is characterized by low heterozygosity. One hundred and seven Wickham clones, 34 IRRDB 1981’ wild germplasm accessions and six species of the Hevea genus were used for a population study. The wild germplasms were primarily collected from Acre (AC), Moto Crosso (MT) and Rondonia (RO) in Brazil (IRRDB 1981 expedition). We selected 11 RO, 10 MT, 11AC and two other accessions to study the germplasm evolution. These accessions represent the genetic diversity in the regions of Hevea origin and the major rubber tree plantations. The rubber trees are planted in the National Rubber Tree Germplasm Repository (Danzhou, Hainan, 19°52′N, 109°50′E, South China, 2006–2007).
The rubber tree natural population resequencing was conducted at Biomarker, China. The sequencing process was performed on a NovaSeq 6000 platform according to the standard protocol outlined by Illumina. The raw reads were filtered to eliminate the reads with adapters (Phred score<10, or N>10%). The clean reads were then mapped to the compiled reference genome using BWA, and the statistics were assessed. The SNPs were called using an accelerated GATK HaplotypeCaller program. The SNP calling was accelerated using a Sentieon-genomic pipeline . For single-sample SNP and genotype calling, several filtering steps were performed to remove: InDels with quality scores <30, SNPs with more than two alleles, SNPs at or within 5 bp from InDels, SNPs with genotyping quality scores (GQ) <10 and SNPs with extremely low or extremely high coverage. The obtained SNPs were finally annotated with SnpEff v4.3t.
For the population study, the VCF files were filtered to remove InDels, nonbiallelic SNPs and SNPs with a missing rate>90%. Population structure was analysed using the maximum-likelihood approach implemented in ADMIXTURE v1.3.0. Individual-based clustering analysis was conducted and cross-validated to explore convergence. The optimum number of clusters was determined. The subgroup value was preset for the SNPs selected using PLINK2 v2.00a3LM to construct the neighbour-joining tree . The phylogenetic tree was constructed using the Kimura 2-parameter model using the MEGA5 software in the presence of 1000 bootstrap replicates . The interactive tree of life tool was used to display the neighbour-joining tree. PCA for all SNPs was conducted using EIGENSOFT , using the default parameters.
The genomic regions that underwent a putative selective sweep were investigated to identify genes under selection pressure. The candidate genes were annotated in functional categories based on GO and KEGG databases. The enriched GO terms of the genes that underwent positive or negative selections were assessed using the BiNGO 3.03 with GO_full categories. Overrepresented GO categories were enriched following a hypergeometric test. The significance threshold was set as 0.05 after a Benjamini and Hocheberg FDR correction.
Cheng H et al., "Chromosome-level wild Hevea brasiliensis genome provides new tools for genomic-assisted breeding and valuable loci to elevate rubber yield.", Plant Biotechnol J, 2023 May;21(5):1058-1072 DOI:10.1111/pbi.14018