(A)The sequenced strain named GREEN.
(B)Assembly process. (a) Hiseq short reads assembly; (b) Aligning Hiseq high-quality short reads to error-prone long PacBio reads (errors are indicated
by black bars). After correction, overlapsbetweenPacBio corrected long reads andHiseq short reads scaffolds canbedetected; (c) PBJelly gap filling; (d)
gap closer.
(C) The contig N50 and scaffold N50 status at every assembly step and the fraction of theD. officinalegenome represented by gene-sized scaffolds.
Primary Yaxis (red andblue) shows N50 length for every assembly step, secondary Yaxis (green) shows the percentage of estimated gene-size scaffolds
R3.5 kb (the average length of a plant gene).