机构:[1]Institute of Chinese Materia Medica, China Academy of Chinese Medical Sciences, Beijing 100700, China[2]Guangdong Provincial Hospital of Chinese Medicine, Guangzhou 510006, China广东省中医院[3]National Data Center of Traditional Chinese Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China[4]Institute of Basic Research in Clinical Medicine, China Academy of Chinese Medical Sciences, Beijing 100700, China[5]Department of Biophysics, School of Basic Medical Sciences, Peking University Health Science Center, Beijing 100191, China[6]State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China[7]Waters Corporation Shanghai Science & Technology Co Ltd, Shanghai 201206, China[8]Institute of Crop Science, Chinese Academy of Agricultural Sciences/Key Laboratory of Crop Physiology and Ecology, Ministry of Agriculture, Beijing 100081, China[9]Department of Pharmacology, School of Medicine, Yale University, New Haven, 06510, CT, USA
Ginseng, which contains ginsenosides as bioactive compounds, has been regarded as an important traditional medicine for several millennia. However, the genetic background of ginseng remains poorly understood, partly because of the plant's large and complex genome composition. We report the entire genome sequence of Panax ginseng using next-generation sequencing. The 3.5-Gb nucleotide sequence contains more than 60% repeats and encodes 42 006 predicted genes. Twenty-two transcriptome datasets and mass spectrometry images of ginseng roots were adopted to precisely quantify the functional genes. Thirty-one genes were identified to be involved in the mevalonic acid pathway. Eight of these genes were annotated as 3-hydroxy-3-methylglutaryl-CoA reductases, which displayed diverse structures and expression characteristics. A total of 225 UDP-glycosyltransferases (UGTs) were identified, and these UGTs accounted for one of the largest gene families of ginseng. Tandem repeats contributed to the duplication and divergence of UGTs. Molecular modeling of UGTs in the 71st, 74th, and 94th families revealed a regiospecific conserved motif located at the N-terminus. Molecular docking predicted that this motif captures ginsenoside precursors. The ginseng genome represents a valuable resource for understanding and improving the breeding, cultivation, and synthesis biology of this key herb.
基金:
National Natural Science Foundation of China
(81403053, 81503469), the China Academy of Chinese Medical Sciences (ZZ0808021), the
Guangdong Provincial Hospital of Chinese Medicine Special Fund (2015KT1817), the China
Academy of Chinese Medical Sciences Special Fund for Health Service Development of
Chinese Medicine (ZZ0908067), and National Cancer Institute, NIH, USA (CA154295).