Bioinformatics analysis of the pollen development-related gene MS1 in Brassica crops
Abstract: Objective: Bioinformatics tools were used to analyze the structure and function of the MS1 gene in Brassica crops, to provide a theoretical reference for the utilization of crop heterosis. Method: The Arabidopsis thaliana L. pollen development gene AtMS1 was used as the reference sequence. Homologous gene sequences were obtained by BLAST alignment, and the encoded amino acid sequences were analyzed with bioinformatic prediction methods. Result: Four homologous sequences were obtained from the genomes of Brassica crops including B. napus L., B. rapa L. and B. oleracea L., with similarities to AtMS1 of more than 88.0%. All four contained three exons and had a CDS of 2004 bp encoding 667 amino acids. The four Brassica MS1 proteins each contained one plant homeodomain (PHD), were hydrophilic and unstable, and were localized in the nucleus. Phosphorylation occurred mainly at serine (Ser) residues, with threonine (Thr) and tyrosine (Tyr) as secondary sites. The secondary structures all consisted of α-helix, β-turn, extended strand and random coil; α-helix accounted for the highest proportion (over 40.00%) and β-turn for the lowest (about 10.00%). The tertiary structures were broadly similar, each forming a globular functional domain. The sequence similarity between the four Brassica MS1 proteins and the A. thaliana protein AtMS1 was 95.69%. The PHD domain sequences of the four Brassica MS1 proteins were highly conserved, with amino acid residues differing at only three sites. MS1 homologous proteins from 19 different plants clustered into two major groups. The MS1 proteins of A. lyrata, Camelina sativa and Raphanus sativus L. clustered together with those of the four Brassica crops and AtMS1 of A. thaliana, all of which belong to Cruciferae; the clustering of MS1 proteins was therefore consistent with plant systematics. Conclusion: The MS1 genes of Brassica crops belong to the PHD-finger gene family; their sequences are highly conserved, and they participate in regulating pollen development and maturation.
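The reported CDS length (2004 bp) and protein length (667 amino acids) can be cross-checked with simple codon arithmetic. The sketch below is illustrative only: the function name `protein_length_from_cds` is hypothetical, and it assumes the CDS length includes a single terminal stop codon, which the abstract does not state explicitly.

```python
def protein_length_from_cds(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp nucleotides.

    Assumption (not stated in the abstract): the CDS length counts the
    terminal stop codon, which does not encode an amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# 2004 bp / 3 = 668 codons; minus 1 stop codon = 667 amino acids
print(protein_length_from_cds(2004))
```

Under that assumption, 2004 bp gives 668 codons, one of which is the stop codon. This agrees with the 667 residues reported for the four Brassica MS1 proteins.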