大刺鳅基因组中微卫星分布特征分析

Distribution characteristics of microsatellite in genome of Mastacembelus armatus

  • 摘要: 【目的】通过查找和分析大刺鳅基因组中微卫星序列的分布规律及特征,为开展大刺鳅种质资源评估和分子标记辅助育种提供理论基础。【方法】基于大刺鳅基因组序列数据,采用MISA挖掘微卫星信息,利用生物信息学方法分析大刺鳅基因组中微卫星序列的分布规律及组成特征。【结果】从大刺鳅基因组中共筛选获得19020个微卫星,其中16025个为完全型微卫星;分布于2号染色体上的微卫星数目最多,24号染色体上的微卫星数目最少,分别占微卫星总数的10.26%和1.94%;单核苷酸为数目最多的微卫星,六核苷酸的最少。在完全型微卫星中,数目最多的前4种基元类型依次是A/T、AC/GT、G/C和AG/CT,分别占微卫星总数的37.82%、29.57%、6.88%和6.77%。微卫星基元重复次数的分布范围在5~104次,主要发生在重复5~6次和10~11次。微卫星长度区间为10~192 bp,主要分布在10~36 bp,占其总数的92.46%;微卫星数目最多的长度为12 bp,占其总数的15.22%。Pearson相关分析结果表明,大刺鳅微卫星长度与其数目呈极显著负相关(P<0.01)。GO功能注释分析结果发现,微卫星的基因主要富集到生物学过程、分子功能和细胞组分3个类群,富集最显著的条目为细胞核。【结论】利用基因组数据筛选大刺鳅基因组中微卫星可用性高且具有较高的多态性潜能,可为高效开发微卫星分子标记提供遗传学数据。

     

    Abstract: 【Objective】The distribution and characteristics of microsatellite sequences were searched and analyzed in the genome of Mastacembelus armatus, which could provide a theoretical basis for the evaluation of M. armatus germplasm resources and marker-assisted breeding. 【Method】Based on genomic sequence data of M. armatus, microsatellite loci were searched by MISA software and their distribution and composition characteristics in the genome of M. armatus were evaluated by bioinformatics analysis. 【Result】19020 microsatellites were obtained from genome of M. armatus including 16025 perfect microsatellites. The number of microsatellites distributed on chromosome 2 was the most, and the least was on chromosome 24, which accounting for 10.26% and 1.94% of the total number of microsatellites respectively.Mononucletide was the most abundant microsatellite while hexanucleotide was the lowest. The first four types of perfect microsatellites in the genome of M. armatus were A/T, AC/GT, G/C and AG/CT, which accounting for 37.82%, 29.57%, 6.88% and 6.77% of the total number respectively. The range of the repeat times of motifs was 5-104 with the main repeat times including 5-6 and 10-11. The length interval of microsatellites was 10-192 bp, and the main length interval was 10-36 bp, which taking up 92.46% of the total number. The maximum number of microsatellites was 12 bp in length, which accounting for 15.22% of the total number of microsatellites. Pearson correlation analysis showed that there was extremely significantly negative correlation between the length and number of microsatellites for M. armatus(P<0.01). Functional annotation and analysis of GO showed that microsatellites genes were mainly enriched in three groups including biological process, molecular function and cellular component, and nucleus was the most significantly enriched term. 【Conclusion】The microsatellites obtained based on the genome of M. armatus have high availability and polymorphism potential, which can provide genetic data for the efficient development of microsatellite molecular markers.

     

/

返回文章
返回