基于不同标度伪氨基酸组成预测脂肪酶的类型

科微学术

生物工程学报

首页 > 过刊浏览>2008年第24卷第11期 >1968-1974

基于不同标度伪氨基酸组成预测脂肪酶的类型
DOI:
                        
                    
CSTR:
                        
                    
作者:
                        张光亚张光亚
华侨大学生物工程与技术系，厦门 361021
在期刊界中查找
在百度中查找
在本站中查找
李红春李红春
华侨大学生物工程与技术系，厦门 361021
在期刊界中查找
在百度中查找
在本站中查找
高嘉强高嘉强
华侨大学生物工程与技术系，厦门 361021
在期刊界中查找
在百度中查找
在本站中查找
方柏山方柏山
华侨大学生物工程与技术系，厦门 361021
在期刊界中查找
在百度中查找
在本站中查找

                    
作者单位:
作者简介:
通讯作者:
中图分类号:
基金项目:高等学校博士学科点专项科研基金项目(No. 20070385001), 福建省自然科学基金项目(No. 2007J0360)资助。

Prediction of Lipases Types by Different Scale Pseudo-amino Acid Composition

Author:

Guangya Zhang
Guangya Zhang
Institute of Industrial Biotechnology, Huaqiao University, Quanzhou 362021, China
在期刊界中查找
在百度中查找
在本站中查找
Hongchun Li
Hongchun Li
Institute of Industrial Biotechnology, Huaqiao University, Quanzhou 362021, China
在期刊界中查找
在百度中查找
在本站中查找
Jiaqiang Gao
Jiaqiang Gao
Institute of Industrial Biotechnology, Huaqiao University, Quanzhou 362021, China
在期刊界中查找
在百度中查找
在本站中查找
Baishan Fang
Baishan Fang
Institute of Industrial Biotechnology, Huaqiao University, Quanzhou 362021, China
在期刊界中查找
在百度中查找
在本站中查找

Affiliation:

Fund Project:

the Research Fund for the Doctoral Program of Higher Education (No. 20070385001) and the Nature Science Foundation of Fujian Province (No. 2007J0360)

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

从序列出发预测某蛋白质是否为脂肪酶以及属于哪种脂肪酶具有重要的理论和应用价值。提出了基于Z标度和T标度的伪氨基酸组成方法提取序列特征值, 采用了k-近邻算法回答上述问题。经参数选择后, 三种方法在各自最优运行参数下, 其10倍交叉验证的结果为: 对脂肪酶和非脂肪酶预测精度分别为92.8%、91.4%和91.3%; 对脂肪酶类型预测的精度分别为92.3%、90.3%和89.7%。其中基于Z标度伪氨基酸组成效果最佳, 基于T标度的次之, 但均明显优于其他6种常见的特征值提取方法, 并对其可能的原因进行了探讨。

关键词:脂肪酶, Z-标度, T-标度, 伪氨基酸组成, k-近邻

Abstract:

Lipases are widely used enzymes in biotechnology. Although they catalyze the same reaction, their sequences vary. Therefore, it is highly desired to develop a fast and reliable method to identify the types of lipases according to their sequences, or even just to confirm whether they are lipases or not. By proposing two scales based pseudo amino acid composition approaches to extract the features of the sequences, a powerful predictor based on k-nearest neighbor was introduced to address the problems. The overall success rates thus obtained by the 10-fold cross-validation test were shown as below: for predicting lipases and nonlipase, the success rates were 92.8%, 91.4% and 91.3%, respectively. For lipase types, the success rates were 92.3%, 90.3% and 89.7%, respectively. Among them, the Z scales based pseudo amino acid composition was the best, T scales was the second. They outperformed significantly than 6 other frequently used sequence feature extraction methods. The high success rates yielded for such a stringent dataset indicate predicting the types of lipases is feasible and the different scales pseudo amino acid composition might be a useful tool for extracting the features of protein sequences, or at lease can play a complementary role to many of the other existing approaches.

Key words:Lipase, Z-scales, T-scales, pseudo-amino acid composition, k-nearest neighbor

引用本文

张光亚,李红春,高嘉强,方柏山. 基于不同标度伪氨基酸组成预测脂肪酶的类型[J]. 生物工程学报, 2008, 24(11): 1968-1974

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2008-03-11
最后修改日期:
录用日期:
在线发布日期:
出版日期:

文章二维码

通信地址：中国科学院微生物研究所邮编：100101

电话：010-64807509 E-mail：cjb@im.ac.cn

技术支持：北京勤云科技发展有限公司

科微学术

生物工程学报

引用本文

分享

微信扫一扫：分享

文章指标

历史

文章二维码