计算机工程与科学

CPSE-Bio: 基于云计算的生物问题求解环境

展开
  • 上海大学计算机工程与科学学院, 上海200444
谢江(1971—), 女, 副教授, 博士, 研究方向为生物信息学、高性能计算. E-mail: jiangxsh@shu.edu.cn

收稿日期: 2012-11-29

  网络出版日期: 2013-02-28

基金资助

国家教育部博士点基金资助项目(20113108120022); 上海市科委重点资助项目(11510500300); 上海市重点学科建设资助项目(J50103)

CPSE-Bio: A Cloud-Based Biological Problem Solving Environment

Expand
  • School of Compute Engineering and Science, Shanghai University, Shanghai 200444, China

Received date: 2012-11-29

  Online published: 2013-02-28

摘要

生物信息学是结合计算机技术解决生物问题, 利用计算机能力加速生物研究的交叉性学科. 问题求解环境(problem solving environment, PSE)是一类面向科学问题求解的计算平台, 研究人员通过使用PSE可以高效地参与和开展科学研究. 由于生物数据规模通常很大, 而这些数据随着生物技术的发展仍在不断地增加, 因此, 传统单机单系统PSE已无法满足生物计算需求. 介绍上海大学计算机工程与科学学院高性能计算研究所生物信息学研究团队将云计算技术与PSE相结合, 实现基于云环境的问题求解环境CPSE-Bio, 并对其中2个代表性模块, 即PPI(protein-protein interaction)多数据库网络查询(multi-database retrieval technology, MDRT)模块和蛋白质挖掘(protein mining, PM)模块, 进行性能分析和比较.

本文引用格式

谢江, 王旻超, 易荣贵, 夏上云, 张武 . CPSE-Bio: 基于云计算的生物问题求解环境[J]. 上海大学学报(自然科学版), 2013 , 19(1) : 21 -25 . DOI: 10.3969/j.issn.1007-2861.2013.01.004

Abstract

Bioinformatics is an interdisciplinary subject which combines biology with computer science to address biological problems. The purpose of problem solving environment (PSE) is to solve scientific problems and provide an effective platform for researchers. As the scale of biological data is huge and data increase rapidly with the development of the biology technology, it is hard for the traditional PSE based on a sequential computer system to meet the processing emand. This paper reviews the work of the Bioinformatics Group at the School of Computer Engineering and Science, Shanghai University. PSE with the cloud technology and implement a bioinformatics PSE named CPSE-Bio, based on cloud computing are combined. The performances of two main modules in the CPSEBio, multi-database retrieval technology (MDRT) and protein mining (PM), are evaluated and analyzed.

参考文献

 
[1] 俞艳, 郭胜利, 何建华. 基于Web 服务的土地适应性评价PSE 设计与实现[J]. 武汉大学学报, 2006(6): 544-547.

[2] Hesper B, Hogeweg P. Bioinformatica: een werk concept [J]. Kameleon, 1970, 1(6): 28-29.

[3] Gallopoulos E, Houstis E, Rice J R. Computer as Thinker/Doer: problem-solving environments for computational science [J]. IEEE Computational Science and

Engineering, 1994, 2(1): 11-23.

[4] Boonmee C, Kawata S. Computer-assisted simulation environment for partial-differential-equation problem, 1. data structure and steering of problem solving process [J]. Transactions of the Japan Society for Computational Engineering and Science, Paper No. 19980001,1998.

[5] Boonmee C, Kawata S. Computer-assisted simulation environment for partial-differential-equation problem, 2. Visualization and steering of problem solving process

[J]. Transactions of the Japan Society for Computational Engineering and Science, Paper No. 19980002, 1998.

[6] Kawata S, Boonmee C. Visual steering of the simulation process in a scientific numerical simulation environment NCAS [J]. Enabling Technologies for Computational

Science, 2000, 548: 291-300.

[7] Mu M. PDE mart: a network-based problem solving environment for PDEs [J]. ACM Trans Mathematical Software, 2005, 31(4): 508-531.

[8] Mao G Y, Mu M. Grid-based PDE. Mart: a PDEoriented PSE for grid computing [C]// The 1st International Conference on e-Science and Grid Computing. 2005: 464-469.

[9] Kobashi H, Kawata S. PSE park: a framework to construct problem solving environments [C]// The 12th PSE Workshop. 2009.

[10] Akarsu E, Fox G C, Furmanski W. Webflow-highlevel programming environment and visual authoring toolkit for high performance distributed computing

[C]// ACM/IEEE conference on Supercomputing. 1998: 1-7.

[11] Goodale T, Allen G, Lanfermann G. The cactus framework and toolkit: design and applications [C]// High Performance Computing for Computational Science—VECPAR 2002. 2003, 2565: 197-227.

[12] The Grid Portal Toolkit [EB/OL]. [2012-11-19]. http:// gridport. npaci.edu.

[13] Apweiler R, Bairoch A. UniProt: the universal protein knowledge base [J]. Nucleic Acids Res, 2004, 32(1): 115-119.

[14] 赵志康, 谢江, 李松倍. 基于Web Service 的蛋白质相互作用网络PSE [J]. 计算机工程与设计, 2009, 30(18):4326-4329.

[15] 毛国勇, 张晓斌, 谢江. 面向生物信息学的网格问题求解平台[J]. 计算机工程, 2010, 36(11): 253-255.

[16] 毛国勇, 张晓斌, 谢江. 基于Web 的PSE-Bio 交互式可视化[J]. 计算机工程与应用, 2010, 46(31): 77-79.

[17] Xie J, Zhang X B, Zhang W. PSE-Bio: a grid enabled problem solving environment for bioinformatics [C]// The 3rd IEEE International Conference on e-Science

and Grid Computing. 2007: 529-535.

[18] Xie J, Zhang Y W, Zhang W. Studies of agent composition model of PSE-Bio workflow [C]// The 4th IEEE International Conference on e-Science. 2008: 743-748.

[19] Mao G Y, Xie J. SVG-based interactive visualization of PSE-Bio [C]// The 2009 High Performance Computing and Application. 2010: 288-294.

[20] Yu L, Xie J, Cheng X. BNMatch: a cytoscape plugin for querying and visualizing matched similar networks [C]// The 2010 International Conference on

Computer and Computational Intelligence. 2010: 476-478.

[21] Cheng X, Xie J, Yi R G. Data management and application on CPSE-Bio [C]// The 2011 Proceedings of the International Conference on Human-Centric Computing.

2011: 591-599.

[22] Xie J, Yi R G, Tan J. Multi-database retrieval technology on CPSE-Bio [C]// The 6th International Conference on Computer Sciences and Convergence Information

Technology. 2011: 280-284.

[23] Xie J, Xia S Y. Virtual technology on CPSE-Bio [C]// The 7th International Conference on Computing and Convergence Technology. 2012: 1478-1481.

[24] Cytoscape [EB/OL]. [2012-11-19]. http://www.cytoscape.org.

[25] He H, Singh K. Closure-tree: an index structure for graph queries [C]// The 22nd International Conference on Data Engineering. 2006: 38.

[26] 张建勋, 古志民, 郑超. 云计算研究进展综述[J]. 计算机应用研究, 2010, 27(2): 429-433.

[27] Bader G, Cary M, Sander C. Pathguide: a pathway resource list [J]. Nucleic Acids Res, 2006, 34(S): 504-506.

[28] Wu C H. The protein information resource [J]. Nucleic Acids Res, 2003, 31(1): 345-347.

[29] Orchard S. Protein interaction data curation: the International Molecular Exchange (IMEx) consortium [J]. Nature Methods, 2012, 9(4): 345-350.

[30] Borden T, Hennessy J, Rymarczyk J. Multiple operating systems on one processor complex [J]. IBM Systems Journal, 1989, 28: 104-123.

[31] Habib I. Virtualization with KVM [J]. Linux J, 2008, 166: 8.
文章导航

/