Grid Computing Cluster â the Development and ... - Lim Lian Tze
Grid Computing Cluster â the Development and ... - Lim Lian Tze
Grid Computing Cluster â the Development and ... - Lim Lian Tze
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Image: Binary data everywhere. Illustration by ©Rodolfo Clix (http://www.sxc.hu/profile/clix).DYNAMIC REPLICA MANAGEMENTIN DATA GRID ENVIRONMENT<strong>Grid</strong> <strong>Computing</strong> LabSchool of Computer SciencesTEAM MEMBERSProject Leader : AP Chan Huah YongResearchers : Aloysius Indrayanto, Muhammad Muzzammil bin Mohd SalahudinContributors : Siamak Sarmady, Wong Jik Soon, Seow Kwen Jin, Zeinab Noorian, Ang Sin Keat, Cheng Wai KhuenINTRODUCTIONAccessing, managing, querying, <strong>and</strong>programming a database system arenormally dependent on <strong>the</strong> type of<strong>the</strong> database deployment, st<strong>and</strong>ards,conventions defined by <strong>the</strong> databasevendor, <strong>and</strong> <strong>the</strong> language that will beused to define <strong>the</strong> query. Currently,<strong>the</strong> Relational Database ManagementSystem (RDBMS) is one of <strong>the</strong> mostwidely used database system. Queryon <strong>the</strong>se kind of databases are usuallydone by using a language namedStructured Query Language (SQL).RDBMS is currently implemented<strong>and</strong> supported by many different vendors.Every vendor has its ownmethod or convention on how usersshould access its database system.This convention is defined as a setof programming procedure calledApplication Programming Interface(API). These differences would causea developer (programmer) <strong>the</strong> needto learn many different API if he/shewants to use <strong>and</strong> integrate RDBMS deploymentsfrom multiple vendors.SQL is a st<strong>and</strong>ardized language.However, <strong>the</strong> st<strong>and</strong>ard set of SQL comm<strong>and</strong>snormally are not enough fora vendor. Hence, database vendorswould normally add non-st<strong>and</strong>ard extensionsto <strong>the</strong> SQL so that users canuse <strong>the</strong>ir RDBMS more efficiently. Thisextension would cause an administratoror a developer <strong>the</strong> need to learnmany non-st<strong>and</strong>ard SQL comm<strong>and</strong>s.In addition, some vendors also providea custom API which will need tobe used if a developer wants an evenmore efficient access to <strong>the</strong> RDBMS.Some vendors only provide customAPIs to access <strong>the</strong> RDBMS’ specific features(no extension to <strong>the</strong> SQL language).Learning many different APIs <strong>and</strong>non-st<strong>and</strong>ard SQL extensions wouldnot be efficient for a developer. Timecould be wasted to factorize whatcan be done <strong>and</strong> what cannot bedone with a particular RDBMS product/deployment.Hence, interoperatingRDBMS from multiple vendorswould be a maintenance problem.Performing a custom distributedquery between two different RDBMSthat are installed in two differenthosts (computers) would be impossibledue to <strong>the</strong> fact that <strong>the</strong> vendorsuse <strong>the</strong>ir own proprietary protocolsfor communication between<strong>the</strong>ir RDBMS servers <strong>and</strong> clients. Also,not all RDBMS vendors support distributedquery between two (or more)hosts. In addition, <strong>the</strong> cost of <strong>the</strong>software license <strong>and</strong> deployment for<strong>the</strong>se kind of databases are expensive.A large database vendors, such asOracle <strong>and</strong> IBM DB2, provides supportsfor distributed query. Hence,<strong>the</strong>ir system is called DistributedDatabase System (DDS). However, <strong>the</strong>two systems may not cooperate wellwith each o<strong>the</strong>r. Hence, performinga distributed query between Oracle<strong>and</strong> DB2 would be difficult.The VDDBMS project aims to providea compatibility layer betweendifferent RDBMS implementations <strong>and</strong><strong>the</strong> end users. The main objective isto allow distributed queries betweentwo or more RDBMS from differentvendors with <strong>the</strong> same SQL <strong>and</strong> API.14