TY - CHAP
T1 - Big Data Interoperability Framework for Malaysian Public Open Data
AU - Ibrahim, Najhan Muhamad
AU - Hussin, Amir Aatieff Amir
AU - Hassan, Khairul Azmi
AU - Breathnach, Ciara
N1 - Publisher Copyright:
© 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Massive quantities of Malaysia Open Data are available in the public domain such as provided by data.gov.my. However, most of the available datasets are not integrated. Some are unstructured and structured following its source of datasets. Naturally, the datasets cannot interconnect or ‘interoperable’ with one another, which leads to Big Data (BD) problem. Advances in the database management system and interconnect linked data techniques to connect database systems, provide extraordinary opportunities to create relationships between distributed datasets for a particular objective. Fast-growing in computing technologies, which lead to the digitization, which lead to the capability to query various open datasets. Public Open Data come in varying sources, sizes, and formats. These Big and Small datasets formats pose various integration problems for Information Technology Frameworks. To generate meaningful linked-data to support the purposes of our study the relationship between these disparate datasets needs to be identified and integrated. This paper proposes a BD interoperability framework to integrate Malaysian public health open data. The main goal to enable the potential application with current technologies to extract and discover from Public Open Data. It would reduce the overall cost for healthcare with better prevention mechanism to be placed at the right time. By having a public open big data framework in health, we would predict the pattern of future disease that may take several years to understand.
AB - Massive quantities of Malaysia Open Data are available in the public domain such as provided by data.gov.my. However, most of the available datasets are not integrated. Some are unstructured and structured following its source of datasets. Naturally, the datasets cannot interconnect or ‘interoperable’ with one another, which leads to Big Data (BD) problem. Advances in the database management system and interconnect linked data techniques to connect database systems, provide extraordinary opportunities to create relationships between distributed datasets for a particular objective. Fast-growing in computing technologies, which lead to the digitization, which lead to the capability to query various open datasets. Public Open Data come in varying sources, sizes, and formats. These Big and Small datasets formats pose various integration problems for Information Technology Frameworks. To generate meaningful linked-data to support the purposes of our study the relationship between these disparate datasets needs to be identified and integrated. This paper proposes a BD interoperability framework to integrate Malaysian public health open data. The main goal to enable the potential application with current technologies to extract and discover from Public Open Data. It would reduce the overall cost for healthcare with better prevention mechanism to be placed at the right time. By having a public open big data framework in health, we would predict the pattern of future disease that may take several years to understand.
KW - Big data
KW - Interoperability framework
KW - Public open data
UR - http://www.scopus.com/inward/record.url?scp=85105523983&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-70713-2_39
DO - 10.1007/978-3-030-70713-2_39
M3 - Chapter
AN - SCOPUS:85105523983
T3 - Lecture Notes on Data Engineering and Communications Technologies
SP - 421
EP - 429
BT - Lecture Notes on Data Engineering and Communications Technologies
PB - Springer Science and Business Media Deutschland GmbH
ER -