Evolution of Data Management Systems
Fundamental Concepts, Methods and Applications
In this three-part class, students will cover the history of data management systems, from file systems to databases to distributed cloud storage. The class runs throughout the Data Engineering course. Questions from the first two parts are integrated into the exam on cloud computing, and questions from the Cloud DMS section are integrated into the Dask notebook evaluation.
Objectives
The objectives of this course are:
- Introduce the fundamental concepts
- Describe concisely the main characteristics of the evolution of DMS (Data Management Systems)
- Highlight the targeted classes of applications
Key Words
Data Management Systems, Uni-processor DBMS, Parallel DBMS, Data Integration Systems, Big Data, Cloud Data Management Systems, High Performance, Scalability, Elasticity, Multi-store/Poly-store Systems
Targeted Skills
- Use a DMS effectively in a given environment (uniprocessor, parallel, distributed, cloud) to support decision-making within an organization.
- Choose an appropriate DMS for a given environment so that an organization's applications run optimally.
Indicative Program
- Introduction to Main Problems of Data Management
  - From File Management Systems (FMS) to Database Management Systems (DBMS)
  - Motivations, Objectives, Organizations & Drawbacks
  - Databases & Relational DBMS: Motivations & Objectives
  - Resources:
- Parallel Database Systems
  - Objectives and Parallel Architecture Models
  - Data Partitioning Strategies
  - Parallel Query Processing
  - Motivations & Objectives
  - Characteristics and Challenges
  - Resources:
- From Distributed Databases (DDB) to Data Integration Systems (DIS)
  - An Example of a DDB, Motivations & Objectives
  - Design of a DDB
  - Distributed Query Processing
  - An Example of a DIS
  - Motivations & Objectives
  - Mediator-Adapters Architecture
  - Design of a Global Schema (GAV, LAV)
  - Query Processing Methodologies
- Cloud Data Management Systems (CDMS)
  - Motivations and Objectives
  - Main Characteristics of Big Data and CDMS
  - Classification of CDMS
  - Advantages and Weaknesses of Parallel RDBMS and CDMS
  - Comparison between Parallel RDBMS and CDMS
  - Introduction to Multi-store/Polystore Systems
  - Resources:
- Conclusion
  - Maturity of Cloud DMS
  - Key Criteria for Choosing a Data Management System
Additional Reading
- Principles of Distributed Database Systems; M. Tamer Özsu and Patrick Valduriez; Springer-Verlag, Fourth Edition, December 2019.
- Data Management in the Cloud: Challenges and Opportunities; Divyakant Agrawal, Sudipto Das, and Amr El Abbadi; Synthesis Lectures on Data Management, Vol. 4, No. 6, Pages 1-138, December 2012.
- Query Processing in Parallel Relational Database Systems; H. Lu, B.-C. Ooi and K.-L. Tan; IEEE Computer Society Press, CA, USA, 1994.
- Traitement parallèle dans les bases de données relationnelles : concepts, méthodes et applications; Abdelkader Hameurlain, Pierre Bazex, Franck Morvan; Cépaduès Editions, October 1996.