Evolution of Data Management Systems

Fundamental Concepts, Methods and Applications

In this three-part class, students will cover the history of data management systems, from file systems to databases to distributed cloud storage. The class runs over the length of the Data Engineering course. Questions from the first two parts are integrated into the exam on cloud computing, and questions from the Cloud DMS section are integrated into the Dask notebook evaluation.

Objectives

The objectives of this course are to:

  • Introduce the fundamental concepts
  • Summarize the main characteristics of the evolution of Data Management Systems (DMS)
  • Highlight targeted application classes

Key Words

Data Management Systems, Uni-processor DBMS, Parallel DBMS, Data Integration Systems, Big Data, Cloud Data Management Systems, High Performance, Scalability, Elasticity, Multi-store/Poly-store Systems

Targeted Skills

  • Use a DMS effectively in its environment (uniprocessor, parallel, distributed, cloud) to support decision-making within an organization.
  • Choose an appropriate DMS across multiple environments so that an organization's applications run optimally.

Indicative Program

  1. Introduction to Main Problems of Data Management

  2. Parallel Database Systems

    • Objectives and Parallel Architecture Models
    • Data Partitioning Strategies
    • Parallel Query Processing
    • Motivations & Objectives
    • Characteristics and Challenges
    • Resources:
  3. From Distributed Databases (DDB) to Data Integration Systems (DIS)

    • An Example of a DDB, Motivations & Objectives
    • Design of a DDB
    • Distributed Query Processing
    • An Example of a DIS
    • Motivations & Objectives
    • Mediator-Adapters Architecture
    • Design of a Global Schema (GAV, LAV)
    • Query Processing Methodologies
  4. Cloud Data Management Systems (CDMS)

  5. Conclusion

    • Maturity of Cloud DMS
    • Key Criteria for Choosing a Data Management System

Additional Reading

  1. Principles of Distributed Database Systems; M. Tamer Özsu and Patrick Valduriez; Springer-Verlag; Fourth Edition, December 2019.

  2. Data Management in the Cloud: Challenges and Opportunities; Divyakant Agrawal, Sudipto Das, and Amr El Abbadi; Synthesis Lectures on Data Management, Vol. 4, No. 6, Pages 1-138, December 2012.

  3. Query Processing in Parallel Relational Database Systems; H. Lu, B.-C. Ooi and K.-L. Tan; IEEE Computer Society Press, CA, USA, 1994.

  4. Traitement parallèle dans les bases de données relationnelles : concepts, méthodes et applications; Abdelkader Hameurlain, Pierre Bazex and Franck Morvan; Cépaduès Editions, October 1996.