Summary Interactive Data Transformation (320099-M-6)
73 views 4 purchases
Course
Interactive Data Transformation (320099M6)
Institution
Tilburg University (UVT)
Detailed summary of all lectures and additional notes, explanations and examples for the course "Interactive Data Transformation" at Tilburg University which is part of the Master Data Science and Society. Course was given by : E. Ioannou during the second semester, block four of the academic year ...
Tilburg University
Study Program: Master Data Science and Society
Academic Year 2021/2022, Semester 2, Block 4 (April to June 2022)
Course: Interactive Data Transformation (320099-M-6)
Lecturers: E. Ioannou
,Lecture 1: Data Base Management Systems and SQL
Database management systems
Reasons for DBMS
• Data redundancy and inconsistency: Multiple file formats, duplication in different files
o Redundancies: same info in more than one places
o Inconsistencies: different values for the same info
• Difficulty in accessing data: Need to write a new program to carry out each new task
• Data isolation: Multiple files and formats
• Integrity problems: Integrity constraints become buried in program code rather than being
stated explicitly, hard to add new constraints or change existing ones
• Atomicity of updates: Failures may leave data in an inconsistent state with partial updates
• Concurrent access by multiple users: Needed for performance, Uncontrolled concurrent
accesses can lead to inconsistencies
• Security problems: Hard to provide user access to some, but not all, data
DBMS Architecture
• Database (DB)
o Collection of data with the same structure
o Including correlations and relationships
o Common purpose, i.e., defined for a particular use
o Shared, i.e., used by several users
• Database Management System (DBMS)
o Collection of programs over DB
o Define, i.e., specify the data types, structure, constraints
o Build & manipulate, i.e., store on disk, retrieve, update
o Administrate, e.g., manage access rights
• Applications
o Access to DB for performing queries
o Android app, Web application, etc.
o Users, Applications, Systems, Clients
• Internal Layer
o Storing the data
o Software for structuring the data
o Efficient access methods (keys, indices, etc.)
• Logical Layer: Optimization of queries
o Resolving conflicting accesses, i.e., multiple users
o Guarantees constant availability even in case of failures
• External Layer: communication with users
o Analysis of user requests (queries)
o Access control
o Answer presentation
, Development Process, Lifecycle
1. Planning: Preliminary understanding of the business situation and how information systems
might help solving the problem
a. Analyzing current data processing
b. Analyzing general business functions and needs
2. Analysis: Analyze the business situation thoroughly to determine requirements and to
structure those
a. Develop Conceptual Schema (Entity Relationship Diagram)
b. Corresponds to a detailed, technology independent specification of the overall
organizational data structure
3. Design
a. Logical representation of the DB: Transform the
conceptual schema to a data management system
b. Physical: Specify how data are stored in a
computer’s secondary memory by a specific DBMS
4. Implementation
a. Build database implementation
b. Populate with data
c. Install application(s) and test
d. Complete documentation and training materials
5. Maintenance
a. Monitor the operation and usefulness of the system
b. Repair by fixing errors in database and applications
c. Enhance by analyzing the database and applications
to ensure that evolving information requirements are met
Types of DBMS
• Traditional database management systems: Text and numerical data
• Multimedia database management systems: Multimedia data
• Spatial database management systems: Geographic and geometric data
• Data Warehouses
Relational Data Model
• An approach to managing data by representing it grouped into relations
• Relational Database Management Systems (RDBMS):
o A database management system that manages data as a collection of tables in which
all relationships are represented by common values in related tables
Relational Schema
• A database consists of a finite set of relations (often referred to as tables)
• Each relation has a name and a set of attributes
• Each attribute has a name and a domain
o Domain is the collection of values an attribute may take: Null, numeric (integer,
float), Boolean, date, timestamp, string etc.
• A relation contains a set of instances (rows / tuples / records)
The benefits of buying summaries with Stuvia:
Guaranteed quality through customer reviews
Stuvia customers have reviewed more than 700,000 summaries. This how you know that you are buying the best documents.
Quick and easy check-out
You can quickly pay through credit card or Stuvia-credit for the summaries. There is no membership needed.
Focus on what matters
Your fellow students write the study notes themselves, which is why the documents are always reliable and up-to-date. This ensures you quickly get to the core!
Frequently asked questions
What do I get when I buy this document?
You get a PDF, available immediately after your purchase. The purchased document is accessible anytime, anywhere and indefinitely through your profile.
Satisfaction guarantee: how does it work?
Our satisfaction guarantee ensures that you always find a study document that suits you well. You fill out a form, and our customer service team takes care of the rest.
Who am I buying these notes from?
Stuvia is a marketplace, so you are not buying this document from us, but from seller hannahgruber. Stuvia facilitates payment to the seller.
Will I be stuck with a subscription?
No, you only buy these notes for $6.51. You're not tied to anything after your purchase.