Chapter-06

Chapter 6 of COMP255 discusses the normalization of database tables, outlining its importance in database design and detailing the various normal forms (1NF, 2NF, 3NF, BCNF, and 4NF). It emphasizes the need to eliminate redundancies and anomalies through proper structuring of tables and the application of normalization rules. The chapter also addresses the concept of denormalization and provides a data-modeling checklist to ensure compliance with design principles.

Uploaded by

Xenos Playground aka Boxman Studios

COMP255

Chapter 6
Normalization of Database Tables

1
Learning Objectives
●
Explain normalization and its role in the database design process
●
Identify and describe each of the normal forms: 1NF, 2NF, 3NF, BCNF, and 4NF
●
Explain how tables can be transformed from lower normal forms to
higher normal forms
●
Apply normalization rules to evaluate and correct table structures
●
Identify situations that require denormalization to generate information efficiently
●
Use a data-modeling checklist to check that the ERD meets a set of minimum
requirements

2
Normalization
●
The process of improving the database design
●
Assigns attributes to tables based on
determination (Chapter 3)
– Knowing the value of one attribute makes it
possible to determine the value of another
●
Reduces redundancies and anomalies

3
Data with Bad Structure

4
Issues
●
Data grouped by project (repeating groups)
●
Many redundancies
– Employee name
– Job class
– Hourly charge
●
Primary key?

5
Goals
●
Create well-formed relations (tables)
– Each table represents a single subject
– Each row/column intersection contains only one value and not a
group of values
– No data item will be unnecessarily stored in more than one table
– All nonprime attributes in a table are dependent on the primary
key
– Each table has no insertion, update, or deletion anomalies

6
Normal Forms

7
Definitions from Chapter 3
●
Determination
– State in which knowing the value of one attribute makes it possible to
determine the value of another
●
Functional dependence
– Within a relation R, an attribute B is functionally dependent on an
attribute A if and only if a given value of attribute A determines exactly
one value of attribute B
– The relationship “B is dependent on A” is equivalent to “A determines
B” and is written as A → B
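
As a rough illustration of the definition above, the following Python sketch tests whether a functional dependence A → B holds in a set of sample rows. The function name `holds` and the sample data are assumptions made for illustration and do not come from the chapter. Sample data can only refute a dependency; it cannot prove that the dependency holds for all future data.

```python
def holds(rows, determinant, dependent):
    """Return True if determinant -> dependent holds in every row of `rows`.

    rows: list of dicts; determinant and dependent: tuples of attribute names.
    """
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in determinant)
        value = tuple(row[a] for a in dependent)
        # A -> B fails as soon as one A value maps to two different B values.
        if seen.setdefault(key, value) != value:
            return False
    return True


# Hypothetical sample rows, not taken from the chapter's figures.
rows = [
    {"EMP_NUM": 101, "EMP_NAME": "Ada", "PROJ_NUM": 1, "HOURS": 5.0},
    {"EMP_NUM": 101, "EMP_NAME": "Ada", "PROJ_NUM": 2, "HOURS": 3.5},
    {"EMP_NUM": 102, "EMP_NAME": "Lin", "PROJ_NUM": 1, "HOURS": 8.0},
]

print(holds(rows, ("EMP_NUM",), ("EMP_NAME",)))  # True: each EMP_NUM has one name
print(holds(rows, ("EMP_NUM",), ("HOURS",)))     # False: employee 101 has two HOURS values
```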

8
Bad Dependencies
●
Partial dependency
●
Transitive dependency

9
Partial dependency
●
Functional dependence in which the
determinant is only part of the primary key
– Assumption: one candidate key
– Straightforward
– Easy to identify

10
Partial Dependency
●
(PROJ_NUM, EMP_NUM) is the primary key
●
Dependencies
– (PROJ_NUM, EMP_NUM) → (EMP_NAME, HOURS)
– EMP_NUM → EMP_NAME
●
The functional dependence EMP_NUM → EMP_NAME is therefore a partial
dependency
[Dependency diagram: PROJ_NUM, EMP_NUM, EMP_NAME, HOURS; arrows labeled "Dependency on primary key" and "Partial dependency"]
11
Transitive dependency
●
Attribute is dependent on another attribute that
is not part of the primary key
– More difficult to identify among a set of data
– Occurs only when a functional dependence exists
among nonprime attributes

12
Transitive Dependency
●
EMP_NUM is the primary key
●
Dependencies
– (EMP_NUM) → (EMP_NAME, MGR_NUM, MGR_NAME)
– MGR_NUM → MGR_NAME
●
MGR_NUM is not part of the primary key
– MGR_NUM → MGR_NAME is a transitive dependency

[Dependency diagram: EMP_NUM, EMP_NAME, MGR_NUM, MGR_NAME; arrows labeled "Dependency on primary key" and "Transitive dependency"]
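
The two kinds of bad dependency can be told apart mechanically once the primary key is known. The sketch below is one possible way to do it, under the same single-candidate-key assumption the slides make. The function name `classify` and its return strings are hypothetical.

```python
def classify(primary_key, determinant, dependent):
    """Classify determinant -> dependent relative to the primary key.

    Assumes the table has exactly one candidate key (the primary key).
    """
    pk, det, dep = set(primary_key), set(determinant), set(dependent)
    if det == pk:
        return "dependency on primary key"
    if det < pk:
        # The determinant is only part of a composite primary key.
        return "partial dependency"
    if not (det & pk) and not (dep & pk):
        # A nonprime attribute determines another nonprime attribute.
        return "transitive dependency"
    return "other"


# The two examples from the slides above.
print(classify(("PROJ_NUM", "EMP_NUM"), ("EMP_NUM",), ("EMP_NAME",)))
print(classify(("EMP_NUM",), ("MGR_NUM",), ("MGR_NAME",)))
```

The first call reports a partial dependency and the second a transitive dependency. A dependency in which a nonprime attribute determines part of the key falls into the "other" bucket.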
13
To First Normal Form (1NF)
●
Eliminate repeating groups
●
Identify primary key
●
Identify dependencies
– Draw a dependency diagram
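
The first two steps can be sketched in Python as follows. The nested input layout, the `to_1nf` helper, and the sample values are assumptions for illustration; the chapter's own figure is not reproduced here.

```python
def to_1nf(projects):
    """Flatten project records that hold a repeating group of assignments."""
    flat = []
    for proj in projects:
        for assignment in proj["ASSIGNMENTS"]:
            # Repeat the project-level values in every row so that each
            # row/column intersection holds exactly one value.
            flat.append({"PROJ_NUM": proj["PROJ_NUM"],
                         "PROJ_NAME": proj["PROJ_NAME"],
                         **assignment})
    return flat


# Hypothetical unnormalized data: one record per project, employees nested.
unnormalized = [
    {"PROJ_NUM": 1, "PROJ_NAME": "Alpha", "ASSIGNMENTS": [
        {"EMP_NUM": 101, "EMP_NAME": "Ada", "HOURS": 5.0},
        {"EMP_NUM": 102, "EMP_NAME": "Lin", "HOURS": 8.0}]},
    {"PROJ_NUM": 2, "PROJ_NAME": "Beta", "ASSIGNMENTS": [
        {"EMP_NUM": 101, "EMP_NAME": "Ada", "HOURS": 3.5}]},
]

rows_1nf = to_1nf(unnormalized)
keys = [(r["PROJ_NUM"], r["EMP_NUM"]) for r in rows_1nf]
# (PROJ_NUM, EMP_NUM) is unique per row, so it can serve as the primary key.
print(len(rows_1nf), len(set(keys)) == len(keys))
```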

14
Bad to 1NF

Primary Key: PROJ_NUM and EMP_NUM

15
Dependency Diagram

16
Still Problems
●
Update anomalies
– Modifying the JOB_CLASS for employee Annelise Jones requires updating many
entries; otherwise, it will generate data inconsistencies
●
Insertion anomalies
– Adding a new employee requires the employee to be assigned to a project and
therefore to enter duplicate project information. If the employee is not yet assigned to
a project, a phantom project must be created to complete the employee data entry
●
Deletion anomalies
– Suppose that only one employee is associated with a given project. If that employee
is deleted, the project information will also be deleted.
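
The update and deletion anomalies can be reproduced with Python's built-in `sqlite3` module. This is a minimal sketch; the table name `ASSIGNMENT_1NF` and the sample rows are assumptions, not the chapter's data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    """CREATE TABLE ASSIGNMENT_1NF (
           PROJ_NUM INTEGER, PROJ_NAME TEXT,
           EMP_NUM INTEGER, EMP_NAME TEXT, JOB_CLASS TEXT, HOURS REAL,
           PRIMARY KEY (PROJ_NUM, EMP_NUM))"""
)
conn.executemany(
    "INSERT INTO ASSIGNMENT_1NF VALUES (?, ?, ?, ?, ?, ?)",
    [
        (1, "Alpha", 101, "Ada", "Engineer", 5.0),
        (2, "Beta", 101, "Ada", "Engineer", 3.5),
        (3, "Gamma", 102, "Lin", "Analyst", 8.0),
    ],
)

# Update anomaly: changing only one of Ada's rows leaves her with two job classes.
conn.execute(
    "UPDATE ASSIGNMENT_1NF SET JOB_CLASS = 'Manager' "
    "WHERE EMP_NUM = 101 AND PROJ_NUM = 1"
)
job_classes = conn.execute(
    "SELECT COUNT(DISTINCT JOB_CLASS) FROM ASSIGNMENT_1NF WHERE EMP_NUM = 101"
).fetchone()[0]

# Deletion anomaly: Lin is the only employee on project 3, so deleting Lin
# also removes the only record of project Gamma.
conn.execute("DELETE FROM ASSIGNMENT_1NF WHERE EMP_NUM = 102")
gamma_rows = conn.execute(
    "SELECT COUNT(*) FROM ASSIGNMENT_1NF WHERE PROJ_NUM = 3"
).fetchone()[0]

print(job_classes, gamma_rows)  # 2 0
```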

17
To Second Normal Form (2NF)
●
Make new tables to remove partial dependencies
●
Table is in 2NF if:
– It is in 1NF
– Has no partial dependencies
●
A table in 1NF with a single-attribute primary key is
automatically in 2NF
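
Removing partial dependencies amounts to projecting the 1NF table onto one attribute set per determinant. The sketch below shows one way to do that in Python. The helper `project`, the resulting table names, and the sample rows are illustrative assumptions.

```python
def project(rows, attributes):
    """Relational projection: keep `attributes` and drop duplicate rows."""
    seen, result = set(), []
    for row in rows:
        values = tuple(row[a] for a in attributes)
        if values not in seen:
            seen.add(values)
            result.append(dict(zip(attributes, values)))
    return result


# Hypothetical 1NF rows with primary key (PROJ_NUM, EMP_NUM).
rows_1nf = [
    {"PROJ_NUM": 1, "PROJ_NAME": "Alpha", "EMP_NUM": 101,
     "EMP_NAME": "Ada", "JOB_CLASS": "Engineer", "HOURS": 5.0},
    {"PROJ_NUM": 1, "PROJ_NAME": "Alpha", "EMP_NUM": 102,
     "EMP_NAME": "Lin", "JOB_CLASS": "Analyst", "HOURS": 8.0},
    {"PROJ_NUM": 2, "PROJ_NAME": "Beta", "EMP_NUM": 101,
     "EMP_NAME": "Ada", "JOB_CLASS": "Engineer", "HOURS": 3.5},
]

# PROJ_NUM -> PROJ_NAME and EMP_NUM -> EMP_NAME, JOB_CLASS are partial
# dependencies, so each one moves to its own table.
PROJECT = project(rows_1nf, ("PROJ_NUM", "PROJ_NAME"))
EMPLOYEE = project(rows_1nf, ("EMP_NUM", "EMP_NAME", "JOB_CLASS"))
# Only HOURS depends on the whole composite key.
ASSIGNMENT = project(rows_1nf, ("PROJ_NUM", "EMP_NUM", "HOURS"))

print(len(PROJECT), len(EMPLOYEE), len(ASSIGNMENT))  # 2 2 3
```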

18
Not in 2NF

19
To Third Normal Form (3NF)
●
Make new tables to remove transitive
dependencies
●
Table is in 3NF when it:
– Is in 2NF
– Contains no transitive dependencies
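
A possible sketch of this step, using Python's `sqlite3` module: the transitive dependency JOB_CLASS → JOB_CHG_HOUR is moved into its own table, and a join confirms that no information is lost. The table names and sample rows are assumptions. Note that `CREATE TABLE ... AS SELECT` does not carry over key constraints, so a real design would declare them explicitly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE EMPLOYEE_2NF (
        EMP_NUM INTEGER PRIMARY KEY, EMP_NAME TEXT,
        JOB_CLASS TEXT, JOB_CHG_HOUR REAL);
    INSERT INTO EMPLOYEE_2NF VALUES
        (101, 'Ada', 'Engineer', 90.0),
        (102, 'Lin', 'Analyst', 70.0),
        (103, 'Sam', 'Engineer', 90.0);

    -- JOB_CLASS -> JOB_CHG_HOUR is transitive, so it moves to its own table.
    CREATE TABLE JOB AS
        SELECT DISTINCT JOB_CLASS, JOB_CHG_HOUR FROM EMPLOYEE_2NF;
    CREATE TABLE EMPLOYEE AS
        SELECT EMP_NUM, EMP_NAME, JOB_CLASS FROM EMPLOYEE_2NF;
    """
)

# Joining the two 3NF tables gives back the original rows.
rejoined = conn.execute(
    """SELECT E.EMP_NUM, E.EMP_NAME, E.JOB_CLASS, J.JOB_CHG_HOUR
       FROM EMPLOYEE E JOIN JOB J ON E.JOB_CLASS = J.JOB_CLASS
       ORDER BY E.EMP_NUM"""
).fetchall()
original = conn.execute(
    "SELECT * FROM EMPLOYEE_2NF ORDER BY EMP_NUM"
).fetchall()
job_count = conn.execute("SELECT COUNT(*) FROM JOB").fetchone()[0]

print(rejoined == original, job_count)  # True 2
```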

20
Now in 3NF

21
More Things to Look At
●
Evaluate PK assignments and naming
conventions
– Long JOB_CODE entries are not ideal as primary key values
– Use a surrogate primary key

22
More...
●
Refine attribute atomicity
– Atomic attribute: cannot be further subdivided
– Atomicity: characteristic of an atomic attribute
●
Possibly split employee name into first, last,
middle initial

23
More...
●
Identify new attributes and new relationships
– Will probably need to store more attributes on
employees
– Original data showed project managers
●
Add relationship

24
More...
●
Refine primary keys as required for data
granularity
– Granularity: Level of detail represented by the
values stored in a table’s row
●
What does ASSIGN_HOURS represent?
– Identify time frame and update design

25
More...
●
Maintain historical accuracy
– JOB_CHG_HOUR can change over time
– Save value at time of employee assignment
●
Evaluate using derived attributes
– Possibly store total costs (hours * hourly charge)
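
Both points can be illustrated with a small Python sketch. The class `Assignment`, its field names, and the rates are hypothetical. The idea is that the assignment keeps its own copy of the hourly charge, so a later rate change does not rewrite history, and the total cost is derived from stored values.

```python
from dataclasses import dataclass


@dataclass
class Assignment:
    emp_num: int
    assign_hours: float
    assign_chg_hour: float  # copy of the job's rate at the time of assignment

    @property
    def assign_charge(self) -> float:
        """Derived attribute: hours * hourly charge."""
        return self.assign_hours * self.assign_chg_hour


# Hypothetical current rates, playing the role of JOB_CHG_HOUR.
job_chg_hour = {"Engineer": 90.0}

a = Assignment(emp_num=101, assign_hours=4.0,
               assign_chg_hour=job_chg_hour["Engineer"])

job_chg_hour["Engineer"] = 100.0  # the job's rate changes later

print(a.assign_charge)  # 360.0, still based on the rate saved with the assignment
```

Whether to store the derived total or compute it on demand is a design choice: storing it speeds up reporting at the cost of keeping it in sync.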

26
Final Results

27
Issues with Surrogate Keys

●
Two entries for the same job
●
Trade-off that designers have to make

28
Boyce-Codd Normal Form (BCNF)
●
Note the dependency C → B
●
It is not partial
●
It is not transitive
●
Table is not in BCNF

29
BCNF Formal Definition
●
A special type of third normal form (3NF) in which every
determinant is a candidate key
– Determinant: Any attribute in a specific row whose value directly
determines other values in that row
– Candidate key: A minimal superkey; that is, a key that does not contain
a subset of attributes that is itself a superkey
– Superkey: An attribute or attributes that uniquely identify each entity in
a table
●
A table in BCNF must be in 3NF
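
One way to check the definition mechanically is to compute attribute closures and flag every nontrivial dependency whose determinant does not determine the whole table. The sketch below tests for superkey status, which is the usual formal test for BCNF. The attribute names A to D and the dependency (A, B) → (C, D) are assumed for illustration; the function names are hypothetical.

```python
def closure(attributes, fds):
    """Return every attribute determined by `attributes` under `fds`."""
    result = set(attributes)
    changed = True
    while changed:
        changed = False
        for determinant, dependent in fds:
            if set(determinant) <= result and not set(dependent) <= result:
                result |= set(dependent)
                changed = True
    return result


def bcnf_violations(all_attributes, fds):
    """List the dependencies whose determinant is not a superkey."""
    return [
        (det, dep)
        for det, dep in fds
        if not set(dep) <= set(det)                   # skip trivial dependencies
        and closure(det, fds) != set(all_attributes)  # determinant is not a superkey
    ]


# Assumed example: (A, B) is the primary key, and C determines B.
fds = [(("A", "B"), ("C", "D")), (("C",), ("B",))]
print(bcnf_violations("ABCD", fds))  # [(('C',), ('B',))]
```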

30
Concrete Example

31
Fourth Normal Form (4NF)
●
Rules
– All attributes must be dependent on the primary key, but they
must be independent of each other
– No row may contain two or more multivalued facts about an entity
●
Table is in 4NF when it:
– Is in 3NF
– Has no multivalued dependencies

32
Example
●
An Employee can volunteer for many organizations
●
An Employee can have many assignments
●
None of the three table versions is good

33
Split into Additional Tables
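
A minimal sketch of the split, assuming a combined table that stores every organization and assignment pairing for an employee. The column names ORG_CODE and ASSIGN_NUM, the table names, and the sample values are hypothetical.

```python
# Hypothetical combined rows: (EMP_NUM, ORG_CODE, ASSIGN_NUM).
# Two independent multivalued facts about employee 10123 share one table.
rows = [
    (10123, "RC", 1),
    (10123, "UW", 1),
    (10123, "RC", 3),
    (10123, "UW", 3),
]

# Split each multivalued fact into its own table.
SERVICE = sorted({(emp, org) for emp, org, _ in rows})
ASSIGNMENT = sorted({(emp, assign) for emp, _, assign in rows})

# Joining the two tables on EMP_NUM rebuilds the combined rows.
rejoined = sorted(
    (emp, org, assign)
    for emp, org in SERVICE
    for emp2, assign in ASSIGNMENT
    if emp == emp2
)

print(SERVICE)     # [(10123, 'RC'), (10123, 'UW')]
print(ASSIGNMENT)  # [(10123, 1), (10123, 3)]
print(rejoined == sorted(rows))  # True
```

Each fact is now stored once per employee, and adding a new organization no longer requires one row per existing assignment.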

34
Normalization and Data Design
●
Normalization should be part of the design process
– Proposed entities must meet the required normal form before
table structures are created
●
Principles and normalization procedures must be
understood to redesign and modify databases
– ERD is created through an iterative process
– Normalization focuses on the characteristics of specific entities

35
Denormalization
●
Opposing design goals
– Creation of normalized relations
– Processing requirements and speed
●
As tables are decomposed to conform to
normalization requirements
– Number of database tables expands

36
Denormalization
●
Joining a larger number of tables
– Takes additional input/output (I/O) operations and processing logic
– Reduces system speed
●
But unnormalized tables have defects of their own
– Data updates are less efficient because tables are larger
– Indexing is more cumbersome
– No simple strategies for creating virtual tables known as views
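
The trade-off can be made concrete with `sqlite3`. In this sketch, which uses assumed table names and sample data, a report over normalized tables needs two joins, and a view packages those joins so that readers see a single wide table without the data being stored redundantly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE JOB (JOB_CODE TEXT PRIMARY KEY, JOB_CHG_HOUR REAL);
    CREATE TABLE EMPLOYEE (
        EMP_NUM INTEGER PRIMARY KEY, EMP_NAME TEXT,
        JOB_CODE TEXT REFERENCES JOB (JOB_CODE));
    CREATE TABLE ASSIGNMENT (
        ASSIGN_NUM INTEGER PRIMARY KEY,
        EMP_NUM INTEGER REFERENCES EMPLOYEE (EMP_NUM),
        ASSIGN_HOURS REAL);

    INSERT INTO JOB VALUES ('ENG', 90.0), ('ANL', 70.0);
    INSERT INTO EMPLOYEE VALUES (101, 'Ada', 'ENG'), (102, 'Lin', 'ANL');
    INSERT INTO ASSIGNMENT VALUES (1, 101, 4.0), (2, 102, 2.0), (3, 101, 1.0);

    -- The view hides the joins that normalization made necessary.
    CREATE VIEW ASSIGNMENT_REPORT AS
        SELECT A.ASSIGN_NUM, E.EMP_NAME, J.JOB_CODE,
               A.ASSIGN_HOURS * J.JOB_CHG_HOUR AS CHARGE
        FROM ASSIGNMENT A
        JOIN EMPLOYEE E ON E.EMP_NUM = A.EMP_NUM
        JOIN JOB J ON J.JOB_CODE = E.JOB_CODE;
    """
)

report = conn.execute(
    "SELECT * FROM ASSIGNMENT_REPORT ORDER BY ASSIGN_NUM"
).fetchall()
print(report)
```

A denormalized design would store the report columns directly and skip the joins at query time, at the price of the redundancy and anomalies described earlier.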

37
Data Modeling Checklist
●
Business rules
– Properly document and verify all business rules with the end users
– Ensure that all business rules are written precisely, clearly, and
simply
– The business rules must help identify entities, attributes,
relationships, and constraints
– Identify the source of all business rules, and ensure that each
business rule is justified, dated, and signed off by an approving
authority

38
Data Modeling Checklist
●
Data modeling
– Naming conventions: all names should be limited in
length (database-dependent size)

39
Naming Conventions
●
Entity names:
– Should be nouns that are familiar to business and should be short
and meaningful
– Should document abbreviations, synonyms, and aliases for each
entity
– Should be unique within the model
– For composite entities, may include a combination of abbreviated
names of the entities linked through the composite entity

40
Naming Conventions
●
Attribute names:
– Should be unique within the entity
– Should use the entity abbreviation as a prefix
– Should be descriptive of the characteristic
– Should use suffixes such as _ID, _NUM, or _CODE for the PK
attribute
– Should not be a reserved word
– Should not contain spaces or special characters such as @, !, or &
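
Several of these rules lend themselves to an automated check. The sketch below is a hypothetical validator. The function name, the messages, and the short reserved-word list are assumptions; a real list depends on the DBMS.

```python
import re

# Small illustrative subset of SQL reserved words, not a complete list.
RESERVED = {"SELECT", "TABLE", "ORDER", "GROUP"}
PK_SUFFIXES = ("_ID", "_NUM", "_CODE")


def check_attribute_name(name, entity_prefix, is_pk=False):
    """Return a list of naming-convention problems found in `name`."""
    problems = []
    if not re.fullmatch(r"[A-Za-z][A-Za-z0-9_]*", name):
        problems.append("contains spaces or special characters")
    if name.upper() in RESERVED:
        problems.append("is a reserved word")
    if not name.upper().startswith(entity_prefix.upper() + "_"):
        problems.append("missing entity prefix")
    if is_pk and not name.upper().endswith(PK_SUFFIXES):
        problems.append("primary key lacks _ID, _NUM, or _CODE suffix")
    return problems


print(check_attribute_name("EMP_NUM", "EMP", is_pk=True))  # []
print(check_attribute_name("EMP_E@MAIL", "EMP"))
print(check_attribute_name("ORDER", "ORD"))
```

Uniqueness within the entity and descriptiveness still need a human reviewer or a check against the full model.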

41
Naming Conventions
●
Relationship names:
– Should be active or passive verbs that clearly
indicate the nature of the relationship

42
Data Modeling Checklist
●
Entities:
– Each entity should represent a single subject
– Each entity should represent a set of distinguishable entity instances
– All entities should be in 3NF or higher
– Any entities below 3NF should be justified
– Granularity of the entity instance should be clearly defined
– PK should be clearly defined and support the selected data
granularity

43
Data Modeling Checklist
●
Attributes:
– Should be simple and single-valued (atomic data)
– Should document default values, constraints, synonyms, and
aliases
– Derived attributes should be clearly identified and include source(s)
– Should not be redundant unless this is required for transaction
accuracy, performance, or maintaining a history
– Nonkey attributes must be fully dependent on the PK attribute

44
Data Modeling Checklist
●
Relationships:
– Should clearly identify relationship participants
– Should clearly define participation and connectivity, and
document cardinality

45
Data Modeling Checklist
●
ER model:
– Should be validated against expected processes: inserts, updates, and
deletions
– Should evaluate where, when, and how to maintain a history
– Should not contain redundant relationships except as required (see
attributes)
– Should minimize data redundancy to ensure single-place updates
– Should conform to the minimal data rule: All that is needed is there,
and all that is there is needed
