This is a compilation of all the questions and answers on Alisdair Owen's PostgreSQL Exercises . Don't
forget that actually solving these problems will make you go further than just skimming through this guide,
so make sure to pay PostgreSQL Exercises a visit.
Getting Started
It's pretty simple to get going with the exercises: all you have to do is open the exercises , take a look at
the questions, and try to answer them!
The dataset for these exercises is for a newly created country club, with a set of members, facilities such as
tennis courts, and booking history for those facilities. Amongst other things, the club wants to understand
how they can use their information to analyse facility usage/demand. Please note: this dataset is designed
purely for supporting an interesting array of exercises, and the database schema is flawed in several
aspects - please don't take it as an example of good design. We'll start off with a look at the Members
table:
Each member has an ID (not guaranteed to be sequential), basic address information, a reference to the
member that recommended them (if any), and a timestamp for when they joined. The addresses in the
dataset are entirely (and unrealistically) fabricated.
The facilities table lists all the bookable facilities that the country club possesses. The club stores id/name
information, the cost to book both members and guests, the initial cost to build the facility, and estimated
monthly upkeep costs. They hope to use this information to track how financially worthwhile each facility
is.
Okay, that should be all the information you need. You can select a category of query to try from the menu
above, or alternatively start from the beginning .
No problem! Getting up and running isn't too hard. First, you'll need an install of PostgreSQL, which you
can get from here . Once you have it started, download the SQL .
When you're running queries, you may find psql a little clunky. If so, I recommend trying out pgAdmin or
the Eclipse database development tools.
Schema
This category deals with the basics of SQL. It covers select and where clauses, case expressions, unions,
and a few other odds and ends. If you're already educated in SQL you will probably find these exercises
fairly easy. If not, you should find them a good point to start learning for the more difficult categories
ahead!
If you struggle with these questions, I strongly recommend Learning SQL , by Alan Beaulieu, as a concise
and well-written book on the subject. If you're interested in the fundamentals of database systems (as
opposed to just how to use them), you should also investigate An Introduction to Database Systems by C.J.
Date.
Expected results:
facid name membercost guestcost initialoutlay monthlymaintenance
Answer:
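A query along the following lines retrieves everything from the facilities table:

select * from cd.facilities;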
The SELECT statement is the basic starting block for queries that read information out of the database. A
minimal select statement is generally comprised of select [some set of columns] from [some
table or group of tables] .
In this case, we want all of the information from the facilities table. The from section is easy - we just need
to specify the cd.facilities table. 'cd' is the table's schema - a term used for a logical grouping of
related information in the database.
Next, we need to specify that we want all the columns. Conveniently, there's a shorthand for 'all columns' -
*. We can use this instead of laboriously specifying all the column names.
Expected results:
name membercost
Tennis Court 1 5
Tennis Court 2 5
Badminton Court 0
Table Tennis 0
Massage Room 1 35
Massage Room 2 35
Snooker Table 0
Pool Table 0
Answer:
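A query like this, listing just the two columns we're interested in, fits the bill:

select name, membercost
from cd.facilities;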
For this question, we need to specify the columns that we want. We can do that with a simple comma-
delimited list of column names specified to the select statement. All the database does is look at the
columns available in the FROM clause, and return the ones we asked for, as illustrated below
Generally speaking, for non-throwaway queries it's considered desirable to specify the names of the
columns you want in your queries rather than using *. This is because your application might not be able
to cope if more columns get added into the table.
Expected results:
facid name membercost guestcost initialoutlay monthlymaintenance
4 Massage Room 1 35 80 4000 3000
5 Massage Room 2 35 80 4000 3000
Answer:
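Since the expected output includes every column, a query along these lines does the job:

select *
from cd.facilities
where membercost > 0;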
The FROM clause is used to build up a set of candidate rows to read results from. In our examples so far,
this set of rows has simply been the contents of a table. In future we will explore joining, which allows us
to create much more interesting candidates.
Once we've built up our set of candidate rows, the WHERE clause allows us to filter for the rows we're
interested in - in this case, those with a membercost of more than zero. As you will see in later exercises,
WHERE clauses can have multiple components combined with boolean logic - it's possible to, for
instance, search for facilities with a cost greater than 0 and less than 10. The filtering action of the WHERE
clause on the facilities table is illustrated below:
Expected results:
Answer:
select facid, name, membercost, monthlymaintenance
from cd.facilities
where
membercost > 0 and
(membercost < monthlymaintenance/50.0);
The WHERE clause allows us to filter for the rows we're interested in - in this case, those with a
membercost of more than zero, and less than 1/50th of the monthly maintenance cost. As you can see, the
massage rooms are very expensive to run thanks to staffing costs!
When we want to test for two or more conditions, we use AND to combine them. We can, as you might
expect, use OR to test whether either of a pair of conditions is true.
You might have noticed that this is our first query that combines a WHERE clause with selecting specific
columns. You can see in the image below the effect of this: the intersection of the selected columns and
the selected rows gives us the data to return. This may not seem too interesting now, but as we add in
more complex operations like joins later, you'll see the simple elegance of this behaviour.
Expected results:
Answer:
select *
from cd.facilities
where
name like '%Tennis%';
SQL's LIKE operator provides simple pattern matching on strings. It's pretty much universally
implemented, and is nice and simple to use - it just takes a string with the % character matching any string,
and _ matching any single character. In this case, we're looking for names containing the word 'Tennis', so
putting a % on either side fits the bill.
There's other ways to accomplish this task: Postgres supports regular expressions with the ~ operator, for
example. Use whatever makes you feel comfortable, but do be aware that the LIKE operator is much
more portable between systems.
Expected results:
5 Massage Room 2 35 80 4000 3000
Answer:
select *
from cd.facilities
where
facid in (1,5);
The obvious answer to this question is to use a WHERE clause that looks like where facid = 1 or
facid = 5 . An alternative that is easier with large numbers of possible matches is the IN operator. The
IN operator takes a list of possible values, and matches them against (in this case) the facid. If one of the
values matches, the where clause is true for that row, and the row is returned.
The IN operator is a good early demonstrator of the elegance of the relational model. The argument it
takes is not just a list of values - it's actually a table with a single column. Since queries also return tables,
if you create a query that returns a single column, you can feed those results into an IN operator. To give
a toy example:
select *
from cd.facilities
where
facid in (
select facid from cd.facilities
);
This example is functionally equivalent to just selecting all the facilities, but shows you how to feed the
results of one query into another. The inner query is called a subquery .
How can you produce a list of facilities, with each labelled as 'cheap' or 'expensive' depending on if their
monthly maintenance cost is more than $100? Return the name and monthly maintenance of the facilities
in question.
Expected results:
name cost
Answer:
select name,
case when (monthlymaintenance > 100) then
'expensive'
else
'cheap'
end as cost
from cd.facilities;
This exercise contains a few new concepts. The first is the fact that we're doing computation in the area of
the query between SELECT and FROM . Previously we've only used this to select columns that we want
to return, but you can put anything in here that will produce a single result per returned row - including
subqueries.
The second new concept is the CASE statement itself. CASE is effectively like if/switch statements in
other languages, with a form as shown in the query. To add a 'middling' option, we would simply insert
another when then section.
Finally, there's the AS operator. This is simply used to label columns or expressions, to make them
display more nicely or to make them easier to reference when used as part of a subquery.
Expected results:
memid surname firstname joindate
Answer:
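Based on the explanation below, a query along these lines returns the expected columns for members who joined from the start of September 2012:

select memid, surname, firstname, joindate
from cd.members
where joindate >= '2012-09-01';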
This is our first look at SQL timestamps. They're formatted in descending order of magnitude: YYYY-MM-DD
HH:MM:SS.nnnnnn . We can compare them just like we might a unix timestamp, although getting the
differences between dates is a little more involved (and powerful!). In this case, we've just specified the
date portion of the timestamp. This gets automatically cast by postgres into the full timestamp 2012-09-01
00:00:00 .
Expected results:
surname
Bader
Baker
Boothe
Butters
Coplin
Crumpet
Dare
Farrell
GUEST
Genting
Answer:
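Putting the three concepts described below together, a query like this produces the expected list:

select distinct surname
from cd.members
order by surname
limit 10;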
There's three new concepts here, but they're all pretty simple.
Specifying DISTINCT after SELECT removes duplicate rows from the result set. Note that this
applies to rows : if row A has multiple columns, row B is only equal to it if the values in all columns are
the same. As a general rule, don't use DISTINCT in a willy-nilly fashion - it's not free to remove
duplicates from large query result sets, so do it as-needed.
Specifying ORDER BY (after the FROM and WHERE clauses, near the end of the query) allows results
to be ordered by a column or set of columns (comma separated).
The LIMIT keyword allows you to limit the number of results retrieved. This is useful for getting
results a page at a time, and can be combined with the OFFSET keyword to get following pages. This
is the same approach used by MySQL and is very convenient - you may, unfortunately, find that this
process is a little more complicated in other DBs.
Expected results:
surname
Tennis Court 2
Worthington-Smyth
Badminton Court
Pinker
Dare
Bader
Mackenzie
Crumpet
Massage Room 1
Squash Court
Answer:
select surname
from cd.members
union
select name
from cd.facilities;
The UNION operator does what you might expect: combines the results of two SQL queries into a single
table. The caveat is that both results from the two queries must have the same number of columns and
compatible data types.
UNION removes duplicate rows, while UNION ALL does not. Use UNION ALL by default, unless you
care about duplicate results.
Simple aggregation
You'd like to get the signup date of your last member. How can you retrieve this information?
Expected results:
latest
2012-09-26 18:08:45
Answer:
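A query along these lines returns the most recent signup date:

select max(joindate) as latest
from cd.members;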
This is our first foray into SQL's aggregate functions. They're used to extract information about whole
groups of rows, and allow us to easily ask questions like the one posed in this exercise.
The MAX aggregate function here is very simple: it receives all the possible values for joindate, and outputs
the one that's biggest. There's a lot more power to aggregate functions, which you will come across in
future exercises.
More aggregation
You'd like to get the first and last name of the last member(s) who signed up - not just the date. How can
you do that?
Expected results:
Answer:
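A sketch of the subquery approach described below:

select firstname, surname, joindate
from cd.members
where joindate = (select max(joindate) from cd.members);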
In the suggested approach above, you use a subquery to find out what the most recent joindate is. This
subquery returns a scalar table - that is, a table with a single column and a single row. Since we have just
a single value, we can substitute the subquery anywhere we might put a single constant value. In this case,
we use it to complete the WHERE clause of a query to find a given member.
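You might hope that something simpler would also work - for instance, a query along these lines:

select firstname, surname, max(joindate)
from cd.members;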
Unfortunately, this doesn't work. The MAX function doesn't restrict rows like the WHERE clause does - it
simply takes in a bunch of values and returns the biggest one. The database is then left wondering how to
pair up a long list of names with the single join date that's come out of the max function, and fails.
Instead, you're left having to say 'find me the row(s) which have a join date that's the same as the
maximum join date'.
As mentioned by the hint, there's other ways to get this job done - one example is below. In this approach,
rather than explicitly finding out what the last joined date is, we simply order our members table in
descending order of join date, and pick off the first one. Note that this approach does not cover the
extremely unlikely eventuality of two people joining at the exact same time :-).
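A sketch of that ordering approach:

select firstname, surname, joindate
from cd.members
order by joindate desc
limit 1;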
This topic covers inner, outer, and self joins, as well as spending a little time on subqueries (queries within
queries). If you struggle with these questions, I strongly recommend Learning SQL , by Alan Beaulieu, as a
concise and well-written book on the subject.
Expected results:
starttime
2012-09-18 09:00:00
2012-09-18 17:30:00
2012-09-18 13:30:00
2012-09-18 20:00:00
2012-09-19 09:30:00
2012-09-19 15:00:00
2012-09-19 12:00:00
2012-09-20 15:30:00
2012-09-20 11:30:00
2012-09-20 14:00:00
Answer:
select bks.starttime
from
cd.bookings bks
inner join cd.members mems
on mems.memid = bks.memid
where
mems.firstname='David'
and mems.surname='Farrell';
The most commonly used kind of join is the INNER JOIN . What this does is combine two tables based on
a join expression - in this case, for each member id in the members table, we're looking for matching
values in the bookings table. Where we find a match, a row combining the values for each table is
returned. Note that we've given each table an alias (bks and mems). This is used for two reasons: firstly,
it's convenient, and secondly we might join to the same table several times, requiring us to distinguish
between columns from each different time the table was joined in.
Let's ignore our select and where clauses for now, and focus on what the FROM statement produces. In all
our previous examples, FROM has just been a simple table. What is it now? Another table! This time, it's
produced as a composite of bookings and members. You can see a subset of the output of the join below:
For each member in the members table, the join has found all the matching member ids in the bookings
table. For each match, it's then produced a row combining the row from the members table, and the row
from the bookings table.
Obviously, this is too much information on its own, and any useful question will want to filter it down. In
our query, we use the start of the SELECT clause to pick columns, and the WHERE clause to pick rows,
as illustrated below:
That's all we need to find David's bookings! In general, I encourage you to remember that the output of
the FROM clause is essentially one big table that you then filter information out of. This may sound
inefficient - but don't worry, under the covers the DB will be behaving much more intelligently :-).
One final note: there's two different syntaxes for inner joins. I've shown you the one I prefer, that I find
more consistent with other join types. You'll commonly see a different syntax, shown below:
select bks.starttime
from
cd.bookings bks,
cd.members mems
where
mems.firstname='David'
and mems.surname='Farrell'
and mems.memid = bks.memid;
This is functionally exactly the same as the approved answer. If you feel more comfortable with this syntax,
feel free to use it!
Expected results:
start name
Answer:
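A sketch of the query described below, assuming the day in question is 2012-09-21:

select bks.starttime as start, facs.name
from cd.bookings bks
inner join cd.facilities facs
    on bks.facid = facs.facid
where
    facs.facid in (0,1) -- the tennis courts
    and bks.starttime >= '2012-09-21' -- date assumed from the question
    and bks.starttime < '2012-09-22'
order by bks.starttime;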
This is another INNER JOIN query, although it has a fair bit more complexity in it! The FROM part of the
query is easy - we're simply joining facilities and bookings tables together on the facid. This produces a
table where, for each row in bookings, we've attached detailed information about the facility being
booked.
On to the WHERE component of the query. The checks on starttime are fairly self explanatory - we're
making sure that all the bookings start between the specified dates. Since we're only interested in tennis
courts, we're also using the IN operator to tell the database system to only give us back facility IDs 0 or 1
- the IDs of the courts. There's other ways to express this: We could have used where facs.facid = 0
or facs.facid = 1 , or even where facs.name like 'Tennis%' .
The rest is pretty simple: we SELECT the columns we're interested in, and ORDER BY the start time.
Expected results:
firstname surname
Florence Bader
Timothy Baker
Gerald Butters
Jemima Farrell
Matthew Genting
David Jones
Janice Joplette
Millicent Purview
Tim Rownam
Darren Smith
Tracy Smith
Ponder Stibbons
Burton Tracy
Answer:
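A self join along these lines produces the distinct list of recommenders:

select distinct recs.firstname, recs.surname
from cd.members mems
inner join cd.members recs
    on recs.memid = mems.recommendedby
order by recs.surname, recs.firstname;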
Here's a concept that some people find confusing: you can join a table to itself! This is really useful if you
have columns that reference data in the same table, like we do with recommendedby in cd.members.
If you're having trouble visualising this, remember that this works just the same as any other inner join.
Our join takes each row in members that has a recommendedby value, and looks in members again for the
row which has a matching member id. It then generates an output row combining the two members
entries. This looks like the diagram below:
Note that while we might have two 'surname' columns in the output set, they can be distinguished by their
table aliases. Once we've selected the columns that we want, we simply use DISTINCT to ensure that
there are no duplicates.
Expected results:
memfname memsname recfname recsname
David Farrell
Jemima Farrell
GUEST GUEST
Tim Rownam
Darren Smith
Darren Smith
Tracy Smith
Burton Tracy
Hyacinth Tupperware
Answer:
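A left outer join along these lines produces the expected columns:

select mems.firstname as memfname, mems.surname as memsname,
    recs.firstname as recfname, recs.surname as recsname
from cd.members mems
left outer join cd.members recs
    on recs.memid = mems.recommendedby
order by memsname, memfname;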
Let's introduce another new concept: the LEFT OUTER JOIN . These are best explained by the way in
which they differ from inner joins. Inner joins take a left and a right table, and look for matching rows
based on a join condition ( ON ). When the condition is satisfied, a joined row is produced. A LEFT OUTER
JOIN operates similarly, except that if a given row on the left hand table doesn't match anything, it still
produces an output row. That output row consists of the left hand table row, and a bunch of NULLS in
place of the right hand table row.
This is useful in situations like this question, where we want to produce output with optional data. We
want the names of all members, and the name of their recommender if that person exists . You can't
express that properly with an inner join.
As you may have guessed, there's other outer joins too. The RIGHT OUTER JOIN is much like the LEFT
OUTER JOIN , except that the left hand side of the expression is the one that contains the optional data.
The rarely-used FULL OUTER JOIN treats both sides of the expression as optional.
Expected results:
member facility
Answer:
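A sketch of the double join described below. The filter on facility names is an assumption here (the original exercise restricts the output to the tennis courts), so adjust it to whatever the question asks for:

select distinct mems.firstname || ' ' || mems.surname as member, facs.name as facility
from cd.members mems
inner join cd.bookings bks
    on mems.memid = bks.memid
inner join cd.facilities facs
    on bks.facid = facs.facid
where facs.name like 'Tennis%' -- assumed restriction
order by member, facility;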
This exercise is largely a more complex application of what you've learned in prior questions. It's also the
first time we've used more than one join, which may be a little confusing for some. When reading join
expressions, remember that a join is effectively a function that takes two tables, one labelled the left table,
and the other the right. This is easy to visualise with just one join in the query, but a little more confusing
with two.
Our second INNER JOIN in this query has a right hand side of cd.facilities. That's easy enough to grasp.
The left hand side, however, is the table returned by joining cd.members to cd.bookings. It's important to
emphasise this: the relational model is all about tables. The output of any join is another table. The output
of a query is a table. Single columned lists are tables. Once you grasp that, you've grasped the
fundamental beauty of the model.
As a final note, we do introduce one new thing here: the || operator is used to concatenate strings.
Expected results:
Answer:
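A sketch of the join-based approach, using the question's $30 threshold and the date 2012-09-14:

select mems.firstname || ' ' || mems.surname as member, facs.name as facility,
    case
        when mems.memid = 0 then bks.slots * facs.guestcost
        else bks.slots * facs.membercost
    end as cost
from cd.members mems
inner join cd.bookings bks
    on mems.memid = bks.memid
inner join cd.facilities facs
    on bks.facid = facs.facid
where
    bks.starttime >= '2012-09-14'
    and bks.starttime < '2012-09-15'
    and (
        (mems.memid = 0 and bks.slots * facs.guestcost > 30)
        or (mems.memid != 0 and bks.slots * facs.membercost > 30)
    )
order by cost desc;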
This is a bit of a complicated one! While its more complex logic than we've used previously, there's not an
awful lot to remark upon. The WHERE clause restricts our output to sufficiently costly rows on 2012-09-14,
remembering to distinguish between guests and others. We then use a CASE statement in the column
selections to output the correct cost for the member or guest.
Produce a list of all members, along with their recommender, using no joins
How can you output a list of all members, including the individual who recommended them (if any),
without using any joins? Ensure that there are no duplicates in the list, and that each firstname + surname
pairing is formatted as a column and ordered.
Expected results:
member recommender
Burton Tracy
Darren Smith
David Farrell
GUEST GUEST
Hyacinth Tupperware
Jemima Farrell
Tim Rownam
Tracy Smith
Answer:
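A sketch of the correlated subquery approach described below:

select distinct mems.firstname || ' ' || mems.surname as member,
    (select recs.firstname || ' ' || recs.surname
        from cd.members recs
        where recs.memid = mems.recommendedby
    ) as recommender
from cd.members mems
order by member;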
This exercise marks the introduction of subqueries. Subqueries are, as the name implies, queries within a
query. They're commonly used with aggregates, to answer questions like 'get me all the details of the
member who has spent the most hours on Tennis Court 1'.
In this case, we're simply using the subquery to emulate an outer join. For every value of member, the
subquery is run once to find the name of the individual who recommended them (if any). A subquery that
uses information from the outer query in this way (and thus has to be run for each row in the result set) is
known as a correlated subquery .
How can you produce a list of bookings on the day of 2012-09-14 which will cost the member (or guest)
more than $30? Remember that guests have different costs to members (the listed costs are per half-hour
'slot'), and the guest user is always ID 0. Include in your output the name of the facility, the name of the
member formatted as a single column, and the cost. Order by descending cost.
Expected results:
member facility cost
Answer:
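A sketch of the subquery version, with the cost calculated in an inline view:

select member, facility, cost from (
    select
        mems.firstname || ' ' || mems.surname as member,
        facs.name as facility,
        case
            when mems.memid = 0 then bks.slots * facs.guestcost
            else bks.slots * facs.membercost
        end as cost
    from cd.members mems
    inner join cd.bookings bks
        on mems.memid = bks.memid
    inner join cd.facilities facs
        on bks.facid = facs.facid
    where
        bks.starttime >= '2012-09-14'
        and bks.starttime < '2012-09-15'
) as bookings
where cost > 30
order by cost desc;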
This answer provides a mild simplification to the previous iteration: in the no-subquery version, we had to
calculate the member or guest's cost in both the WHERE clause and the CASE statement. In our new
version, we produce an inline query that calculates the total booking cost for us, allowing the outer query
to simply select the bookings it's looking for. For reference, you may also see subqueries in the FROM
clause referred to as inline views .
Modifying Data
Querying data is all well and good, but at some point you're probably going to want to put data into your
database! This section deals with inserting, updating, and deleting information. Operations that alter your
data like this are collectively known as Data Manipulation Language, or DML.
In previous sections, we returned to you the results of the query you've performed. Since modifications
like the ones we're making in this section don't return any query results, we instead show you the updated
content of the table you're supposed to be working on. You can compare this with the table shown in
'Expected Results' to see how you've done.
If you struggle with these questions, I strongly recommend Learning SQL , by Alan Beaulieu.
facid: 9, Name: 'Spa', membercost: 20, guestcost: 30, initialoutlay: 100000, monthlymaintenance: 800.
Expected results:
Answer:
insert into cd.facilities
(facid, name, membercost, guestcost, initialoutlay, monthlymaintenance)
values (9, 'Spa', 20, 30, 100000, 800);
INSERT INTO VALUES is the simplest way to insert data into a table. There's not a whole lot to
discuss here: VALUES is used to construct a row of data, which the INSERT statement inserts into the
table. It's as simple as that.
You can see that there's two sections in parentheses. The first is part of the INSERT statement, and
specifies the columns that we're providing data for. The second is part of VALUES , and specifies the
actual data we want to insert into each column.
If we're inserting data into every column of the table, as in this example, explicitly specifying the column
names is optional. As long as you fill in data for all columns of the table, in the order they were defined
when you created the table, you can do something like the following:
insert into cd.facilities values (9, 'Spa', 20, 30, 100000, 800);
Generally speaking, for SQL that's going to be reused I tend to prefer being explicit and specifying the
column names.
facid: 9, Name: 'Spa', membercost: 20, guestcost: 30, initialoutlay: 100000, monthlymaintenance: 800.
facid: 10, Name: 'Squash Court 2', membercost: 3.5, guestcost: 17.5, initialoutlay: 5000,
monthlymaintenance: 80.
Expected results:
Answer:
insert into cd.facilities
(facid, name, membercost, guestcost, initialoutlay, monthlymaintenance)
values
(9, 'Spa', 20, 30, 100000, 800),
(10, 'Squash Court 2', 3.5, 17.5, 5000, 80);
VALUES can be used to generate more than one row to insert into a table, as seen in this example.
Hopefully it's clear what's going on here: the output of VALUES is a table, and that table is copied into
cd.facilities, the table specified in the INSERT command.
While you'll most commonly see VALUES when inserting data, Postgres allows you to use VALUES
wherever you might use a SELECT . This makes sense: the output of both commands is a table, it's just
that VALUES is a bit more ergonomic when working with constant data.
Similarly, it's possible to use SELECT wherever you see a VALUES . This means that you can INSERT
the results of a SELECT . For example:
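For instance, this is equivalent to the VALUES-based insert above:

insert into cd.facilities
    (facid, name, membercost, guestcost, initialoutlay, monthlymaintenance)
    select 9, 'Spa', 20, 30, 100000, 800;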
In later exercises you'll see us using INSERT SELECT to generate data to insert based on the
information already in the database.
Name: 'Spa', membercost: 20, guestcost: 30, initialoutlay: 100000, monthlymaintenance: 800.
Expected results:
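Answer:
A query along these lines, computing the next facid from the current maximum, matches the explanation below:

insert into cd.facilities
    (facid, name, membercost, guestcost, initialoutlay, monthlymaintenance)
    select (select max(facid) from cd.facilities) + 1,
        'Spa', 20, 30, 100000, 800;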
In the previous exercises we used VALUES to insert constant data into the facilities table. Here, though,
we have a new requirement: a dynamically generated ID. This gives us a real quality of life improvement,
as we don't have to manually work out what the current largest ID is: the SQL command does it for us.
Since the VALUES clause is only used to supply constant data, we need to replace it with a query instead.
The SELECT statement is fairly simple: there's an inner subquery that works out the next facid based on
the largest current id, and the rest is just constant data. The output of the statement is a row that we insert
into the facilities table.
While this works fine in our simple example, it's not how you would generally implement an incrementing
ID in the real world. Postgres provides SERIAL types that are auto-filled with the next ID when you insert
a row. As well as saving us effort, these types are also safer: unlike the answer given in this exercise,
there's no need to worry about concurrent operations generating the same ID.
Expected results:
Answer:
update cd.facilities
set initialoutlay = 10000
where facid = 1;
The UPDATE statement is used to alter existing data. If you're familiar with SELECT queries, it's pretty
easy to read: the WHERE clause works in exactly the same fashion, allowing us to filter the set of rows we
want to work with. These rows are then modified according to the specifications of the SET clause: in this
case, setting the initial outlay.
The WHERE clause is extremely important. It's easy to get it wrong or even omit it, with disastrous results.
Consider the following command:
update cd.facilities
set initialoutlay = 10000;
There's no WHERE clause to filter for the rows we're interested in. The result of this is that the update
runs on every row in the table! This is rarely what we want to happen.
Answer:
update cd.facilities
set
membercost = 6,
guestcost = 30
where facid in (0,1);
The SET clause accepts a comma separated list of values that you want to update.
Expected results:
Answer:
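A sketch of the subquery approach described below - assuming, as in the original exercise, that Tennis Court 2 (facid 1) should cost 10% more than Tennis Court 1 (facid 0):

update cd.facilities
set
    membercost = (select membercost * 1.1 from cd.facilities where facid = 0),
    guestcost = (select guestcost * 1.1 from cd.facilities where facid = 0)
where facid = 1; -- facility ids assumed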
Updating columns based on calculated data is not too intrinsically difficult: we can do so pretty easily using
subqueries. You can see this approach in our selected answer.
As the number of columns we want to update increases, standard SQL can start to get pretty awkward: you
don't want to be specifying a separate subquery for each of 15 different column updates. Postgres
provides a nonstandard extension to SQL called UPDATE FROM that addresses this: it allows you to
supply a FROM clause to generate values for use in the SET clause. Example below:
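A sketch of the UPDATE ... FROM form, under the same assumption about the two tennis courts:

update cd.facilities facs
set
    membercost = facs2.membercost * 1.1,
    guestcost = facs2.guestcost * 1.1
from (select * from cd.facilities where facid = 0) facs2 -- facility ids assumed
where facs.facid = 1;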
Expected results:
bookid facid memid starttime slots
Answer:
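The unqualified form looks like this:

delete from cd.bookings;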
The DELETE statement does what it says on the tin: deletes rows from the table. Here, we show the
command in its simplest form, with no qualifiers. In this case, it deletes everything from the table.
Obviously, you should be careful with your deletes and make sure they're always limited - we'll see how to
do that in the next exercise.
truncate cd.bookings;
TRUNCATE also deletes everything in the table, but does so using a quicker underlying mechanism. It's
not perfectly safe in all circumstances , though, so use judiciously. When in doubt, use DELETE .
Expected results:
memid surname firstname address zipcode telephone recommendedby joindate
0 GUEST GUEST GUEST 0 (000) 000-0000  2012-07-01 00:00:00
1 Smith Darren 8 Bloomsbury Close, Boston 4321 555-555-5555  2012-07-02 12:02:05
2 Smith Tracy 8 Bloomsbury Close, New York 4321 555-555-5555  2012-07-02 12:08:23
3 Rownam Tim 23 Highway Way, Boston 23423 (844) 693-0723  2012-07-03 09:32:15
4 Joplette Janice 20 Crossing Road, New York 234 (833) 942-4710 1 2012-07-03 10:25:05
5 Butters Gerald 1065 Huntingdon Avenue, Boston 56754 (844) 078-4130 1 2012-07-09 10:44:09
6 Tracy Burton 3 Tunisia Drive, Boston 45678 (822) 354-9973  2012-07-15 08:52:55
7 Dare Nancy 6 Hunting Lodge Way, Boston 10383 (833) 776-4001 4 2012-07-25 08:59:12
8 Boothe Tim 3 Bloomsbury Close, Reading, 00234 234 (811) 433-2547 3 2012-07-25 16:02:35
9 Stibbons Ponder 5 Dragons Way, Winchester 87630 (833) 160-3900 6 2012-07-25 17:09:05
10 Owen Charles 52 Cheshire Grove, Winchester, 28563 28563 (855) 542-5251 1 2012-08-03 19:42:37
11 Jones David 976 Gnats Close, Reading 33862 (844) 536-8036 4 2012-08-06 16:32:55
12 Baker Anne 55 Powdery Street, Boston 80743 844-076-5141 9 2012-08-10 14:23:22
13 Farrell Jemima 103 Firth Avenue, North Reading 57392 (855) 016-0163  2012-08-10 14:28:01
14 Smith Jack 252 Binkington Way, Boston 69302 (822) 163-3254 1 2012-08-10 16:22:05
15 Bader Florence 264 Ursula Drive, Westford 84923 (833) 499-3527 9 2012-08-10 17:52:03
16 Baker Timothy 329 James Street, Reading 58393 833-941-0824 13 2012-08-15 10:34:25
17 Pinker David 5 Impreza Road, Boston 65332 811 409-6734 13 2012-08-16 11:32:47
20 Genting Matthew 4 Nunnington Place, Wingfield, Boston 52365 (811) 972-1377 5 2012-08-19 14:55:55
21 Mackenzie Anna 64 Perkington Lane, Reading 64577 (822) 661-2898 1 2012-08-26 09:32:05
24 Sarwin Ramnaresh 12 Bullington Lane, Boston 65464 (822) 413-1470 15 2012-09-01 08:44:42
26 Jones Douglas 976 Gnats Close, Reading 11986 844 536-8036 11 2012-09-02 18:43:05
27 Rumney Henrietta 3 Burkington Plaza, Boston 78533 (822) 989-8876 20 2012-09-05 08:42:35
28 Farrell David 437 Granite Farm Road, Westford 43532 (855) 755-9876  2012-09-15 08:22:05
29 Worthington-Smyth Henry 55 Jagbi Way, North Reading 97676 (855) 894-3758 2 2012-09-17 12:27:15
30 Purview Millicent 641 Drudgery Close, Burnington, Boston 34232 (855) 941-9786 2 2012-09-18 19:04:01
35 Hunt John 5 Bullington Lane, Boston 54333 (899) 720-6978 30 2012-09-19 11:32:45
36 Crumpet Erica Crimson Road, North Reading 75655 (811) 732-4816 2 2012-09-22 08:36:38
Answer:
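Assuming, as in the original exercise, that the member to remove has id 37:

delete from cd.members where memid = 37; -- member id assumed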
This exercise is a small increment on our previous one. Instead of deleting all bookings, this time we want
to be a bit more targeted, and delete a single member that has never made a booking. To do this, we
simply have to add a WHERE clause to our command, specifying the member we want to delete. You can
see the parallels with SELECT and UPDATE statements here.
There's one interesting wrinkle here. Try this command out, but substituting in member id 0 instead. This
member has made many bookings, and you'll find that the delete fails with an error about a foreign key
constraint violation. This is an important concept in relational databases, so let's explore a little further.
Foreign keys are a mechanism for defining relationships between columns of different tables. In our case
we use them to specify that the memid column of the bookings table is related to the memid column of
the members table. The relationship (or 'constraint') specifies that for a given booking, the member
specified in the booking must exist in the members table. It's useful to have this guarantee enforced by
the database: it means that code using the database can rely on the presence of the member. It's hard
(even impossible) to enforce this at higher levels: concurrent operations can interfere and leave your
database in a broken state.
PostgreSQL supports various different kinds of constraints that allow you to enforce structure upon your
data. For more information on constraints, check out the PostgreSQL documentation on foreign keys
Expected results:
memid surname firstname address zipcode telephone recommendedby joindate
0 GUEST GUEST GUEST 0 (000) 000-0000  2012-07-01 00:00:00
1 Smith Darren 8 Bloomsbury Close, Boston 4321 555-555-5555  2012-07-02 12:02:05
2 Smith Tracy 8 Bloomsbury Close, New York 4321 555-555-5555  2012-07-02 12:08:23
3 Rownam Tim 23 Highway Way, Boston 23423 (844) 693-0723  2012-07-03 09:32:15
4 Joplette Janice 20 Crossing Road, New York 234 (833) 942-4710 1 2012-07-03 10:25:05
5 Butters Gerald 1065 Huntingdon Avenue, Boston 56754 (844) 078-4130 1 2012-07-09 10:44:09
6 Tracy Burton 3 Tunisia Drive, Boston 45678 (822) 354-9973  2012-07-15 08:52:55
7 Dare Nancy 6 Hunting Lodge Way, Boston 10383 (833) 776-4001 4 2012-07-25 08:59:12
8 Boothe Tim 3 Bloomsbury Close, Reading, 00234 234 (811) 433-2547 3 2012-07-25 16:02:35
9 Stibbons Ponder 5 Dragons Way, Winchester 87630 (833) 160-3900 6 2012-07-25 17:09:05
10 Owen Charles 52 Cheshire Grove, Winchester, 28563 28563 (855) 542-5251 1 2012-08-03 19:42:37
11 Jones David 976 Gnats Close, Reading 33862 (844) 536-8036 4 2012-08-06 16:32:55
12 Baker Anne 55 Powdery Street, Boston 80743 844-076-5141 9 2012-08-10 14:23:22
13 Farrell Jemima 103 Firth Avenue, North Reading 57392 (855) 016-0163  2012-08-10 14:28:01
14 Smith Jack 252 Binkington Way, Boston 69302 (822) 163-3254 1 2012-08-10 16:22:05
15 Bader Florence 264 Ursula Drive, Westford 84923 (833) 499-3527 9 2012-08-10 17:52:03
16 Baker Timothy 329 James Street, Reading 58393 833-941-0824 13 2012-08-15 10:34:25
17 Pinker David 5 Impreza Road, Boston 65332 811 409-6734 13 2012-08-16 11:32:47
20 Genting Matthew 4 Nunnington Place, Wingfield, Boston 52365 (811) 972-1377 5 2012-08-19 14:55:55
21 Mackenzie Anna 64 Perkington Lane, Reading 64577 (822) 661-2898 1 2012-08-26 09:32:05
24 Sarwin Ramnaresh 12 Bullington Lane, Boston 65464 (822) 413-1470 15 2012-09-01 08:44:42
26 Jones Douglas 976 Gnats Close, Reading 11986 844 536-8036 11 2012-09-02 18:43:05
27 Rumney Henrietta 3 Burkington Plaza, Boston 78533 (822) 989-8876 20 2012-09-05 08:42:35
28 Farrell David 437 Granite Farm Road, Westford 43532 (855) 755-9876  2012-09-15 08:22:05
29 Worthington-Smyth Henry 55 Jagbi Way, North Reading 97676 (855) 894-3758 2 2012-09-17 12:27:15
30 Purview Millicent 641 Drudgery Close, Burnington, Boston 34232 (855) 941-9786 2 2012-09-18 19:04:01
35 Hunt John 5 Bullington Lane, Boston 54333 (899) 720-6978 30 2012-09-19 11:32:45
36 Crumpet Erica Crimson Road, North Reading 75655 (811) 732-4816 2 2012-09-22 08:36:38
Answer:
delete from cd.members where memid not in (select memid from cd.bookings);
We can use subqueries to determine whether a row should be deleted or not. There's a couple of standard
ways to do this. In our featured answer, the subquery produces a list of all the different member ids in the
cd.bookings table. If a row in the table isn't in the list generated by the subquery, it gets deleted.
An alternative is to use a correlated subquery . Where our previous example runs a large subquery once,
the correlated approach instead specifies a smaller subquery to run against every row.
delete from cd.members mems where not exists (select 1 from cd.bookings where memid
= mems.memid);
The two different forms can have different performance characteristics. Under the hood, your database
engine is free to transform your query to execute it in a correlated or uncorrelated fashion, though, so
things can be a little hard to predict.
Aggregation
Aggregation is one of those capabilities that really make you appreciate the power of relational database
systems. It allows you to move beyond merely persisting your data, into the realm of asking truly
interesting questions that can be used to inform decision making. This category covers aggregation at
length, making use of standard grouping as well as more recent window functions.
If you struggle with these questions, I strongly recommend Learning SQL , by Alan Beaulieu and SQL
Cookbook by Anthony Molinaro. In fact, get the latter anyway - it'll take you beyond anything you find on
this site, and on multiple different database systems to boot.
Expected results:
count
9
Answer:
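A query along these lines produces the count:

select count(*) from cd.facilities;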
Aggregation starts out pretty simply! The SQL above selects everything from our facilities table, and then
counts the number of rows in the result set. The count function has a variety of uses: count(*) counts every row, count(somecolumn) counts only the rows where that column is non-null, and count(distinct somecolumn) counts the number of different values in that column.
The basic idea of an aggregate function is that it takes in a column of data, performs some function upon it,
and outputs a scalar (single) value. There are a bunch more aggregation functions, including MAX , MIN ,
SUM , and AVG . These all do pretty much what you'd expect from their names :-).
One aspect of aggregate functions that people often find confusing is in queries like the below:
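For instance, something like this (which will fail):

select facid, count(*) from cd.facilities;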
Try it out, and you'll find that it doesn't work. This is because count(*) wants to collapse the facilities table
into a single value - unfortunately, it can't do that, because there's a lot of different facids in cd.facilities -
Postgres doesn't know which facid to pair the count with.
Instead, if you wanted a query that returns all the facids along with a count on each row, you can break
the aggregation out into a subquery as below:
select facid,
(select count(*) from cd.facilities)
from cd.facilities
When we have a subquery that returns a scalar value like this, Postgres knows to simply repeat the value
for every row in cd.facilities.
Expected results:
count
Answer:
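A sketch of the query, assuming the question's threshold is a guest cost of 10 or more:

select count(*)
from cd.facilities
where guestcost >= 10; -- threshold assumed from the original question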
This one is only a simple modification to the previous question: we need to weed out the inexpensive
facilities. This is easy to do using a WHERE clause. Our aggregation can now only see the expensive
facilities.
Count the number of recommendations each member makes
Produce a count of the number of recommendations each member has made. Order by member ID.
Expected results:
recommendedby count
1 5
2 3
3 1
4 2
5 1
6 1
9 2
11 1
13 2
15 1
16 1
20 1
30 1
Answer:
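A query along these lines produces the expected counts:

select recommendedby, count(*)
from cd.members
where recommendedby is not null
group by recommendedby
order by recommendedby;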
Previously, we've seen that aggregation functions are applied to a column of values, and convert them into
an aggregated scalar value. This is useful, but we often find that we don't want just a single aggregated
result: for example, instead of knowing the total amount of money the club has made this month, I might
want to know how much money each different facility has made, or which times of day were most
lucrative.
In order to support this kind of behaviour, SQL has the GROUP BY construct. What this does is batch the
data together into groups, and run the aggregation function separately for each group. When you specify a
GROUP BY , the database produces an aggregated value for each distinct value in the supplied columns.
In this case, we're saying 'for each distinct value of recommendedby, get me the number of times that
value appears'.
Expected results:
facid Total Slots
0 1320
1 1278
2 1209
3 830
4 1404
5 228
6 1104
7 908
8 911
Answer:
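A query along these lines produces the totals:

select facid, sum(slots) as "Total Slots"
from cd.bookings
group by facid
order by facid;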
Other than the fact that we've introduced the SUM aggregate function, there's not a great deal to say
about this exercise. For each distinct facility id, the SUM function adds together everything in the slots
column.
Expected results:
facid Total Slots
5 122
3 422
7 426
8 471
6 540
2 570
1 588
0 591
4 648
Answer:
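A sketch of the query, assuming the month in question is September 2012 (which matches the expected totals):

select facid, sum(slots) as "Total Slots"
from cd.bookings
where
    starttime >= '2012-09-01'
    and starttime < '2012-10-01'
group by facid
order by sum(slots);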
This is only a minor alteration of our previous example. Remember that aggregation happens after the
WHERE clause is evaluated: we thus use the WHERE to restrict the data we aggregate over, and our
aggregation only sees data from a single month.
Expected results:
facid month Total Slots
0 7 270
0 8 459
0 9 591
1 7 207
1 8 483
1 9 588
2 7 180
2 8 459
2 9 570
3 7 104
3 8 304
3 9 422
4 7 264
4 8 492
4 9 648
5 7 24
5 8 82
5 9 122
6 7 164
6 8 400
6 9 540
7 7 156
7 8 326
7 9 426
8 7 117
8 8 322
8 9 471
Answer:
select facid, extract(month from starttime) as month, sum(slots) as "Total Slots"
from cd.bookings
where
starttime >= '2012-01-01'
and starttime < '2013-01-01'
group by facid, month
order by facid, month;
The main piece of new functionality in this question is the EXTRACT function. EXTRACT allows you to
get individual components of a timestamp, like day, month, year, etc. We group by the output of this
function to provide per-month values. An alternative, if we needed to distinguish between the same
month in different years, is to make use of the DATE_TRUNC function, which truncates a date to a given
granularity.
It's also worth noting that this is the first time we've truly made use of the ability to group by more than
one column.
Find the count of members who have made at least one booking
Find the total number of members who have made at least one booking.
Expected results:
count
30
Answer:
Your first instinct may be to go for a subquery here. Something like the below:
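For instance:

select count(*) from
    (select distinct memid from cd.bookings) as mems;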
This does work perfectly well, but we can simplify a touch with the help of a little extra knowledge in the
form of COUNT DISTINCT . This does what you might expect, counting the distinct values in the passed
column.
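That gives us something along these lines:

select count(distinct memid) from cd.bookings;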
Expected results:
facid Total Slots
0 1320
1 1278
2 1209
4 1404
6 1104
Answer:
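A query along these lines produces the expected list:

select facid, sum(slots) as "Total Slots"
from cd.bookings
group by facid
having sum(slots) > 1000
order by facid;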
It turns out that there's actually an SQL keyword designed to help with the filtering of output from
aggregate functions. This keyword is HAVING .
The behaviour of HAVING is easily confused with that of WHERE . The best way to think about it is that in
the context of a query with an aggregate function, WHERE is used to filter what data gets input into the
aggregate function, while HAVING is used to filter the data once it is output from the function. Try
experimenting to explore this difference!
Expected results:
name revenue
Answer:
select facs.name, sum(slots * case
when memid = 0 then facs.guestcost
else facs.membercost
end) as revenue
from cd.bookings bks
inner join cd.facilities facs
on bks.facid = facs.facid
group by facs.name
order by revenue;
The only real complexity in this query is that guests (member ID 0) have a different cost to everyone else.
We use a case statement to produce the cost for each session, and then sum each of those sessions,
grouped by facility.
Expected results:
name revenue
Answer:
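A sketch of the subquery-based approach described below:

select name, revenue from (
    select facs.name, sum(case
            when memid = 0 then slots * facs.guestcost
            else slots * facs.membercost
        end) as revenue
    from cd.bookings bks
    inner join cd.facilities facs
        on bks.facid = facs.facid
    group by facs.name
) as agg
where revenue < 1000
order by revenue;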
You may well have tried to use the HAVING keyword we introduced in an earlier exercise, producing
something like below:
select facs.name, sum(case
when memid = 0 then slots * facs.guestcost
else slots * membercost
end) as revenue
from cd.bookings bks
inner join cd.facilities facs
on bks.facid = facs.facid
group by facs.name
having revenue < 1000
order by revenue;
Unfortunately, this doesn't work! You'll get an error along the lines of ERROR: column "revenue"
does not exist . Postgres, unlike some other RDBMSs like SQL Server and MySQL, doesn't support
putting column names in the HAVING clause. This means that for this query to work, you'd have to
produce something like below:
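That is, repeating the whole CASE expression inside the HAVING clause - something like:

select facs.name, sum(case
        when memid = 0 then slots * facs.guestcost
        else slots * facs.membercost
    end) as revenue
from cd.bookings bks
inner join cd.facilities facs
    on bks.facid = facs.facid
group by facs.name
having sum(case
        when memid = 0 then slots * facs.guestcost
        else slots * facs.membercost
    end) < 1000
order by revenue;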
Having to repeat significant calculation code like this is messy, so our anointed solution instead just wraps
the main query body as a subquery, and selects from it using a WHERE clause. In general, I recommend
using HAVING for simple queries, as it increases clarity. Otherwise, this subquery approach is often easier
to use.
Output the facility id that has the highest number of slots booked
Output the facility id that has the highest number of slots booked. For bonus points, try a version without a
LIMIT clause. This version will probably look messy!
Expected results:
4 1404
Answer:
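A query along these lines, ordering the totals and keeping only the first row, does the job:

select facid, sum(slots) as "Total Slots"
from cd.bookings
group by facid
order by sum(slots) desc
limit 1;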
It's worth realising, though, that this method has a significant weakness. In the event of a tie, we will still
only get one result! To get all the relevant results, we might try using the MAX aggregate function,
something like below:
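For instance (this will fail):

select facid, max(total) from (
    select facid, sum(slots) as total
    from cd.bookings
    group by facid
) as sub;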
The intent of this query is to get the highest totalslots value and its associated facid(s). Unfortunately, this
just won't work! In the event of multiple facids having the same number of slots booked, it would be
ambiguous which facid should be paired up with the single (or scalar ) value coming out of the MAX
function. This means that Postgres will tell you that facid ought to be in a GROUP BY section, which won't
produce the results we're looking for.
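A working LIMIT-free version, as described below, looks something like this:

select facid, sum(slots) as totalslots
from cd.bookings
group by facid
having sum(slots) = (
    select max(sum2.totalslots) from (
        select sum(slots) as totalslots
        from cd.bookings
        group by facid
    ) as sum2
);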
The query produces a list of facility IDs and number of slots used, and then uses a HAVING clause that
works out the maximum totalslots value. We're essentially saying: 'produce a list of facids and their
number of slots booked, and filter out all the ones that don't have a number of slots booked equal to the
maximum.'
Useful as HAVING is, however, our query is pretty ugly. To improve on that, let's introduce another new
concept: Common Table Expressions (CTEs). CTEs can be thought of as allowing you to define a database
view inline in your query. It's really helpful in situations like this, where you're having to repeat yourself a
lot.
CTEs are declared in the form WITH CTEName as (SQL-Expression) . You can see our query
redefined to use a CTE below:
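A sketch of the CTE-based version:

with sum as (
    select facid, sum(slots) as totalslots
    from cd.bookings
    group by facid
)
select facid, totalslots
from sum
where totalslots = (select max(totalslots) from sum);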
You can see that we've factored out our repeated selections from cd.bookings into a single CTE, and made
the query a lot simpler to read in the process!
BUT WAIT. There's more. It's also possible to complete this problem using Window Functions. We'll leave
these until later, but even better solutions to problems like these are available.
That's a lot of information for a single exercise. Don't worry too much if you don't get it all right now - we'll
reuse these concepts in later exercises.
List the total slots booked per facility per month, Part 2
Produce a list of the total number of slots booked per facility per month in the year of 2012. In this version,
include output rows containing totals for all months per facility, and a total for all months for all facilities.
The output table should consist of facility id, month and slots, sorted by the id and month. When
calculating the aggregated values for all months and all facids, return null values in the month and facid
columns.
Expected results:
facid month slots
0 7 270
0 8 459
0 9 591
0 1320
1 7 207
1 8 483
1 9 588
1 1278
2 7 180
2 8 459
2 9 570
2 1209
3 7 104
3 8 304
3 9 422
3 830
4 7 264
4 8 492
4 9 648
4 1404
5 7 24
5 8 82
5 9 122
5 228
6 7 164
6 8 400
6 9 540
6 1104
7 7 156
7 8 326
7 9 426
7 908
8 7 117
8 8 322
8 9 471
8 910
9191
Answer:
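A sketch of the ROLLUP-based answer described at the end of this explanation:

select facid, extract(month from starttime) as month, sum(slots) as slots
from cd.bookings
where
    starttime >= '2012-01-01'
    and starttime < '2013-01-01'
group by rollup(facid, extract(month from starttime))
order by facid, month;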
When we are doing data analysis, we sometimes want to perform multiple levels of aggregation to allow
ourselves to 'zoom' in and out to different depths. In this case, we might be looking at each facility's
overall usage, but then want to dive in to see how they've performed on a per-month basis. Using the SQL
we know so far, it's quite cumbersome to produce a single query that does what we want - we effectively
have to resort to concatenating multiple queries using UNION ALL :
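Something along these lines:

select facid, extract(month from starttime) as month, sum(slots) as slots
from cd.bookings
where starttime >= '2012-01-01' and starttime < '2013-01-01'
group by facid, month
union all
select facid, null, sum(slots)
from cd.bookings
where starttime >= '2012-01-01' and starttime < '2013-01-01'
group by facid
union all
select null, null, sum(slots)
from cd.bookings
where starttime >= '2012-01-01' and starttime < '2013-01-01'
order by facid, month;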
As you can see, each subquery performs a different level of aggregation, and we just combine the results.
We can clean this up a lot by factoring out commonalities using a CTE:
with bookings as (
select facid, extract(month from starttime) as month, slots
from cd.bookings
where
starttime >= '2012-01-01'
and starttime < '2013-01-01'
)
select facid, month, sum(slots) from bookings group by facid, month
union all
select facid, null, sum(slots) from bookings group by facid
union all
select null, null, sum(slots) from bookings
order by facid, month;
This version is not excessively hard on the eyes, but it becomes cumbersome as the number of aggregation
columns increases. Fortunately, PostgreSQL 9.5 introduced support for the ROLLUP operator, which we've
used to simplify our accepted answer.
ROLLUP produces a hierarchy of aggregations in the order passed into it: for example, ROLLUP(facid,
month) outputs aggregations on (facid, month), (facid), and (). If we wanted an aggregation of all facilities
for a month (instead of all months for a facility) we'd have to reverse the order, using ROLLUP(month,
facid) . Alternatively, if we instead want all possible permutations of the columns we pass in, we can use
CUBE rather than ROLLUP . This will produce (facid, month), (month), (facid), and ().
ROLLUP and CUBE are special cases of GROUPING SETS . GROUPING SETS allow you to specify the
exact aggregation permutations you want: you could, for example, ask for just (facid, month) and (facid),
skipping the top-level aggregation.
Expected results:
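Answer:
A sketch matching the explanation below; the output column name and exact format string are assumptions:

select facs.facid, facs.name,
    trim(to_char(sum(bks.slots)/2.0, '9999999999999999D99')) as "Total Hours"
from cd.bookings bks
inner join cd.facilities facs
    on facs.facid = bks.facid
group by facs.facid, facs.name
order by facs.facid;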
There's a few little pieces of interest in this question. Firstly, you can see that our aggregation works just
fine when we join to another table on a 1:1 basis. Also note that we group by both facs.facid and
facs.name. This might seem odd: after all, since facid is the primary key of the facilities table, each
facid has exactly one name, and grouping by both fields is the same as grouping by facid alone. In fact,
you'll find that if you remove facs.name from the GROUP BY clause, the query works just fine:
Postgres works out that this 1:1 mapping exists, and doesn't insist that we group by both columns.
Unfortunately, depending on which database system we use, validation might not be so smart, and may
not realise that the mapping is strictly 1:1. That being the case, if there were multiple names for each
facid and we hadn't grouped by name , the DBMS would have to choose between multiple (equally
valid) choices for the name . Since this is invalid, the database system will insist that we group by both
fields. In general, I recommend grouping by all columns you don't have an aggregate function on: this will
ensure better cross-platform compatibility.
Next up is the division. Those of you familiar with MySQL may be aware that integer divisions are
automatically cast to floats. Postgres is a little more traditional in this respect, and expects you to tell it if
you want a floating point division. You can do that easily in this case by dividing by 2.0 rather than 2.
Finally, let's take a look at formatting. The TO_CHAR function converts values to character strings. It takes
a formatting string, which we specify as (up to) lots of numbers before the decimal place, decimal place,
and two numbers after the decimal place. The output of this function can be prepended with a space,
which is why we include the outer TRIM function.
Expected results:
surname firstname memid starttime
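Answer:
A query along these lines produces each member's first booking on or after 1 September 2012:

select mems.surname, mems.firstname, mems.memid, min(bks.starttime) as starttime
from cd.bookings bks
inner join cd.members mems
    on mems.memid = bks.memid
where bks.starttime >= '2012-09-01'
group by mems.surname, mems.firstname, mems.memid
order by mems.memid;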
This answer demonstrates the use of aggregate functions on dates. MIN works exactly as you'd expect,
pulling out the lowest possible date in the result set. To make this work, we need to ensure that the result
set only contains dates from September onwards. We do this using the WHERE clause.
You might typically use a query like this to find a customer's next booking. You can use this by replacing
the date '2012-09-01' with the function now()
Produce a list of member names, with each row containing the total
member count
Produce a list of member names, with each row containing the total member count. Order by join date.
Expected results:
count firstname surname
31 GUEST GUEST
31 Darren Smith
31 Tracy Smith
31 Tim Rownam
31 Janice Joplette
31 Gerald Butters
31 Burton Tracy
31 Nancy Dare
31 Tim Boothe
31 Ponder Stibbons
31 Charles Owen
31 David Jones
31 Anne Baker
31 Jemima Farrell
31 Jack Smith
31 Florence Bader
31 Timothy Baker
31 David Pinker
31 Matthew Genting
31 Anna Mackenzie
31 Joan Coplin
31 Ramnaresh Sarwin
31 Douglas Jones
31 Henrietta Rumney
31 David Farrell
31 Henry Worthington-Smyth
31 Millicent Purview
31 Hyacinth Tupperware
31 John Hunt
31 Erica Crumpet
31 Darren Smith
Answer:
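The window-function version discussed below looks something like this:

select count(*) over(), firstname, surname
from cd.members
order by joindate;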
Using the knowledge we've built up so far, the most obvious answer to this is below. We use a subquery
because otherwise SQL will require us to group by firstname and surname, producing a different result to
what we're looking for.
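For instance:

select (select count(*) from cd.members) as count, firstname, surname
from cd.members
order by joindate;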
There's nothing at all wrong with this answer, but we've chosen a different approach to introduce a new
concept called window functions. Window functions provide enormously powerful capabilities, in a form
often more convenient than the standard aggregation functions. While this exercise is only a toy, we'll be
working on more complicated examples in the near future.
Window functions operate on the result set of your (sub-)query, after the WHERE clause and all standard
aggregation. They operate on a window of data. By default this is unrestricted: the entire result set, but it
can be restricted to provide more useful results. For example, suppose instead of wanting the count of all
members, we want the count of all members who joined in the same month as that member:
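Something along these lines:

select count(*) over(partition by date_trunc('month', joindate)),
    firstname, surname
from cd.members
order by joindate;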
In this example, we partition the data by month. For each row the window function operates over, the
window is any rows that have a joindate in the same month. The window function thus produces a count
of the number of members who joined in that month.
You can go further. Imagine if, instead of the total number of members who joined that month, you want
to know what number joinee they were that month. You can do this by adding in an ORDER BY to the
window function:
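For example:

select count(*) over(partition by date_trunc('month', joindate) order by joindate),
    firstname, surname
from cd.members
order by joindate;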
The ORDER BY changes the window again. Instead of the window for each row being the entire partition,
the window goes from the start of the partition to the current row, and not beyond. Thus, for the first
member who joins in a given month, the count is 1. For the second, the count is 2, and so on.
One final thing that's worth mentioning about window functions: you can have multiple unrelated ones in
the same query. Try out the query below for an example - you'll see the numbers for the members going in
opposite directions! This flexibility can lead to more concise, readable, and maintainable queries.
select count(*) over(partition by date_trunc('month',joindate) order by joindate asc),
    count(*) over(partition by date_trunc('month',joindate) order by joindate desc),
    firstname, surname
from cd.members
order by joindate
Window functions are extraordinarily powerful, and they will change the way you write and think about
SQL. Make good use of them!
Expected results:
row_number firstname surname
1 GUEST GUEST
2 Darren Smith
3 Tracy Smith
4 Tim Rownam
5 Janice Joplette
6 Gerald Butters
7 Burton Tracy
8 Nancy Dare
9 Tim Boothe
10 Ponder Stibbons
11 Charles Owen
12 David Jones
13 Anne Baker
14 Jemima Farrell
15 Jack Smith
16 Florence Bader
17 Timothy Baker
18 David Pinker
19 Matthew Genting
20 Anna Mackenzie
21 Joan Coplin
22 Ramnaresh Sarwin
23 Douglas Jones
24 Henrietta Rumney
25 David Farrell
26 Henry Worthington-Smyth
27 Millicent Purview
28 Hyacinth Tupperware
29 John Hunt
30 Erica Crumpet
31 Darren Smith
Answer:
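A query along these lines produces the numbered list:

select row_number() over(order by joindate), firstname, surname
from cd.members
order by joindate;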
This exercise is a simple bit of window function practise! You could just as easily use count(*)
over(order by joindate) here, so don't worry if you used that instead.
In this query, we don't define a partition, meaning that the partition is the entire dataset. Since we define
an order for the window function, for any given row the window is: start of the dataset -> current row.
Output the facility id that has the highest number of slots booked, again
Output the facility id that has the highest number of slots booked. Ensure that in the event of a tie, all
tieing results get output.
Expected results:
facid total
4 1404
Answer:
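A sketch of the window-function approach described below:

select facid, total from (
    select facid, sum(slots) as total,
        rank() over (order by sum(slots) desc) as rank
    from cd.bookings
    group by facid
) as ranked
where rank = 1;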
You may recall that this is a problem we've already solved in an earlier exercise. We came up with an
answer using a HAVING clause to keep only the facilities whose total matches the maximum, which we then cut down using CTEs.
Once we've cleaned it up, this solution is perfectly adequate. Explaining how the query works makes it
seem a little odd, though - 'find the number of slots booked by the best facility. Calculate the total slots
booked for each facility, and return only the rows where the slots booked are the same as for the best'.
Wouldn't it be nicer to be able to say 'calculate the number of slots booked for each facility, rank them, and
pick out any at rank 1'?
Fortunately, window functions allow us to do this - although it's fair to say that doing so is not trivial to the
untrained eye. The first key piece of information is the existence of the RANK function. This ranks values based
on the ORDER BY that is passed to it. If there's a tie for (say) second place, the next entry gets ranked at
position 4. So, what we need to do is get the number of slots for each facility, rank them, and pick off the
ones at the top rank. A first pass at this might look something like the below:
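Something along these lines - a sketch rather than the exercise's exact text:
select facid, total from (
	select facid, total, rank() over (order by total desc) as rank from (
		select facid, sum(slots) as total
			from cd.bookings
			group by facid
		) as sumslots
	) as ranked
where rank = 1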
The inner query calculates the total slots booked, the middle one ranks them, and the outer one creams
off the top ranked. We can actually tidy this up a little: recall that window functions get applied pretty late
in the SELECT statement, after aggregation. That being the case, we can move the aggregation into the
ORDER BY part of the window function, as shown in the approved answer.
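That tidied-up form looks roughly like this (the approved answer may differ in minor details):
select facid, total from (
	select facid, sum(slots) as total,
		rank() over (order by sum(slots) desc) as rank
		from cd.bookings
		group by facid
	) as ranked
where rank = 1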
While the window function approach isn't massively simpler in terms of lines of code, it arguably makes
more semantic sense.
Expected results:
firstname surname hours rank
Jemima Farrell 90 18
David Pinker 80 19
Ramnaresh Sarwin 80 19
Matthew Genting 70 21
Joan Coplin 50 22
David Farrell 30 23
Henry Worthington-Smyth 30 23
John Hunt 20 25
Douglas Jones 20 25
Millicent Purview 20 25
Henrietta Rumney 20 25
Erica Crumpet 10 29
Hyacinth Tupperware 10 29
Answer:
This answer isn't a great stretch over our previous exercise, although it does illustrate the function of
RANK better. You can see that some of the clubgoers have an equal rounded number of hours booked in,
and their rank is the same. If position 2 is shared between two members, the next one along gets position
4. There's a different function, DENSE_RANK , that would assign that member position 3 instead.
It's worth noting the technique we use to do rounding here. Adding 5, dividing by 10, and multiplying by
10 has the effect (thanks to integer arithmetic cutting off fractions) of rounding a number to the nearest
10. In our case, because slots are half an hour, we need to add 10, divide by 20, and multiply by 10. One
could certainly make the argument that we should do the slots -> hours conversion independently of the
rounding, which would increase clarity.
Talking of clarity, this rounding malarky is starting to introduce a noticeable amount of code repetition. At
this point it's a judgement call, but you may wish to factor it out using a subquery as below:
select firstname, surname, hours, rank() over (order by hours desc) as rank from
	(select mems.firstname, mems.surname, ((sum(bks.slots)+10)/20)*10 as hours
		from cd.bookings bks
		inner join cd.members mems on bks.memid = mems.memid
		group by mems.memid) as subq
order by rank, surname, firstname;
Expected results:
name rank
Massage Room 1 1
Massage Room 2 2
Tennis Court 2 3
Answer:
select name, rank from (
select facs.name as name, rank() over (order by sum(case
when memid = 0 then slots * facs.guestcost
else slots * membercost
end) desc) as rank
from cd.bookings bks
inner join cd.facilities facs
on bks.facid = facs.facid
group by facs.name
) as subq
where rank <= 3
order by rank;
This question doesn't introduce any new concepts, and is just intended to give you the opportunity to
practise what you already know. We use the CASE statement to calculate the revenue for each slot, and
aggregate that on a per-facility basis using SUM . We then use the RANK window function to produce a
ranking, wrap it all up in a subquery, and extract everything with a rank less than or equal to 3.
Expected results:
name revenue
Answer:
This exercise should mostly use familiar concepts, although we do introduce the NTILE window function.
NTILE groups values into a passed-in number of groups, as evenly as possible. It outputs a number from
1 to the number of groups. We then use a CASE statement to turn that number into a label!
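A sketch of that approach - the 'high'/'average'/'low' labels are assumed from the exercise's classification:
select name, case when class = 1 then 'high'
		when class = 2 then 'average'
		else 'low' end as revenue
	from (
		select facs.name as name,
			ntile(3) over (order by sum(case
				when memid = 0 then slots * facs.guestcost
				else slots * membercost
				end) desc) as class
			from cd.bookings bks
			inner join cd.facilities facs
				on bks.facid = facs.facid
			group by facs.name
	) as subq
order by class, name;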
Expected results:
name months
Answer:
select name,
initialoutlay / (monthlyrevenue - monthlymaintenance) as repaytime
from
(select facs.name as name,
facs.initialoutlay as initialoutlay,
facs.monthlymaintenance as monthlymaintenance,
sum(case
when memid = 0 then slots * facs.guestcost
else slots * membercost
end)/3 as monthlyrevenue
from cd.bookings bks
inner join cd.facilities facs
on bks.facid = facs.facid
group by facs.facid
) as subq
order by name;
But, I hear you ask, what would an automatic version of this look like? One that didn't need to have a
hard-coded number of months in it? That's a little more complicated, and involves some date arithmetic.
I've factored that out into a CTE to make it a little more clear.
with monthdata as (
select mincompletemonth,
maxcompletemonth,
(extract(year from maxcompletemonth)*12) +
extract(month from maxcompletemonth) -
(extract(year from mincompletemonth)*12) -
extract(month from mincompletemonth) as nummonths
from (
select date_trunc('month',
(select max(starttime) from cd.bookings)) as maxcompletemonth,
date_trunc('month',
(select min(starttime) from cd.bookings)) as mincompletemonth
) as subq
)
select name,
initialoutlay / (monthlyrevenue - monthlymaintenance) as repaytime
from
(select facs.name as name,
facs.initialoutlay as initialoutlay,
facs.monthlymaintenance as monthlymaintenance,
sum(case
when memid = 0 then slots * facs.guestcost
else slots * membercost
end)/(select nummonths from monthdata) as monthlyrevenue
		from cd.bookings bks
		inner join cd.facilities facs
			on bks.facid = facs.facid
		where bks.starttime < (select maxcompletemonth from monthdata)
		group by facs.facid
	) as subq
order by name;
This code restricts the data that goes in to complete months. It does this by selecting the maximum date,
rounding down to the month, and stripping out all dates larger than that. Even this code is not completely
complete. It doesn't handle the case of a facility making a loss. Fixing that is not too hard, and is left as
(another) exercise for the reader!
Expected results:
date revenue
2012-08-01 1126.8333333333333333
2012-08-02 1153.0000000000000000
2012-08-03 1162.9000000000000000
2012-08-04 1177.3666666666666667
2012-08-05 1160.9333333333333333
2012-08-06 1185.4000000000000000
2012-08-07 1182.8666666666666667
2012-08-08 1172.6000000000000000
2012-08-09 1152.4666666666666667
2012-08-10 1175.0333333333333333
2012-08-11 1176.6333333333333333
2012-08-12 1195.6666666666666667
2012-08-13 1218.0000000000000000
2012-08-14 1247.4666666666666667
2012-08-15 1274.1000000000000000
2012-08-16 1281.2333333333333333
2012-08-17 1324.4666666666666667
2012-08-18 1373.7333333333333333
2012-08-19 1406.0666666666666667
2012-08-20 1427.0666666666666667
2012-08-21 1450.3333333333333333
2012-08-22 1539.7000000000000000
2012-08-23 1567.3000000000000000
2012-08-24 1592.3333333333333333
2012-08-25 1615.0333333333333333
2012-08-26 1631.2000000000000000
2012-08-27 1659.4333333333333333
2012-08-28 1687.0000000000000000
2012-08-29 1684.6333333333333333
2012-08-30 1657.9333333333333333
2012-08-31 1703.4000000000000000
Answer:
select dategen.date,
	( -- correlated subquery that, for each day fed into it,
	  -- finds the average revenue for the last 15 days
	select sum(case
		when memid = 0 then slots * facs.guestcost
		else slots * membercost
		end) as rev
		from cd.bookings bks
		inner join cd.facilities facs on bks.facid = facs.facid
		where bks.starttime > dategen.date - interval '14 days'
			and bks.starttime < dategen.date + interval '1 day'
	)/15 as revenue
	from (select cast(generate_series(timestamp '2012-08-01',
		'2012-08-31', '1 day') as date) as date) as dategen
order by dategen.date;
There are at least two equally good solutions to this question. I've put the simplest to write as the answer,
but there's also a more flexible solution that uses window functions.
Let's look at the selected answer first. When I read SQL queries, I tend to read the SELECT part of the
statement last - the FROM and WHERE parts tend to be more interesting. So, what do we have in our
FROM ? A call to the GENERATE_SERIES function. This does pretty much what it says on the tin -
generates a series of values. You can specify a start value, a stop value, and an increment. It works for
integer types and dates - although, as you can see, we need to be explicit about what types are going into
and out of the function. Try removing the casts, and seeing the result!
So, we've generated a timestamp for each day in August. Now, for each day, we need to generate our
average. We can do this using a correlated subquery . If you remember, a correlated subquery is a
subquery that uses values from the outer query. This means that it gets executed once for each result row
in the outer query. This is in contrast to an uncorrelated subquery, which only has to be executed once.
If we look at our correlated subquery, we can see that it's correlated on the dategen.date field. It produces
a sum of revenue for this day and the 14 days prior to it, and then divides that sum by 15. This produces
the output we're looking for!
I mentioned that there's a window function-based solution for this problem as well - you can see it below.
The approach we use for this is generating a list of revenue for each day, and then using window function
aggregation over that list. The nice thing about this method is that once you have the per-day revenue, you
can produce a wide range of results quite easily - you might, for example, want rolling averages for the
previous month, 15 days, and 5 days. This is easy to do using this method, and rather harder using
conventional aggregation.
You'll note that we've been wanting to work out daily revenue quite frequently. Rather than inserting that
calculation into all our queries, which is rather messy (and will cause us a big headache if we ever change
our schema), we probably want to store that information somewhere. Your first thought might be to
calculate information and store it somewhere for later use. This is a common tactic for large data
warehouses, but it can cause us some problems - if we ever go back and edit our data, we need to
remember to recalculate. For non-enormous-scale data like we're looking at here, we can just create a
view instead. A view is essentially a stored query that looks exactly like a table. Under the covers, the
DBMS just substitutes in the relevant portion of the view definition when you select data from it. They're
very easy to create, as you can see below:
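A sketch of such a view - the name cd.dailyrevenue and the column names date and rev are chosen to match the query further down:
create or replace view cd.dailyrevenue as
	select cast(bks.starttime as date) as date,
		sum(case
			when memid = 0 then slots * facs.guestcost
			else slots * membercost
			end) as rev
		from cd.bookings bks
		inner join cd.facilities facs
			on bks.facid = facs.facid
		group by cast(bks.starttime as date);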
You can see that this makes our query an awful lot simpler!
select date, avgrev from (
select dategen.date as date,
avg(revdata.rev) over(order by dategen.date rows 14 preceding) as avgrev
from
(select
cast(generate_series(timestamp '2012-07-10', '2012-08-31','1 day') as
date) as date
) as dategen
left outer join
cd.dailyrevenue as revdata on dategen.date = revdata.date
) as subq
where date >= '2012-08-01'
order by date;
As well as storing frequently-used query fragments, views can be used for a variety of purposes, including
restricting access to certain columns of a table.
Dates/Times in SQL are a complex topic, deserving of a category of their own. They're also fantastically
powerful, making it easier to work with variable-length concepts like 'months' than many programming
languages.
Before getting started on this category, it's probably worth taking a look over the PostgreSQL docs page on
date/time functions. You might also want to complete the aggregate functions category, since we'll use
some of those capabilities in this section.
Expected results:
timestamp
2012-08-31 01:00:00
Answer:
Here's a pretty easy question to start off with! SQL has a bunch of different date and time types, which you
can peruse at your leisure over at the excellent Postgres documentation . These basically allow you to
store dates, times, or timestamps (date+time).
The approved answer is the best way to create a timestamp under normal circumstances. You can also use
casts to change a correctly formatted string into a timestamp, for example:
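For instance, either of the following (illustrative) forms will work:
select cast('2012-08-31 01:00:00' as timestamp);
select '2012-08-31 01:00:00'::timestamp; -- Postgres' shorthand cast syntax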
Timestamps can be stored with or without time zone information. We've chosen not to here, but if you like
you could format the timestamp like "2012-08-31 01:00:00 +00:00", assuming UTC. Note that timestamp
with time zone is a different type to timestamp - when you're declaring it, you should use TIMESTAMP
WITH TIME ZONE '2012-08-31 01:00:00 +00:00'.
Finally, have a bit of a play around with some of the different date/time serialisations described in the
Postgres docs. You'll find that Postgres is extremely flexible with the formats it accepts, although my
recommendation to you would be to use the standard serialisation we've used here - you'll find it
unambiguous and easy to port to other DBs.
Expected results:
interval
32 days
Answer:
Subtracting timestamps produces an INTERVAL data type. INTERVAL s are a special data type for
representing the difference between two TIMESTAMP types. When subtracting timestamps, Postgres will
typically give an interval in terms of days, hours, minutes, seconds, without venturing into months. This
generally makes life easier, since months are of variable lengths.
One of the useful things about intervals, though, is the fact that they can encode months. Let's imagine
that I want to schedule something to occur in exactly one month's time, regardless of the length of my
month. To do this, I could use [timestamp] + interval '1 month' .
Intervals stand in contrast to SQL's treatment of DATE types. Dates don't use intervals - instead,
subtracting two dates will return an integer representing the number of days between the two dates. You
can also add integer values to dates. This is sometimes more convenient, depending on how much
intelligence you require in the handling of your dates!
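As a quick illustration of the difference (not part of the approved answer):
select timestamp '2012-08-31 01:00:00' + interval '1 month'; -- 2012-09-30 01:00:00, clamped to the month end
select date '2012-08-31' - date '2012-07-30';                -- 32, an integer number of days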
Expected results:
ts
2012-10-01 00:00:00
2012-10-02 00:00:00
2012-10-03 00:00:00
2012-10-04 00:00:00
2012-10-05 00:00:00
2012-10-06 00:00:00
2012-10-07 00:00:00
2012-10-08 00:00:00
2012-10-09 00:00:00
2012-10-10 00:00:00
2012-10-11 00:00:00
2012-10-12 00:00:00
2012-10-13 00:00:00
2012-10-14 00:00:00
2012-10-15 00:00:00
2012-10-16 00:00:00
2012-10-17 00:00:00
2012-10-18 00:00:00
2012-10-19 00:00:00
2012-10-20 00:00:00
2012-10-21 00:00:00
2012-10-22 00:00:00
2012-10-23 00:00:00
2012-10-24 00:00:00
2012-10-25 00:00:00
2012-10-26 00:00:00
2012-10-27 00:00:00
2012-10-28 00:00:00
2012-10-29 00:00:00
2012-10-30 00:00:00
2012-10-31 00:00:00
Answer:
One of the best features of Postgres over other DBs is a simple function called GENERATE_SERIES . This
function allows you to generate a list of dates or numbers, specifying a start, an end, and an increment
value. It's extremely useful for situations where you want to output, say, sales per day over the course of a
month. A typical way to do that on a table containing a list of sales might be to use a SUM aggregation,
grouping by the date and product type. Unfortunately, this approach has a flaw: if there are no sales for a
given day, it won't show up! To make it work properly, you need to left join from a sequential list of
timestamps to the aggregated data to fill in the blank spaces.
On other database systems, it's not uncommon to keep a 'calendar table' full of dates, with which you can
perform these joins. Alternatively, on some systems you can write an analogue to generate_series using
recursive CTEs. Fortunately for us, Postgres makes our lives a lot easier!
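A sketch that produces the expected October timestamps above:
select generate_series(timestamp '2012-10-01', timestamp '2012-10-31',
	interval '1 day') as ts;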
Expected results:
date_part
31
Answer:
The EXTRACT function is used for getting sections of a timestamp or interval. You can get the value of
any field in the timestamp as an integer.
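For example, assuming the exercise's timestamp of 2012-08-31, the day of the month comes out as:
select extract(day from timestamp '2012-08-31');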
Expected results:
date_part
169200
Answer:
If you want to write more portable code, you will unfortunately find that you cannot use EXTRACT(EPOCH
FROM ...). Instead you will need to use something like:
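One portable formulation - a sketch, assuming the exercise's timestamps of '2012-08-31 01:00:00' and '2012-09-02 00:00:00' (consistent with the 169200 seconds above) - is to pull the interval apart field by field:
select extract(day from ts.diff)*24*60*60
	+ extract(hour from ts.diff)*60*60
	+ extract(minute from ts.diff)*60
	+ extract(second from ts.diff)
	from (select timestamp '2012-09-02 00:00:00'
		- timestamp '2012-08-31 01:00:00' as diff) ts;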
This is, as you can observe, rather awful. If you're planning to write cross platform SQL, I would consider
having a library of common user defined functions for each DBMS, allowing you to normalise any common
requirements like this. This keeps your main codebase a lot cleaner.
Expected results:
month length
1 31 days
2 29 days
3 31 days
4 30 days
5 31 days
6 30 days
7 31 days
8 31 days
9 30 days
10 31 days
11 30 days
12 31 days
Answer:
select extract(month from cal.month) as month,
(cal.month + interval '1 month') - cal.month as length
from
(
select generate_series(timestamp '2012-01-01', timestamp '2012-12-01',
interval '1 month') as month
) cal
order by month;
This answer shows several of the concepts we've learned. We use the GENERATE_SERIES function to
produce a year's worth of timestamps, incrementing a month at a time. We then use the EXTRACT
function to get the month number. Finally, we subtract each timestamp from itself plus one month, which
gives us an interval equal to the length of that month.
It's worth noting that subtracting two timestamps will always produce an interval in terms of days (or
portions of a day). You won't just get an answer in terms of months or years, because the length of those
time periods is variable.
Expected results:
remaining
19 days
Answer:
The star of this particular show is the DATE_TRUNC function. It does pretty much what you'd expect -
truncates a date to a given minute, hour, day, month, and so on. The way we've solved this problem is to
truncate our timestamp to find the month we're in, add a month to that, and subtract our timestamp. To
ensure partial days get treated as whole days, the timestamp we subtract is truncated to the nearest day.
Note the way we've put the timestamp into a subquery. This isn't required, but it does mean you can give
the timestamp a name, rather than having to list the literal repeatedly.
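A sketch of that approach, assuming the exercise's example timestamp of '2012-02-11 01:00:00' (consistent with the 19 days shown above):
select (date_trunc('month', ts.testts) + interval '1 month')
	- date_trunc('day', ts.testts) as remaining
	from (select timestamp '2012-02-11 01:00:00' as testts) ts;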
Expected results:
starttime endtime
Answer:
This question simply returns the start time for a booking, and a calculated end time which is equal to
start time + (30 minutes * slots) . Note that it's perfectly okay to multiply an interval by a number.
The other thing you'll notice is the use of order by and limit to get the last ten bookings. All this does is
order the bookings by the (descending) time at which they end, and pick off the top ten.
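A sketch of such a query:
select starttime, starttime + slots * (interval '30 minutes') as endtime
	from cd.bookings
	order by endtime desc, starttime desc
	limit 10;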
Expected results:
month count
2013-01-01 00:00:00 1
Answer:
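A sketch consistent with the month/count output above - grouping bookings by the month they start in - would be:
select date_trunc('month', starttime) as month, count(*)
	from cd.bookings
	group by month
	order by month;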
Expected results:
name month utilisation
Answer:
select name, month,
round((100*slots)/
cast(
25*(cast((month + interval '1 month') as date)
- cast (month as date)) as numeric),1) as utilisation
from (
select facs.name as name, date_trunc('month', starttime) as month,
sum(slots) as slots
from cd.bookings bks
inner join cd.facilities facs
on bks.facid = facs.facid
group by facs.facid, month
) as inn
order by name, month
The meat of this query (the inner subquery) is really quite simple: an aggregation to work out the total
number of slots used per facility per month. If you've covered the rest of this section and the category on
aggregates, you likely didn't find this bit too challenging.
This query does, unfortunately, have some other complexity in it: working out the number of days in each
month. We can calculate the number of days between two months by subtracting two timestamps with a
month between them. This, unfortunately, gives us back an interval datatype, which we can't use to do
mathematics. In this case we've worked around that limitation by converting our timestamps into dates
before subtracting. Subtracting date types gives us an integer number of days.
An alternative to this workaround is to convert the interval into an epoch value: that is, a number of
seconds. To do this use EXTRACT(EPOCH FROM month)/(24*60*60) . This is arguably a much nicer
way to do things, but is much less portable to other database systems.
String Operations
String operations in most RDBMSs are, arguably, needlessly painful. Fortunately, Postgres is better than
most in this regard, providing strong regular expression support. This section covers basic string
manipulation, use of the LIKE operator, and use of regular expressions. I also make an effort to show you
some alternative approaches that work reliably in most RDBMSs. Be sure to check out Postgres' string
function docs page if you're not confident about these exercises.
Anthony Molinaro's SQL Cookbook provides some excellent documentation of (difficult) cross-DBMS
compliant SQL string manipulation. I'd strongly recommend his book, particularly as it's published by
O'Reilly, whose ethical policy of DRM-free ebook distribution deserves rich rewards.
Expected results:
name
GUEST, GUEST
Smith, Darren
Smith, Tracy
Rownam, Tim
Joplette, Janice
Butters, Gerald
Tracy, Burton
Dare, Nancy
Boothe, Tim
Stibbons, Ponder
Owen, Charles
Jones, David
Baker, Anne
Farrell, Jemima
Smith, Jack
Bader, Florence
Baker, Timothy
Pinker, David
Genting, Matthew
Mackenzie, Anna
Coplin, Joan
Sarwin, Ramnaresh
Jones, Douglas
Rumney, Henrietta
Farrell, David
Worthington-Smyth, Henry
Purview, Millicent
Tupperware, Hyacinth
Hunt, John
Crumpet, Erica
Smith, Darren
Answer:
Building strings in sql is similar to other languages, with the exception of the concatenation operator: ||.
Some systems (like SQL Server) use +, but || is the SQL standard.
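A sketch producing the 'Surname, Firstname' output shown above:
select surname || ', ' || firstname as name
	from cd.members;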
Expected results:
Answer:
The SQL LIKE operator is a highly standard way of searching for a string using basic matching. The %
character matches any string, while _ matches any single character.
One point that's worth considering when you use LIKE is how it uses indexes. If you're using the 'C'
locale , any LIKE string with a fixed beginning (as in our example here) can use an index. If you're using
any other locale, LIKE will not use any index by default. See here for details on how to change that.
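A sketch of the sort of query in question - the 'Tennis' prefix is assumed from the exercise:
select * from cd.facilities
	where name like 'Tennis%';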
Expected results:
Answer:
There's no direct operator for case-insensitive comparison in standard SQL. Fortunately, we can take a
page from many other languages' books, and simply force all values into upper case when we do our
comparison. This renders case irrelevant, and gives us our result.
Alternatively, Postgres does provide the ILIKE operator, which performs case insensitive searches. This
isn't standard SQL, but it's arguably more clear.
You should realise that running a function like UPPER over a column value prevents Postgres from making
use of any indexes on the column (the same is true for ILIKE ). Fortunately, Postgres has got your back:
rather than simply creating indexes over columns, you can also create indexes over expressions . If you
created an index over UPPER(name) , this query could use it quite happily.
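Sketches of both approaches, again assuming a 'tennis' prefix:
select * from cd.facilities where upper(name) like 'TENNIS%';
select * from cd.facilities where name ilike 'tennis%';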
Expected results:
memid telephone
0 (000) 000-0000
3 (844) 693-0723
4 (833) 942-4710
5 (844) 078-4130
6 (822) 354-9973
7 (833) 776-4001
8 (811) 433-2547
9 (833) 160-3900
10 (855) 542-5251
11 (844) 536-8036
13 (855) 016-0163
14 (822) 163-3254
15 (833) 499-3527
20 (811) 972-1377
21 (822) 661-2898
22 (822) 499-2232
24 (822) 413-1470
27 (822) 989-8876
28 (855) 755-9876
29 (855) 894-3758
30 (855) 941-9786
33 (822) 665-5327
35 (899) 720-6978
36 (811) 732-4816
37 (822) 577-3541
Answer:
We've chosen to answer this using regular expressions, although Postgres does provide other string
functions like POSITION that would do the job at least as well. Postgres implements POSIX regular
expression matching via the ~ operator. If you've used regular expressions before, the functionality of the
operator will be very familiar to you.
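A sketch using the ~ operator to pick out members whose telephone number contains a parenthesis:
select memid, telephone from cd.members
	where telephone ~ '[()]'
order by memid;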
As an alternative, you can use the SQL standard SIMILAR TO operator. The regular expressions for this
have similarities to the POSIX standard, but a lot of differences as well. Some of the most notable
differences are:
As in the LIKE operator, SIMILAR TO uses the '_' character to mean 'any character', and the '%'
character to mean 'any string'.
A SIMILAR TO expression must match the whole string, not just a substring as in posix regular
expressions. This means that you'll typically end up bracketing an expression in '%' characters.
The '.' character does not mean 'any character' in SIMILAR TO regexes: it's just a plain character.
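With those caveats in mind, a SIMILAR TO sketch equivalent to the query above looks like:
select memid, telephone from cd.members
	where telephone similar to '%[()]%'
order by memid;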
Finally, it's worth noting that regular expressions usually don't use indexes. Generally you don't want your
regex to be responsible for doing heavy lifting in your query, because it will be slow. If you need fuzzy
matching that works fast, consider working out if your needs can be met by full text search .
Expected results:
zip
00000
00234
00234
04321
04321
10383
11986
23423
28563
33862
34232
43532
43533
45678
52365
54333
56754
57392
58393
64577
65332
65464
66796
68666
69302
75655
78533
80743
84923
87630
97676
Answer:
Postgres' LPAD function is the star of this particular show. It does basically what you'd expect: it allows us to
produce a padded string. We need to remember to cast the zipcode to a string for it to be accepted by the
LPAD function.
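A sketch of the padded-zip query, assuming the zipcode column described in the schema:
select lpad(cast(zipcode as char(5)), 5, '0') as zip
	from cd.members
order by zip;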
When inheriting an old database, it's not that unusual to find wonky decisions having been made over
data types. You may wish to fix mistakes like these, but have a lot of code that would break if you changed
datatypes. In that case, one option (depending on performance requirements) is to create a view over
your table which presents the data in a fixed-up manner, and gradually migrate.
Count the number of members whose surname starts with each letter of the
alphabet
You'd like to produce a count of how many members you have whose surname starts with each letter of the
alphabet. Sort by the letter, and don't worry about printing out a letter if the count is 0.
Expected results:
letter count
B 5
C 2
D 1
F 2
G 2
H 1
J 3
M 1
O 1
P 2
R 2
S 6
T 2
W 1
Answer:
This exercise is fairly straightforward. You simply need to retrieve the first letter of the member's surname,
and do some basic aggregation to achieve a count. We use the SUBSTR function here, but there's a
variety of other ways you can achieve the same thing. The LEFT function, for example, returns you the
first n characters from the left of the string. Alternatively, you could use the SUBSTRING function, which
allows you to use regular expressions to extract a portion of the string.
One point worth noting: as you can see, string functions in SQL are based on 1-indexing, not the 0-indexing
that you're probably used to. This will likely trip you up once or twice before you get used to it :-)
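A sketch using SUBSTR and a simple aggregate:
select substr(mems.surname, 1, 1) as letter, count(*) as count
	from cd.members mems
	group by letter
	order by letter;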
Expected results:
memid telephone
0 0000000000
1 5555555555
2 5555555555
3 8446930723
4 8339424710
5 8440784130
6 8223549973
7 8337764001
8 8114332547
9 8331603900
10 8555425251
11 8445368036
12 8440765141
13 8550160163
14 8221633254
15 8334993527
16 8339410824
17 8114096734
20 8119721377
21 8226612898
22 8224992232
24 8224131470
26 8445368036
27 8229898876
28 8557559876
29 8558943758
30 8559419786
33 8226655327
35 8997206978
36 8117324816
37 8225773541
Answer:
The most direct solution is probably the TRANSLATE function, which can be used to replace characters in
a string. You pass it three strings: the value you want altered, the characters to replace, and the characters
you want them replaced with. In our case, we want all the characters deleted, so our third parameter is an
empty string.
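A sketch of the TRANSLATE approach - the set of characters to strip (dashes, brackets, and spaces) is assumed from the data shown above:
select memid, translate(telephone, '-() ', '') as telephone
	from cd.members
order by memid;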
As is often the way with strings, we can also use regular expressions to solve our problem. The
REGEXP_REPLACE function provides what we're looking for: we simply pass a regex that matches all
non-digit characters, and replace them with nothing, as shown below. The 'g' flag tells the function to
replace as many instances of the pattern as it can find. This solution is perhaps more robust, as it cleans
out more bad formatting.
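And a sketch of the REGEXP_REPLACE version:
select memid, regexp_replace(telephone, '[^0-9]', '', 'g') as telephone
	from cd.members
order by memid;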
Making automated use of free-formatted text data can be a chore. Ideally you want to avoid having to
constantly write code to clean up the data before using it, so you should consider having your database
enforce correct formatting for you. You can do this using a CHECK constraint on your column, which allows
you to reject any poorly-formatted entry. It's tempting to perform this kind of validation in the application
layer, and this is certainly a valid approach. As a general rule, if your database is getting used by multiple
applications, favour pushing more of your checks down into the database to ensure consistent behaviour
between the apps.
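As a purely hypothetical illustration - the constraint name and pattern are invented, and this exact rule wouldn't validate the formatted numbers already in the exercise dataset - a CHECK constraint might look like:
alter table cd.members
	add constraint telephone_digits_only
	check (telephone ~ '^[0-9]{10}$'); -- reject anything that isn't exactly ten digits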
Occasionally, adding a constraint isn't feasible. You may, for example, have two different legacy
applications asserting differently formatted information. If you're unable to alter the applications, you
have a couple of options to consider. Firstly, you can define a trigger on your table. This allows you to
intercept data before (or after) it gets asserted to your table, and normalise it into a single format.
Alternatively, you could build a view over your table that cleans up information on the fly, as it's read out.
Newer applications can read from the view and benefit from more reliably formatted information.
Recursive Queries
Common Table Expressions allow us to, effectively, create our own temporary tables for the duration of a
query - they're largely a convenience to help us make more readable SQL. Using the WITH RECURSIVE
modifier, however, it's possible for us to create recursive queries. This is enormously advantageous for
working with tree and graph-structured data - imagine retrieving all of the relations of a graph node to a
given depth, for example.
This category shows you some basic recursive queries that are possible using our dataset.
Find the upward recommendation chain for member ID 27
Find the upward recommendation chain for member ID 27: that is, the member who recommended them,
and the member who recommended that member, and so on. Return member ID, first name, and surname.
Order by descending member id.
Expected results:
20 Matthew Genting
5 Gerald Butters
1 Darren Smith
Answer:
WITH RECURSIVE is a fantastically useful piece of functionality that many developers are unaware of. It
allows you to perform queries over hierarchies of data, which is very difficult by other means in SQL. Such
scenarios often leave developers resorting to multiple round trips to the database system.
You've seen WITH before. The Common Table Expressions (CTEs) defined by WITH give you the ability to
produce inline views over your data. This is normally just a syntactic convenience, but the RECURSIVE
modifier adds the ability to join against results already produced to produce even more. A recursive WITH
takes the basic form of:
The initial statement populates the initial data, and then the recursive statement runs repeatedly to
produce more. Each step of the recursion can access the CTE, but it sees within it only the data produced
by the previous iteration. It repeats until an iteration produces no additional data.
The most simple example of a recursive WITH might look something like this:
with recursive increment(num) as (
select 1
union all
select increment.num + 1 from increment where increment.num < 5
)
select * from increment;
The initial statement produces '1'. The first iteration of the recursive statement sees this as the content of
increment , and produces '2'. The next iteration sees the content of increment as '2', and so on.
Execution terminates when the recursive statement produces no additional data.
With the basics out of the way, it's fairly easy to explain our answer here. The initial statement gets the ID
of the person who recommended the member we're interested in. The recursive statement takes the
results of the initial statement, and finds the ID of the person who recommended them. This value gets
forwarded on to the next iteration, and so on.
Now that we've constructed the recommenders CTE, all our main SELECT statement has to do is get the
member IDs from recommenders, and join to the members table to find out their names.
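A sketch of that answer (the CTE and column names here are illustrative):
with recursive recommenders(recommender) as (
	select recommendedby from cd.members where memid = 27
	union all
	select mems.recommendedby
		from recommenders recs
		inner join cd.members mems
			on mems.memid = recs.recommender
)
select recs.recommender, mems.firstname, mems.surname
	from recommenders recs
	inner join cd.members mems
		on recs.recommender = mems.memid
order by recs.recommender desc;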
Expected results:
4 Janice Joplette
5 Gerald Butters
7 Nancy Dare
10 Charles Owen
11 David Jones
14 Jack Smith
20 Matthew Genting
21 Anna Mackenzie
26 Douglas Jones
27 Henrietta Rumney
Answer:
with recursive recommendeds(memid) as (
select memid from cd.members where recommendedby = 1
union all
select mems.memid
from recommendeds recs
inner join cd.members mems
on mems.recommendedby = recs.memid
)
select recs.memid, mems.firstname, mems.surname
from recommendeds recs
inner join cd.members mems
on recs.memid = mems.memid
order by memid
This is a pretty minor variation on the previous question. The essential difference is that we're now
heading in the opposite direction. One interesting point to note is that unlike the previous example, this
CTE produces multiple rows per iteration, by virtue of the fact that we're heading down the
recommendation tree (following all branches) rather than up it.
Produce a CTE that can return the upward recommendation chain for any
member
Produce a CTE that can return the upward recommendation chain for any member. You should be able to
select recommender from recommenders where member=x. Demonstrate it by getting the chains for
members 12 and 22. Results table should have member and recommender, ordered by member ascending,
recommender descending.
Expected results:
12 9 Ponder Stibbons
12 6 Burton Tracy
22 16 Timothy Baker
22 13 Jemima Farrell
Answer:
This question requires us to produce a CTE that can calculate the upward recommendation chain for any
user. Most of the complexity of working out the answer is in realising that we now need our CTE to produce
two columns: one to contain the member we're asking about, and another to contain the members in
their recommendation tree. Essentially what we're doing is producing a table that flattens out the
recommendation hierarchy.
Since we're looking to produce the chain for every user, our initial statement needs to select data for each
user: their ID and who recommended them. Subsequently, we want to pass the member field through each
iteration without changing it, while getting the next recommender. You can see that the recursive part of
our statement hasn't really changed, except to pass through the 'member' field.
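A sketch of that CTE, along with a query demonstrating it for members 12 and 22:
with recursive recommenders(recommender, member) as (
	select recommendedby, memid from cd.members
	union all
	select mems.recommendedby, recs.member
		from recommenders recs
		inner join cd.members mems
			on mems.memid = recs.recommender
)
select recs.member, recs.recommender, mems.firstname, mems.surname
	from recommenders recs
	inner join cd.members mems
		on recs.recommender = mems.memid
	where recs.member = 22 or recs.member = 12
order by recs.member asc, recs.recommender desc;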