0% found this document useful (0 votes)

72 views

Notes 03 Sorting PDF

Quicksort works by recursively dividing the array into two halves based on a pivot element. The pivot element is chosen, and all elements less than or equal to the pivot are moved before it and all greater elements moved after it. The process is then recursively applied to the two sub-arrays. Once the recursion bottoms out on arrays of size 0 or 1, the entire array is sorted.

Uploaded by

hussein hammoud

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

72 views

Notes 03 Sorting PDF

Uploaded by

hussein hammoud

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 126

3 Eﬃcient Sorting

Sebastian Wild
18 February 2020

version 2020-02-24 15:38

Outline

3 Eﬃcient Sorting
3.1 Mergesort
3.2 Quicksort
3.3 Comparison-Based Lower Bound
3.4 Integer Sorting
3.5 Parallel computation
3.6 Parallel primitives
3.7 Parallel sorting
Why study sorting?
� fundamental problem of computer science that is still not solved
Algorithm with optimal #comparisons in worst case?
� building brick of many more advanced algorithms
� for preprocessing
� as subroutine

� playground of manageable complexity

to practice algorithmic techniques

Here:
� “classic” fast sorting method

� parallel sorting

1
Part I
The Basics
Rules of the game
� Given:
� array 𝐴[0..𝑛 − 1] of 𝑛 objects
� a total order relation ≤ among 𝐴[0], . . . , 𝐴[𝑛 − 1]
(a comparison function)

� Goal: rearrange (=permute) elements within 𝐴,

so that 𝐴 is sorted, i. e., 𝐴[0] ≤ 𝐴[1] ≤ · · · ≤ 𝐴[𝑛 − 1]

� for now: 𝐴 stored in main memory (internal sorting)

single processor (sequential sorting)

2
3.1 Mergesort
Clicker Question

How does mergesort work?

A Split elements around median, then recurse on small /
large elements.

B Recurse on left / right half, then combine sorted halves.

C Grow sorted part on left, repeatedly add next element

to sorted range.

D Repeatedly choose 2 elements and swap them if they

are out of order.

E Don’t know.

pingo.upb.de/622222
3
Clicker Question

How does mergesort work?

A Split elements around median, then recurse on small /
large elements.

B Recurse on left / right half, then combine sorted halves. �

C Grow sorted part on left, repeatedly add next element
to sorted range.

D Repeatedly choose 2 elements and swap them if they

are out of order.

E Don’t know.

pingo.upb.de/622222
3
Merging sorted lists

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Merging sorted lists

run1 run2 result

4
Mergesort
1 procedure mergesort(𝐴[𝑙..𝑟])
2 𝑛 := 𝑟 − 𝑙 + 1 � recursive procedure; divide & conquer
3 if 𝑛 ≥ 1 return
� � � merging needs
4 𝑚 := 𝑙 + 𝑛2
� temporary storage for result
5 mergesort(𝐴[𝑙..𝑚 − 1])
of same size as merged runs
6 mergesort(𝐴[𝑚..𝑟])
7 merge(𝐴[𝑙..𝑚 − 1], 𝐴[𝑚..𝑟], buf ) � to read and write each element twice
8 copy buf to 𝐴[𝑙..𝑟] (once for merging, once for copying back)

5
Mergesort
1 procedure mergesort(𝐴[𝑙..𝑟])
2 𝑛 := 𝑟 − 𝑙 + 1 � recursive procedure; divide & conquer
3 if 𝑛 ≥ 1 return
� � � merging needs
4 𝑚 := 𝑙 + 𝑛2
� temporary storage for result
5 mergesort(𝐴[𝑙..𝑚 − 1])
of same size as merged runs
6 mergesort(𝐴[𝑚..𝑟])
7 merge(𝐴[𝑙..𝑚 − 1], 𝐴[𝑚..𝑟], buf ) � to read and write each element twice
8 copy buf to 𝐴[𝑙..𝑟] (once for merging, once for copying back)

Analysis:� count “element visits” (read and/or write)

0 𝑛≤1
𝐶(𝑛) = same for best and worst case!
𝐶(�𝑛/2�) + 𝐶(�𝑛/2�) + 2𝑛 𝑛 ≥ 2

Simpliﬁcation 𝑛 = 2 𝑘
�
0 𝑘≤0
𝐶(2 𝑘 ) = = 2 · 2 𝑘 + 22 · 2 𝑘−1 + 23 · 2 𝑘−2 + · · · + 2 𝑘 · 21 = 2𝑘 · 2 𝑘
2 · 𝐶(2 𝑘−1
)+2·2 𝑘
𝑘≥1
𝐶(𝑛) = 2𝑛 lg(𝑛) = Θ(𝑛 log 𝑛)
5
and for arbitrary 𝑛 we have 𝐶(𝑛) ≤ 𝐶(next larger power of 2) ≤ 4𝑛 lg(𝑛) + 2𝑛 = Θ(𝑛 log 𝑛)
Mergesort – Discussion
optimal time complexity of Θ(𝑛 log 𝑛) in the worst case

stable sorting method i. e., retains relative order of equal-key items

memory access is sequential (scans over arrays)

requires Θ(𝑛) extra space

there are in-place merging methods,
but they are substantially more complicated
and not (widely) used

6
3.2 Quicksort
Clicker Question

How does quicksort work?

A split elements around median, then recurse on small /
large elements.

B recurse on left / right half, then combine sorted halves.

C grow sorted part on left, repeatedly add next element to

sorted range.

D repeatedly choose 2 elements and swap them if they are

out of order.

E Don’t know.

pingo.upb.de/622222
7
Clicker Question

How does quicksort work?

A split elements around median, then recurse on small /
large elements. �
B recurse on left / right half, then combine sorted halves.

C grow sorted part on left, repeatedly add next element to

sorted range.

D repeatedly choose 2 elements and swap them if they are

out of order.

E Don’t know.

pingo.upb.de/622222
7
Partitioning around a pivot

8
Partitioning around a pivot

<𝑝

8
Partitioning around a pivot

�
<𝑝

8
Partitioning around a pivot

<𝑝

8
Partitioning around a pivot

<𝑝 >𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝
�
>𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝 >𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝
✗
<𝑝 >𝑝

8
Partitioning around a pivot

<𝑝
✗
>𝑝
✗
<𝑝 >𝑝

8
Partitioning around a pivot

<𝑝
�
<𝑝
�
>𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝
�
<𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝

�
<𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗
>𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗
>𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗
>𝑝
�
>𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗
>𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗✗
>𝑝 <𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

✗✗
>𝑝 <𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝

��
<𝑝 >𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝 <𝑝 >𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝 <𝑝 >𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

<𝑝 <𝑝 <𝑝 <𝑝 <𝑝 >𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning around a pivot

� no extra space needed

� visits each element once

� returns rank/position of pivot

<𝑝 <𝑝 <𝑝 <𝑝 <𝑝 >𝑝 >𝑝 >𝑝 >𝑝

8
Partitioning – Detailed code
Beware: details easy to get wrong; use this code!

1 procedure partition(𝐴, 𝑏)
2 // input: array 𝐴[0..𝑛 − 1], position of pivot 𝑏 ∈ [0..𝑛 − 1]
3 swap(𝐴[0], 𝐴[𝑏])
4 𝑖 := 0, 𝑗 := 𝑛
5 while true do
6 do 𝑖 := 𝑖 + 1 while 𝑖 < 𝑛 and 𝐴[𝑖] < 𝐴[0]
7 do 𝑗 := 𝑗 − 1 while 𝑗 ≥ 1 and 𝐴[𝑗] > 𝐴[0]
8 if 𝑖 ≥ 𝑗 then break (goto 8)
9 else swap(𝐴[𝑖], 𝐴[𝑗])
10 end while
11 swap(𝐴[0], 𝐴[𝑗])
12 return 𝑗

Loop invariant (5–10): 𝐴 𝑝 ≤𝑝 ? ≥𝑝

𝑖 𝑗
9
Quicksort
1 procedure quicksort(𝐴[𝑙..𝑟]) � recursive procedure; divide & conquer
2 if 𝑙 ≥ 𝑟 then return
3 𝑏 := choosePivot(𝐴[𝑙..𝑟]) � choice of pivot can be
4 𝑗 := partition(𝐴[𝑙..𝑟], 𝑏) � ﬁxed position � dangerous!
5 quicksort(𝐴[𝑙..𝑗 − 1]) � random
6 quicksort(𝐴[𝑗 + 1..𝑟]) � more sophisticated, e. g., median of 3

10
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 9 8

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6 8 9

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6 8 9

1 2 3 5 6 8

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6 8 9

1 2 3 5 6 8

1 3 6

11
Quicksort & Binary Search Trees
Quicksort

7 4 2 9 1 3 8 5 6

4 2 1 3 5 6 7 9 8

2 1 3 4 5 6 8 9

1 2 3 5 6 8

1 3 6