Professional Documents
Culture Documents
ANALYSIS OF VARIOUS
SORTING TECHNIQUES
ABSTRACT
Sorting is one of the most important
operations performed by computers. In the
days of magnetic tape storage before
modern data-bases, it was almost certainly
the most common operation performed by
computers as most "database" updating
was done by sorting transactions and
merging them with a master file. It's still
important for presentation of data extracted
from databases: most people prefer to get
reports sorted into some relevant order
before wading through pages of data!
MODULES
The modules include
(I). Implementation of various Comparisonbased Sorting Techniques
(II) Analyze the performance of various
Comparison-based Sorting Techniques
(III) Implementation of various in-place,
stable, external, Recursive and non-recursive
Sorting Techniques (IV) Analyze the
performance of in-place, stable, external,
Recursive and non-recursive Sorting
Technique
Output:
A permutation (reordering) a1, a2, . . . , an of the input
sequence such that a1 a2 an
PROBLEM: SORTING
arranging elements of set into order
Solution:
Insertion Sort
Quicksort
Mergesort
Heapsort
Shellsort
Radix Sorting
Optimality:
Lower bounds for Sorting by
Comparison of Keys
STRUCTURE OF DATA
SOME DEFINITIONS
Internal Sort
The data to be sorted is all stored in the computers main
memory.
External Sort
Some of the data to be sorted might be stored in some
external, slower, device.
In Place Sort
The amount of extra space required to sort the data is
constant with the input size.
STABILITY
A STABLE sort preserves relative order of records
with equal keys
Sorted on first key:
INSERTION SORT
Idea: like sorting a hand of playing cards
Start with an empty left hand and the cards facing down
on the table.
Remove one card at a time from the table, and insert it
into the correct position in the left hand
compare it with each of the cards already in the hand, from
right to left
10
INSERTION SORT
6 10 2 4 3 6
12
11
INSERTION SORT
6 10 2 4
36
12
12
INSERTION SORT
6 10
24 3
6
12
13
INSERTION SORT
input array
5
3
sorted
right sub-array
unsorted
14
INSERTION SORT
15
Termination:
When the loop terminates, the invariant gives us a useful
property that helps show that the algorithm is correct
Stop the induction when the loop terminates
16
17
18
Invariant: at the start of the for loop the elements in A[1 . . j-1]
are in sorted order
19
cost
times
for j 2 to n
do key A[ j ]
c1
0
c4
ij-1
while i > 0 and A[i] > key
do A[i + 1] A[i]
ii1
A[i + 1] key
c5
c6
n
n-1
n-1
n
n-1j 2 t j
j 2
n
j 2
(t j 1)
(t j 1)
c7
n-1
T (n) c1n c2 (n 1) c4 (n 1) c5 t j c6 t j 1 c7 t j 1 c8 (n 1)
n
j 2
j 2
j 2
20
Disadvantages
(n2) running time in worst and average case
n2/2 comparisons and exchanges
21
1
j
22
EXAMPLE
8
i=1
i=1
i=1
i=1
4
4
6
6
i=3
i=4
9
9
2
2
3
3
4
4
9
j
i=5
i=2
i=1
i=1
i=1
9
j
i=6
9
i=7
j
23
BUBBLE SORT
Alg.: BUBBLESORT(A)
for i 1 to length[A]
do for j length[A] downto i + 1
do if A[j] < A[j -1]
i
then exchange A[j] A[j-1]
8
i=1
1
j
24
c2
c3
i 1
i 1
T(n) = c1(n+1) + c2 (n i 1) c3 (n i ) c4
= (n) + (c2 + c2 + c4)
c4
(n i)
i 1
(n i )
i 1
2
n
(
n
1)
n
n
where (n i ) n i n 2
2
2 2
i 1
i 1
i 1
n
Thus,T(n) = (n2)
25
Disadvantage:
Running time depends only slightly on the amount of
order in the file
26
EXAMPLE
8
27
1
high
low
Merge-Sort (A, low, high)
{
cost
n length[A]
for j 1 to n - 1
do smallest j
n2/2
for i j + 1 to n
comparisons
exchanges
n 1
n 1
n 1
j 1
j 1
j 2
times
c1
c2
c3
n-1
c4 j 1 (n j 1)
n 1
c5
c6
c7
n 1
j 1
n 1
j 1
(n j )
(n j )
n-1
T (n) c1 c2 n c3 (n 1) c4 (n j 1) c5 n j c6 n j c7 (n 1) (n 2 )
29
Algorithm
Worst case Average
Space
Usage
Insertion n^2/2
(n^2)
in place
Quicksort n^2/2
(n log n) log n
Mergesort
n lg n
(n log n) n
Heapsort 2n lg n
(n log n) in place
Ac.Heaps.
n lg n
(n log n) in place
Accelerated Heapsort currently is the method of
choice.