babybalao's blog

Thursday, February 10, 2011

DIFFERENT SORTING

1.Selection Sort

Selection sort performs sorting by repeatly putting the largest element in the unprocessed portion of the array to the end of the this unprocessed portion until the whole array is sorted.

It works by repeatedly element. It works as follows: first find the smallest in the array and exchange it with the element in the first position, then find the second smallest element and exchange it with the element in the second position, and continue in this way until the entire array is sorted.

Selection sort is among the simplest of sorting techniques and it work very well for small files. Furthermore, despite its evident "naïve approach "Selection sort has a quite important application because each item is actually moved at most once, Section sort is a method of choice for sorting files with very large objects (records) and small keys.

The Selection sort spends most of its time trying to find the minimum element in the "unsorted" part of the array.

Example how to implement:

void selectionSort(int numbers[], int array_size)
{
  int i, j;
  int min, temp;

  for (i = 0; i < array_size-1; i++)
  {
    min = i;
    for (j = i+1; j < array_size; j++)
    {
      if (numbers[j] < numbers[min])
        min = j;
    }
    temp = numbers[i];
    numbers[i] = numbers[min];
    numbers[min] = temp;
  }
}

2. Insertion Sort

Insertion sort is an algorithm to arrange a sequence of elements R₁, R₂, …, R_n into order. Elements are removed one by one from the unsorted sequence and inserted into the correct position in a new, sorted sequence S.

Insertion sort is an elementary sorting algorithm. It has a time complexity of Θ(n²), thus being slower than heapsort, merge sort and also shellsort. Insertion sort is well suited for sorting small data sets or for the insertion of new elements into a sorted sequence.

Sorting is usually implemented in-place. As elements are removed from the end of sequence R, the space freed up is used to extend sequence S. After i insertions, R₁, …, R_n-i remain unsorted and the sorted sequence S₁, …, S_i is stored in elements R_n-i+1, …, R_n.

Pseudo Code for Insertion Sort:

function insertion_sort R
    i ← length R
    while i > 0 do
        j ← i
        temp ← R_j
        while j < length R and R_j+1 < temp do
            R_j ← R_j+1
            j ← j + 1
        R_j ← temp
        i ← i - 1

Insertion sort requires n passes to sort a sequence of n elements. With each pass the sorted sequence S increases by one element. On average, Insertion sort will move and compare half of the sorted sequence each pass. The average number of move / compare operations is (n²-n)/4.

The worst case is a sequence sorted in reverse order which requires (n²-n)/2 move / compare operations. Insertion sort runs in Ο(n²) time.

Insertion sort is a stable sorting algorithm. The relative order of equal elements is maintained.

3. Bubble Sort

Bubble Sort is an elementary sorting algorithm. It works by repeatedly exchanging adjacent elements, if necessary. When no exchanges are required, the file is sorted. It is an alternate way of putting the largest element at the highest index in the array uses an algorithm.

The bubble sort is quick for nearly sorted files. For example, files which have been recently sorted but may have had a few more records added at the end.

Like selection sort, the idea of bubble sort is to repeatedly move the largest element to the highest index position of the array. As in selection sort, each iteration reduces the effective size of the array. The two algorithms differ in how this is done. Rather than search the entire effective array to find the largest element, bubble sort focuses on successive adjacent pairs of elements in the array, compares them, and either swaps them or not. In either case, after such a step, the larger of the two elements will be in the higher index position. The focus then moves to the next higher position, and the process is repeated. When the focus reaches the end of the effective array, the largest element will have ``bubbled'' from whatever its original position to the highest index position in the effective array.

Implementation:

void bubbleSort(int numbers[], int array_size)
{
  int i, j, temp;

  for (i = (array_size - 1); i >= 0; i--)
  {
    for (j = 1; j <= i; j++)
    {
      if (numbers[j-1] > numbers[j])
      {
        temp = numbers[j-1];
        numbers[j-1] = numbers[j];
        numbers[j] = temp;
      }
    }
  }
}

4. Shell Sort

Shellsort is one of the oldest sorting algorithms, named after its inventor D.L. Shell (1959) [She 59]. It is fast, easy to understand and easy to implement. However, its complexity analysis is a little more sophisticated.

This algorithm is a simple extension of Insertion sort. Its speed comes from the fact that it exchanges elements that are far apart (the insertion sort exchanges only adjacent elements).

The idea of the Shell sort is to rearrange the file to give it the property that taking every h^th element (starting anywhere) yields a sorted file. Such a file is said to be h-sorted.

Shell sort is the method of choice for many sorting application because it has acceptable running time even for moderately large files and requires only small amount of code that is easy to get working. Having said that, it is worthwhile to replace Shell sort with a sophisticated sort in given sorting problem.

Implementation:

void shellSort(int numbers[], int array_size)
{
  int i, j, increment, temp;

  increment = 3;
  while (increment > 0)
  {
    for (i=0; i < array_size; i++)
    {
      j = i;
      temp = numbers[i];
      while ((j >= increment) && (numbers[j-increment] > temp))
      {
        numbers[j] = numbers[j - increment];
        j = j - increment;
      }
      numbers[j] = temp;
    }
    if (increment/2 != 0)
      increment = increment/2;
    else if (increment == 1)
      increment = 0;
    else
      increment = 1;
  }
}

REFRENCES:

(http://www.personal.kent.edu/~rmuhamma/Algorithms/MyAlgorithms/Sorting/shellSort.htm)

(http://www.inf.fh-flensburg.de/lang/algorithmen/sortieren/shell/shellen.htm)

(http://www.personal.kent.edu/~rmuhamma/Algorithms/MyAlgorithms/Sorting/selectionSort.htm)

(http://www.theteacher99.btinternet.co.uk/theteacher/newalevel/cp1_4_1.htm)

(http://www.personal.kent.edu/~rmuhamma/Algorithms/MyAlgorithms/Sorting/bubbleSort.htm)

Friday, February 4, 2011

ASSIGNMENT NO. 2

1. ANALYSIS OF ALGORITHM

Analysis of algorithm is to determine the amount of resources (such as time and storage) necessary to execute it. It is to analyse an algorithm. It is a field in computer science whose overall goal is an understanding of the complexity of algorithms. The practical goal of algorithm analysis is to predict the performance of different algorithms in order to guide program design decisions.

2. EMPIRICAL ANALYSIS

Empirical Analysis is what we observe or experiment. It means that it is an observation on what we analyse on a certain algorithm and we can determine whats the best algorithm to use.

Steps for Empirical Analysis
a. Decide on the basic operation.
b. Decide on the input sample (range, size,...).
c. Convert the algorithm to a program.
d. Generate a sample of inputs.
e. Run the program; record the observed data.
f. Analyze the data.

3. BIG OH NOTATION

Big oh notation describes the limiting behavior of a function when the argument tends towards a particular value or infinity, usually in terms of simpler functions. Big O notation characterizes functions according to their growth rates: different functions with the same growth rate may be represented using the same O notation. It describe the performance or complexity of an algorithm.

Some common orders of growth along with descriptions:

O(1)

O(1) describes an algorithm that will always execute in the same time (or space) regardless of the size of the input data set.

O(N)

O(N) describes an algorithm whose performance will grow linearly and in direct proportion to the size of the input data set. The example below also demonstrates how Big O favours the worst-case performance scenario; a matching string could be found during any iteration of the for loop and the function would return early, but Big O notation will always assume the upper limit where the algorithm will perform the maximum number of iterations.

O(N²)

O(N²) represents an algorithm whose performance is directly proportional to the square of the size of the input data set. This is common with algorithms that involve nested iterations over the data set. Deeper nested iterations will result in O(N³), O(N⁴) etc.

O(2^N)

O(2^N) denotes an algorithm whose growth will double with each additional element in the input data set. The execution time of an O(2^N) function will quickly become very large.

Thursday, January 13, 2011

UNION-FIND ALGORITHMS

When we compute a given set of elements, it is often useful to break them up or partition them into a number of separate.

A Union-Find data structure maintains a partition of a set X of size n. For each set A in the partition it maintains a representative S(A) contained in A. The initial partition has each element in a set by itself. Operations include Union(S(A), S(B)), where S(A) =6 S(B), replaces A and B by A ∪ B, and speciﬁes a representative for A ∪ B.

A union-find algorithm is an algorithm that performs two useful operations on such a data structure:

Find: Determine which set a particular element is in. Also useful for determining if two elements are in the same set.
Union: Combine or merge two sets into a single set.

Because it supports these two operations, a disjoint-set data structure is sometimes called a union-find data structure or merge-find set. The other important operation, MakeSet, which makes a set containing only a given element a singleton, is generally trivial. With these three operations, many practical partitioning problems can be solved.

(http://en.wikipedia.org/wiki/Disjoint-set_data_structure)
(http://www.cs.berkeley.edu/~karp/greatalgo/lecture08.pdf)