How to implement merge sort in C++

How to Implement Merge Sort in C++

Merge sort is a powerful and efficient sorting algorithm that uses the divide-and-conquer strategy to sort elements. With its O(n log n) time complexity, it is particularly useful for large datasets and is often preferred in scenarios where stability is essential. In this article, we will explore how to implement merge sort in C++, covering its definition, use cases, and providing you with actionable insights and code examples.

Understanding Merge Sort

What is Merge Sort?

Merge sort is a recursive sorting algorithm that divides an array into two halves, sorts each half, and then merges them back together in sorted order. This method ensures that the array is sorted efficiently and with minimal comparisons.

Key Characteristics of Merge Sort

Stable Sort: Merge sort maintains the relative order of records with equal keys.
Time Complexity: O(n log n) in all cases (best, average, worst).
Space Complexity: O(n) due to the temporary arrays used for merging.

When to Use Merge Sort?

Large Datasets: Effective for sorting large arrays where other algorithms may struggle.
Linked Lists: Particularly efficient for sorting linked lists, as they can be split without needing extra space.
Stability Requirement: When you need a stable sort, such as when sorting records with multiple fields.

Implementing Merge Sort in C++

Step 1: Basic Structure

To implement merge sort in C++, we need to create a function that can divide the array and merge the sorted arrays back together. Here’s the basic structure of our program:

#include <iostream>
#include <vector>

void merge(std::vector<int>& arr, int left, int mid, int right);
void mergeSort(std::vector<int>& arr, int left, int right);

int main() {
    std::vector<int> arr = {38, 27, 43, 3, 9, 82, 10};
    mergeSort(arr, 0, arr.size() - 1);

    std::cout << "Sorted array: ";
    for (const int& num : arr) {
        std::cout << num << " ";
    }
    return 0;
}

Step 2: Implementing the Merge Function

The merge function is responsible for combining two sorted halves of the array. Here’s how to implement it:

void merge(std::vector<int>& arr, int left, int mid, int right) {
    int n1 = mid - left + 1;
    int n2 = right - mid;

    std::vector<int> L(n1), R(n2);

    for (int i = 0; i < n1; i++)
        L[i] = arr[left + i];
    for (int j = 0; j < n2; j++)
        R[j] = arr[mid + 1 + j];

    int i = 0, j = 0, k = left;

    while (i < n1 && j < n2) {
        if (L[i] <= R[j]) {
            arr[k] = L[i];
            i++;
        } else {
            arr[k] = R[j];
            j++;
        }
        k++;
    }

    while (i < n1) {
        arr[k] = L[i];
        i++;
        k++;
    }

    while (j < n2) {
        arr[k] = R[j];
        j++;
        k++;
    }
}

Step 3: Implementing the Merge Sort Function

Now we need to implement the mergeSort function, which will recursively divide the array until we reach single elements.

void mergeSort(std::vector<int>& arr, int left, int right) {
    if (left < right) {
        int mid = left + (right - left) / 2;

        mergeSort(arr, left, mid);
        mergeSort(arr, mid + 1, right);
        merge(arr, left, mid, right);
    }
}

Complete Code Example

Here’s the complete code for implementing merge sort in C++:

#include <iostream>
#include <vector>

void merge(std::vector<int>& arr, int left, int mid, int right);
void mergeSort(std::vector<int>& arr, int left, int right);

int main() {
    std::vector<int> arr = {38, 27, 43, 3, 9, 82, 10};
    mergeSort(arr, 0, arr.size() - 1);

    std::cout << "Sorted array: ";
    for (const int& num : arr) {
        std::cout << num << " ";
    }
    return 0;
}

void merge(std::vector<int>& arr, int left, int mid, int right) {
    int n1 = mid - left + 1;
    int n2 = right - mid;

    std::vector<int> L(n1), R(n2);

    for (int i = 0; i < n1; i++)
        L[i] = arr[left + i];
    for (int j = 0; j < n2; j++)
        R[j] = arr[mid + 1 + j];

    int i = 0, j = 0, k = left;

    while (i < n1 && j < n2) {
        if (L[i] <= R[j]) {
            arr[k] = L[i];
            i++;
        } else {
            arr[k] = R[j];
            j++;
        }
        k++;
    }

    while (i < n1) {
        arr[k] = L[i];
        i++;
        k++;
    }

    while (j < n2) {
        arr[k] = R[j];
        j++;
        k++;
    }
}

void mergeSort(std::vector<int>& arr, int left, int right) {
    if (left < right) {
        int mid = left + (right - left) / 2;

        mergeSort(arr, left, mid);
        mergeSort(arr, mid + 1, right);
        merge(arr, left, mid, right);
    }
}

Troubleshooting Common Issues

When implementing merge sort, you may encounter several common issues:

Out of bounds error: Ensure your indices are correctly set during merging.
Memory consumption: If your dataset is large, consider optimizing space usage by implementing an in-place merge sort.
Performance issues: For smaller datasets, simpler algorithms like quicksort or insertion sort may be more efficient.

Conclusion

Merge sort is a highly efficient and reliable sorting algorithm that can handle large datasets with ease. By following the steps outlined in this article, you can implement merge sort in C++ and understand its underlying mechanics. Whether you’re dealing with arrays or linked lists, mastering merge sort will enhance your programming toolkit and improve your algorithmic skills. Happy coding!