Sets


Define#

A set is a data structure that stores a collection of unique elements: inserting a value that is already present has no effect. The abstract set imposes no order on its elements, but the C++ std::set container keeps them sorted by value and is typically implemented as a balanced binary search tree.

std::unordered_set is the C++ Standard Library container that also represents a collection of unique elements, similar to a mathematical set. It is implemented as a hash table, which gives average constant-time insertion, lookup, and removal while ensuring uniqueness of its elements. Unlike std::set, it does not keep its elements in any particular order.
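To make the contrast concrete, here is a minimal sketch. The helper names sortedUnique and distinctCount are made up for illustration and are not part of the standard library.

```cpp
#include <cstddef>
#include <set>
#include <unordered_set>
#include <vector>

// Copy the values into a std::set and read them back:
// the result is free of duplicates and in ascending order.
std::vector<int> sortedUnique(const std::vector<int>& values) {
    std::set<int> ordered(values.begin(), values.end());
    return std::vector<int>(ordered.begin(), ordered.end());
}

// Count distinct values with std::unordered_set. Its iteration order is
// unspecified, so only the size is reported here.
std::size_t distinctCount(const std::vector<int>& values) {
    std::unordered_set<int> seen(values.begin(), values.end());
    return seen.size();
}
```

Both containers discard the repeated value. Only std::set guarantees the order in which the remaining elements are visited.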

Use Cases#

Graph Algorithms {both}

https://raw.githubusercontent.com/kdn251/interviews/master/images/dijkstra.gif

Sets can be used to track visited nodes in graph traversal algorithms.
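As a rough illustration of visited-node tracking, the sketch below runs a breadth-first traversal over an adjacency list. The Graph alias and the bfsOrder function are hypothetical names chosen for this example.

```cpp
#include <queue>
#include <unordered_map>
#include <unordered_set>
#include <vector>

// Hypothetical graph type for this sketch: node id -> list of neighbours.
using Graph = std::unordered_map<int, std::vector<int>>;

// Return the nodes reachable from start in breadth-first order.
std::vector<int> bfsOrder(const Graph& graph, int start) {
    std::vector<int> order;
    std::unordered_set<int> visited;
    std::queue<int> frontier;

    visited.insert(start);
    frontier.push(start);

    while (!frontier.empty()) {
        int node = frontier.front();
        frontier.pop();
        order.push_back(node);

        auto it = graph.find(node);
        if (it == graph.end()) continue;  // node has no outgoing edges

        for (int next : it->second) {
            // insert() reports whether the value was new, so one call
            // both tests and records membership.
            if (visited.insert(next).second) {
                frontier.push(next);
            }
        }
    }
    return order;
}
```

The visited set is what keeps the traversal from looping forever on graphs that contain cycles.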

Database Indexing {set}

https://cdn-media-1.freecodecamp.org/images/0eg06hWYJWhXPt1QNuaDlETYrmnSKAo6Nf44

Sets are used to maintain unique values in database indexes, ensuring fast lookups.

Data Deduplication {unordered_set}

http://3.bp.blogspot.com/-47SCyzU4tMM/UwK23slYgJI/AAAAAAAAUS4/8XZ52p1D044/s1600/deduplication3.gif

Removing duplicates from a list of records, such as emails or customer IDs.
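One possible way to do this, assuming the records are plain strings and the first occurrence of each should be kept in its original position, is sketched below. The function name deduplicate is illustrative only.

```cpp
#include <string>
#include <unordered_set>
#include <vector>

// Keep the first occurrence of every record and drop later repeats.
std::vector<std::string> deduplicate(const std::vector<std::string>& records) {
    std::unordered_set<std::string> seen;
    std::vector<std::string> unique;
    for (const std::string& record : records) {
        // insert().second is true only when the record was not seen before.
        if (seen.insert(record).second) {
            unique.push_back(record);
        }
    }
    return unique;
}
```

The set is used only for the "seen before?" check. The output vector preserves input order, which the unordered_set itself would not.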

Membership Testing {both}

Sets are efficient for checking whether an element is part of a specific group or category.

Spell Checking {set}

https://helpcenter.onlyoffice.com/OfficeWeb/apps/documenteditor/main/resources/help/en/images/spellchecking.png

In word processing applications, a set can be used to maintain a dictionary of correctly spelled words.
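A minimal sketch of that idea, assuming the dictionary is already loaded into a std::set<std::string> and matching is case-sensitive, could look like this. The name misspelledWords is hypothetical.

```cpp
#include <set>
#include <string>
#include <vector>

// Return every word that is not present in the dictionary, in input order.
std::vector<std::string> misspelledWords(const std::set<std::string>& dictionary,
                                         const std::vector<std::string>& words) {
    std::vector<std::string> unknown;
    for (const std::string& word : words) {
        // count() is 0 or 1 for a set, so it doubles as a membership test.
        if (dictionary.count(word) == 0) {
            unknown.push_back(word);
        }
    }
    return unknown;
}
```

Each lookup in a std::set takes logarithmic time in the dictionary size. A real spell checker would also normalise case and punctuation, which this sketch leaves out.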

Counting Occurrences {unordered_set}

https://www.w3resource.com/w3r_images/cpp-array-image-exercise-20.png

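Counting how many distinct elements a dataset contains. A set records only whether a value is present, so per-element frequencies need a map such as std::unordered_map.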
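std::unordered_set stores no counts of its own, so the sketch below assumes per-element frequencies are wanted and uses the closely related std::unordered_map instead. The function name countOccurrences is made up for this example.

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Map each distinct item to the number of times it appears.
std::unordered_map<std::string, std::size_t>
countOccurrences(const std::vector<std::string>& items) {
    std::unordered_map<std::string, std::size_t> counts;
    for (const std::string& item : items) {
        // operator[] value-initialises a missing entry to 0 before the increment.
        ++counts[item];
    }
    return counts;
}
```

The size of the returned map is the number of distinct items, which is the same value an unordered_set built from the input would report.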

Advantages & Disadvantages#

Advantages

Disadvantages

Programming#

CppRef- set

Pseudo- set

Set Data Structure:
  - Initialize an empty set
  - Implement functions for insert, delete, search, and traverse

Code- set

We use the std::set container from the C++ Standard Library, which keeps its elements sorted and is typically implemented as a balanced binary search tree.
We insert, check for existence, and remove elements using the insert, find, and erase methods.
Finally, we display the elements in the set in ascending order.

#include <iostream>
#include <set>

int main() {
    std::set<int> mySet;
    
    // Insert elements
    mySet.insert(10);
    mySet.insert(5);
    mySet.insert(20);

    // Search for an element
    auto it = mySet.find(5);
    if (it != mySet.end()) {
        std::cout << "Element 5 found in the set.\n";
    }

    // Delete an element
    mySet.erase(10);

    // Traverse the set; std::set iterates in ascending order
    std::cout << "Set elements: ";
    for (const int& element : mySet) {
        std::cout << element << " ";
    }
    std::cout << "\n";

    return 0;
}

Element 5 found in the set.
Set elements: 5 20

CppRef- unordered_set

Pseudo- unordered_set

Unordered Set Data Structure:

Data:
- Initialize an array (buckets) of a fixed size for storing elements.
- Each bucket is a linked list to handle collisions.

Functions:
- Insert(value):
    1. Calculate the hash of the value.
    2. Find the bucket using the hash.
    3. Search the bucket for the value; if not found, append the value to the bucket.

- Contains(value):
    1. Calculate the hash of the value.
    2. Find the bucket using the hash.
    3. Search the bucket for the value; return true if found, false otherwise.

- Remove(value):
    1. Calculate the hash of the value.
    2. Find the bucket using the hash.
    3. Search the bucket for the value, and if found, remove it.

- Display():
    1. Iterate through each bucket and display the elements.
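
The pseudocode above can be sketched in C++ roughly as follows. This is an illustrative toy for int values, not how std::unordered_set is actually written. The class name ChainedIntSet and its member names are assumptions, and Display() is modelled as elements(), which returns the values so the caller can print them.

```cpp
#include <algorithm>
#include <cstddef>
#include <functional>
#include <list>
#include <vector>

class ChainedIntSet {
public:
    // Fixed number of buckets; at least one so the modulo below is valid.
    explicit ChainedIntSet(std::size_t bucketCount = 16)
        : buckets_(bucketCount == 0 ? 1 : bucketCount) {}

    // Insert: returns true if added, false if the value was already present.
    bool insert(int value) {
        std::list<int>& bucket = buckets_[indexFor(value)];
        if (std::find(bucket.begin(), bucket.end(), value) != bucket.end()) {
            return false;
        }
        bucket.push_back(value);
        return true;
    }

    // Contains: search only the bucket the hash points to.
    bool contains(int value) const {
        const std::list<int>& bucket = buckets_[indexFor(value)];
        return std::find(bucket.begin(), bucket.end(), value) != bucket.end();
    }

    // Remove: returns true if the value was found and erased.
    bool remove(int value) {
        std::list<int>& bucket = buckets_[indexFor(value)];
        auto it = std::find(bucket.begin(), bucket.end(), value);
        if (it == bucket.end()) return false;
        bucket.erase(it);
        return true;
    }

    // Display: collect the elements bucket by bucket (order is arbitrary).
    std::vector<int> elements() const {
        std::vector<int> out;
        for (const std::list<int>& bucket : buckets_) {
            out.insert(out.end(), bucket.begin(), bucket.end());
        }
        return out;
    }

private:
    // Hash the value, then map the hash onto a bucket index.
    std::size_t indexFor(int value) const {
        return std::hash<int>{}(value) % buckets_.size();
    }

    std::vector<std::list<int>> buckets_;
};
```

Because the bucket count never changes here, chains grow as more elements are added. Production hash tables rehash into more buckets once the load factor passes a threshold.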

Code- unordered_set