Python Remove Duplicates

## Python: How to Remove Duplicates from a List In Python, removing duplicate elements from a list is a common task. Depending on your specific requirements—such as whether you need to preserve the original order of elements or optimize for performance—there are several ways to achieve this. This tutorial covers the three most common and efficient methods: using the `set` data structure, using a loop with a list comprehension, and using the `dict.fromkeys()` method. --- ## Methods to Remove Duplicates ### Method 1: Using `set()` (Fastest, Does Not Preserve Order) A `set` is an unordered collection of unique elements in Python. By converting a list to a set, Python automatically discards all duplicate values. You can then convert the set back into a list. #### Code Example ```python # Method 1: Using set() original_list = [1, 2, 2, 3, 4, 4, 5] unique_list = list(set(original_list)) print("Method 1 Result:", unique_list) ``` #### How It Works * `set(original_list)` filters out duplicates because sets cannot contain duplicate values. * `list(...)` converts the set back into a standard Python list. * **Note:** Because sets are unordered, this method **does not guarantee** that the original order of elements will be preserved. --- ### Method 2: Using `dict.fromkeys()` (Fast, Preserves Order) In Python 3.7 and later, standard dictionaries are guaranteed to maintain insertion order. The `dict.fromkeys()` method creates a dictionary where the list elements become the dictionary keys. Since dictionary keys must be unique, duplicates are removed while preserving their original order. #### Code Example ```python # Method 2: Using dict.fromkeys() original_list = [1, 2, 2, 3, 4, 4, 5] unique_list = list(dict.fromkeys(original_list)) print("Method 2 Result:", unique_list) ``` #### How It Works * `dict.fromkeys(original_list)` creates a dictionary with keys from the list and values set to `None`. * Since keys must be unique, duplicates are automatically removed. * `list(...)` extracts only the keys from the dictionary, returning them as a list in their original order. --- ### Method 3: Using a Loop / List Comprehension (Preserves Order, Best for Custom Logic) If you need to maintain the original order of elements without relying on dictionary keys, or if you need to apply custom filtering conditions, you can iterate through the list and append elements to a new list only if they are not already present. #### Code Example ```python # Method 3: Using a loop / list comprehension original_list = [1, 2, 2, 3, 4, 4, 5] unique_list = [] [unique_list.append(x) for x in original_list if x not in unique_list] print("Method 3 Result:", unique_list) ``` #### How It Works * The list comprehension iterates through `original_list`. * For each element `x`, it checks if `x` is already in `unique_list`. * If `x` is not in `unique_list`, it is appended. * **Note:** While this method preserves order, checking `x not in unique_list` takes $O(N)$ time for each element, making this method slower ($O(N^2)$ overall complexity) for very large lists. --- ## Output Comparison When you run the code above, you will get the following output: ```text Method 1 Result: [1, 2, 3, 4, 5] Method 2 Result: [1, 2, 3, 4, 5] Method 3 Result: [1, 2, 3, 4, 5] ``` --- ## Summary & Considerations When choosing a method to remove duplicates, consider the following guidelines: | Method | Preserves Order? | Time Complexity | Best Used For | | :--- | :--- | :--- | :--- | | **`set()`** | No | $O(N)$ | Large datasets where element order does not matter. This is the fastest and most memory-efficient method. | | **`dict.fromkeys()`** | Yes (Python 3.7+) | $O(N)$ | Large datasets where you must preserve the original order of elements. | | **Loop / List Comprehension** | Yes | $O(N^2)$ | Small datasets, or when you need to apply custom conditional logic during the deduplication process. |

YouTip

Python Remove Duplicates

📂 Categories