Python | Convert Lists to column tuples

Last Updated : 30 Apr, 2023

Sometimes, while working with data, we can have a problem in which we need to get alike index elements in a single container. This means the columns of Matrix/Multiple lists need to be converted to list of tuples, each comprising of like index elements. Let's discuss certain ways in which this task can be done.

Method #1 : Using zip() + list comprehension The combination of above functions can work together to achieve this particular task. In this, we combine the like index elements into column using zip(), and binding all tuples into a list and iteration of dictionary lists is performed by list comprehension.

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

Time Complexity: O(n*n) where n is the number of elements in the list “test_list”. AND operation - Using zip() + list comprehension performs n*n number of operations.
Auxiliary Space: O(n), extra space is required where n is the number of elements in the list

Method #2 : Using zip() + values() The combination of above functions can be used to perform this task. In this, we access the values of dictionary values using the values() and aggregating columns can be done using zip().

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

Method #3: Using numpy.column_stack()

Note: Install numpy module using command "pip install numpy"

Using numpy.column_stack() converts lists to column tuples by using the numpy.column_stack() function. The method first imports the numpy library, then initializes a dictionary "test_dict" containing three lists. Then it prints the original dictionary. The numpy.column_stack() function is then used to stack the lists in the dictionary side by side. It takes a list of lists and forms a 2D array by stacking the input lists as columns.

Output:

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [[ 9 6 1]
[ 1 7 4]
[11 4 5]]

Time complexity: O(n) where n is the number of elements in the lists as numpy.column_stack() is a constant time operation, regardless of the number of input lists.
Auxiliary space: O(n) as it creates a new 2D array of tuples.

Method #4: Using pandas.DataFrame

Pandas is a popular library for data manipulation and analysis in Python. We can use the DataFrame function of pandas to create a table from the dictionary and then select the columns to create tuples.

Here is the step-by-step approach:

Import the pandas library.
Create a pandas DataFrame from the dictionary using the from_dict() method.
Select the columns using the loc[] accessor.
Convert the DataFrame to a list of tuples using the to_records() method.

Output:

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

Time complexity: O(n), where n is the number of key-value pairs in the dictionary.
Auxiliary space: O(n), where n is the number of key-value pairs in the dictionary.

Method #5 : Using nested for loop + values() method

Step-by-step approach:

Extract values of dictionary using values() method
Use nested for loops to convert values list to like index column tuples
Display column tuples

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

Time Complexity: O(M*N) M - length of values list N - length of each list in value list
Auxiliary Space: O(M*N) M - length of values list N - length of each list in value list

Method #6 : Using heapq :

Algorithm :

Initialize the dictionary test_dict with three keys list1, list2, and list3, each having a list of integers as its value.
Print the original dictionary test_dict.
Create an empty list res to store the tuples of columns.
Iterate over each index i of the lists in test_dict using a for loop, where i goes from 0 to the length of any list in test_dict.
Inside the loop, use a list comprehension to extract the values at index i from each list in test_dict.
Pass the resulting list of extracted values to the heapq.nsmallest() function to get the n smallest elements, where n is the length of test_dict.
Store the resulting list of smallest elements in the temp variable.
Convert temp to a tuple and append it to the res list.
Repeat steps 5-8 for all indices i.
Print the resulting list res, which contains tuples with elements corresponding to the original lists in test_dict at the same index.

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (1, 4, 7), (4, 5, 11)]

Time complexity:

The for loop iterates over len(test_dict) elements, which is the length of any list in test_dict.
The list comprehension extracts the values at the current index from each list in test_dict, taking O(len(test_dict)) time.
The heapq.nsmallest() function takes O(len(test_dict) log len(test_dict)) time to return the smallest n elements.
Therefore, the overall time complexity of the algorithm is O(len(test_dict)^2 log len(test_dict)).
Space complexity:

The space required to store the res list is O(len(test_dict)^2).
The space required to store the temp list is O(len(test_dict)).
Therefore, the overall space complexity of the algorithm is O(len(test_dict)^2).

Method #7: Using itertools:

Algorithm :

Initialize a dictionary named test_dict containing three keys, each mapped to a list of integers.
Print the original dictionary.
Use itertools.zip_longest() method to combine the values of each key into tuples, filling missing values with -inf.
Use map() and tuple() functions to convert the tuples of values into tuples of columns.
Store the resulting tuples of columns in a list named res.
Print the resulting list of tuples.

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

The time complexity : O(n*m) where n is the number of keys in test_dict and m is the length of the longest list among the values. This is because the zip_longest() method and map() function both iterate through each element of the lists, and the tuple() function has a constant time complexity.

The space complexity : O(m) because we are creating a new tuple for each column of values, and the size of each tuple is proportional to the length of the longest list among the values. Additionally, we are creating a new list to store the resulting tuples of columns.

Method #8: Using reduce():

Algorithm:

Initialize a dictionary test_dict with three keys and corresponding list values.
Print the original dictionary.
Use the zip_longest function from the itertools module to combine the lists in test_dict into tuples. The fillvalue parameter is set to -inf to ensure that all tuples have the same length, even if some lists are shorter than others.
Use the reduce function from the functools module to convert the tuples into column tuples. The reduce function takes a lambda function as the first argument, which combines the current accumulator value (an empty list at the start) with the current value from the zip_longest iterator. The lambda function creates a new tuple with the three values in the current value and appends it to the accumulator.
Print the resulting list of column tuples.

Output

The original dictionary is : {'list1': [1, 4, 5], 'list2': [6, 7, 4], 'list3': [9, 1, 11]}
Like index column tuples are : [(1, 6, 9), (4, 7, 1), (5, 4, 11)]

Time complexity:

The zip_longest function iterates over the values in test_dict and creates tuples with fillvalue when necessary. The time complexity of zip_longest is O(n*m), where n is the number of keys in test_dict and m is the length of the longest list.
The reduce function iterates over the tuples created by zip_longest and applies the lambda function to them. The time complexity of reduce is O(n), where n is the number of tuples.
Therefore, the overall time complexity of the code is O(n*m).

Space complexity:

The zip_longest function creates tuples with fillvalue when necessary. The number of tuples created is the length of the longest list in test_dict. Therefore, the space complexity of zip_longest is O(m).
The reduce function creates a new list with the column tuples. The length of this list is equal to the number of tuples created by zip_longest. Therefore, the space complexity of reduce is also O(m).
Therefore, the overall space complexity of the code is O(m).

Comment

Article Tags:

Python

Python Programs

Python list-programs

Explore

Python Fundamentals

Python Data Structures

Advanced Python

Data Science with Python

Web Development with Python

Python Practice

Python Courses

URL: https://www.geeksforgeeks.org/python/python-convert-lists-to-column-tuples/

⇱ Python | Convert Lists to column tuples - GeeksforGeeks

Python | Convert Lists to column tuples

Explore