VOOZH about

URL: https://www.geeksforgeeks.org/python/python-aggregate-values-by-tuple-keys/

⇱ Python | Aggregate values by tuple keys - GeeksforGeeks


  • Courses
  • Tutorials
  • Interview Prep

Python | Aggregate values by tuple keys

Last Updated : 7 May, 2023

Sometimes, while working with records, we can have a problem in which we need to group the like keys and aggregate the values of like keys. This can have application in any kind of scoring. Let's discuss certain ways in which this task can be performed. 

Method #1 : Using Counter() + generator expression The combination of above functions can be used to perform this particular task. In this, we need to first combine the like key elements and task of aggregation is performed by Counter(). 

Output : 
The original list is : [('gfg', 50), ('is', 30), ('best', 100), ('gfg', 20), ('best', 50)]
List after grouping : [('best', 150), ('gfg', 70), ('is', 30)]

Time Complexity: O(n*n) where n is the number of elements in the list “test_list”. Counter() + generator expression performs n*n number of operations.
Auxiliary Space: O(n), extra space is required where n is the number of elements in the list

Method #2 : Using groupby() + map() + itemgetter() + sum() The combination of above functions can also be used to perform this particular task. In this, we group the elements using groupby(), decision of key's index is given by itemgetter. Task of addition(aggregation) is performed by sum() and extension of logic to all tuples is handled by map(). 

Output : 
The original list is : [('gfg', 50), ('is', 30), ('best', 100), ('gfg', 20), ('best', 50)]
List after grouping : [('best', 150), ('gfg', 70), ('is', 30)]

Time Complexity: O(n*n), where n is the length of the list test_list 
Auxiliary Space: O(n) additional space of size n is created where n is the number of elements in the res list 

Method #3: Using reduce():

Algorithm:

  • Import the required modules, functools, itertools and operator.
  • Initialize the given list of tuples.
  • Use the reduce function to iterate through the list of tuples, filtering the tuples with the same first element and summing their second element.
  • Append the tuples obtained from step 3 to the accumulator list, if the first element of the tuple is not
  • present in the accumulator list, otherwise return the accumulator list unchanged.
  • Finally, print the list after grouping.

Output
The original list is : [('gfg', 50), ('is', 30), ('best', 100), ('gfg', 20), ('best', 50)]
List after grouping : [('gfg', 70), ('is', 30), ('best', 150)]

Time Complexity: O(nlogn), where n is the length of the input list. This is due to the sorting operation performed by the groupby function.
Auxiliary Space: O(n), where n is the length of the input list. This is due to the list created by the reduce function to store the output tuples.

METHOD 4:Using dictionary.

APPROACH:

The program takes a list of tuples as input and aggregates the values by the tuple keys. In other words, it groups the values of tuples with the same key and sums their values.

ALGORITHM:

1.Initialize an empty dictionary d.
2.Loop through each tuple in the list:
a.Check if the key of the tuple is already present in the dictionary.
b.If the key is present, add the value of the tuple to the existing value of the key in the dictionary.
c.If the key is not present, add the key-value pair to the dictionary.
5.Convert the dictionary to a list of tuples using the items() method.
6.Print the list.


Output
List after grouping : [('gfg', 70), ('is', 30), ('best', 150)]

Time Complexity:

The time complexity of this program is O(n), where n is the length of the input list.

Space Complexity:

The space complexity of this program is O(m), where m is the number of unique keys in the input list. This is because the program creates a dictionary to store the keys and their corresponding values.

Comment