data:image/s3,"s3://crabby-images/e5ae6/e5ae6df912f294954a3cc36dd3cdb2f3256087a6" alt="Photo by rahu on Pixabay"
As one of the most widely used "Pythonic" syntax in Python, you must have ever heard List Comprehension or have already used it frequently. It is almost a must-have section in various Python elementary tutorials. However, did you know that there are actually 4 types of "comprehension" in Python rather than only for the lists? You may also have heard or ever used Dictionary Comprehension, but might not for the set and generator.
In this article, I’ll introduce all the 4 types comprehensions in Python with examples. In the last section, I also want to give an example to show you when not to use comprehensions for better readability. Use it with care, please!
1. List Comprehension
data:image/s3,"s3://crabby-images/33c17/33c174cffb62473b176a3f91be42ad809ab608d7" alt="Photo by Pexels on Pixabay"
This is definitely the most commonly used one in Python. You’ll see them everywhere. The basic syntax is as follows:
my_list = [<expression> for <item> in <iterable> if <condition>]
Here, the expression
may be exactly the same as item
which means that we don’t want to do anything to the items and just have the as-is.
The condition
is optional. Only use it when you want to do a filter.
So, basically:
expression = item
when you want to leave theitem
unchanged.expression != item
when you want to apply some logic to modifyitem
.condition
will help to filter theiterable
which may return feweritems
.condition
is optional so that if it is suppressed the length of theiterable
must be equal tomy_list
.
Let’s have an example for demonstration. Suppose we have a list with just 1–10 integers.
my_list = [1,2,3,4,5,6,7,8,9,10]
What we want to do is to have a new list with the numbers in my_list
- Filter the number to have only even numbers (use
condition
) - With the filtered number, plus one to it before it is pushed into the new list (use
expression
)
new_list = [number + 1 for number in my_list if number % 2 == 0]
data:image/s3,"s3://crabby-images/a4cf0/a4cf0f5f837031f6bec9b6afa80538b818aacb40" alt=""
Very concise and readable! What if we don’t use List Comprehension? Here is an example.
new_list = []
for number in my_list:
if number % 2 == 0:
new_list.append(number + 1)
data:image/s3,"s3://crabby-images/a6405/a64053564fce143df7875a3de8d1a17e61874b7a" alt=""
It also has pretty good readability, but less concise. Also, we have to initialise the new_list before anything else.
So, which one do you like? I think the answer is obvious for most of us
2. Dictionary Comprehension
data:image/s3,"s3://crabby-images/b2d89/b2d89d3258b57ef93dc0df934c12a15ccab2fdb1" alt="Photo by Deedee86 on Pixabay"
The name, "Something" Comprehension, is actually depending on the output object type rather than the iterable that is used in the comprehension. Therefore, a Dictionary Comprehension will generate a Dictionary.
The basic syntax is as follows:
my_dict = [<key>:<value> for <item> in <iterable> if <condition>]
Please be noticed that the key
and value
here can be expressions with or without item
.
Let’s say that we have a list of dictionary objects which contain employee names, age and job titles.
persons = [
{
'name': 'Alice',
'age': 30,
'title': 'Data Scientist'
},
{
'name': 'Bob',
'age': 35,
'title': 'Data Engineer'
},
{
'name': 'Chris',
'age': 33,
'title': 'Machine Learning Engineer'
}
]
The requirement is to get all the employees having their title contains the string "Data". So, we can call them "data employees".
Also, we’re not interested in the employees’ age, so that we want to have such a dictionary with the employee names as the keys and their job titles as the value.
This time, let’s write the code without dictionary comprehension first.
data_employees = {}
for p in persons:
if 'Data' in p['title']:
data_employees[p['name']] = p['title']
data:image/s3,"s3://crabby-images/cde58/cde58e1a6d2d8b6798f4ff60892b483baa715d0f" alt=""
Again, we have to initialise an empty dictionary at the beginning. Inside the for-loop, we will put the if-condition to filter the employees’ job title. Finally, we need to make sure that the structure of the new dictionary obeys the requirements.
With Dictionary Comprehension, we can put all these in one line and even in prove its readability.
data_employees = {p['name']:p['title'] for p in persons if 'Data' in p['title']}
data:image/s3,"s3://crabby-images/d86d1/d86d15faa49e2dc8c6000127551fce1b76d1c1bc" alt=""
Besides concise, I would say that the Dictionary Comprehension is even more readable.
3. Set Comprehension
data:image/s3,"s3://crabby-images/1b285/1b2859ed79550504f3886a10f672177b2e867b8d" alt="Photo by congerdesign on Pixabay"
Yep, there is a Set Comprehension syntax in Python, which many people don’t know. In fact, the only difference between a Set Comprehension and a List Comprehension is that the former uses curly braces {}
rather than square brackets []
.
The syntax is as follows:
my_set = {<expression> for <item> in <iterable> if <condition>}
Are you confusing it with Dictionary Comprehension? The difference between these two is that the Set Comprehension doesn’t have the key:value
pair, but only one expression
.
Now, let reuse the employee list used in the previous section.
Suppose we are not using Set Comprehension.
data_employees_set = set()
for p in persons:
if 'Data' in p['title']:
data_employees_set.add(p['name'])
data:image/s3,"s3://crabby-images/45932/4593298aac4c788e4311554a7bc26e0da8a4d214" alt=""
Then, let’s use it.
data_employees_set = {p['name'] for p in persons if 'Data' in p['title']}
data:image/s3,"s3://crabby-images/c4506/c45066e7e00ae1b04a9167c38b73009f43318f2b" alt=""
Keep in mind that the difference between a set and a list is that the items in a set cannot be repeated and not in order
4. Generator Comprehension
data:image/s3,"s3://crabby-images/68ae1/68ae1849c77ec96f2b7f896168dda2a55729182e" alt="Photo by DaKub on Pixabay"
Last but not the list, Generator Comprehension is also not known by most of the people. The generator is a quite unique syntax in Python, it is usually generated from a function with yield
keyword rather than Python.
For example, let’s use the same number list and requirements from the List Comprehension section. We can define a function to make a generator for us.
def even_generator(numbers):
for n in numbers:
if n % 2 == 0:
yield int(n/2)
eg = even_generator(my_list)
A generator can be iterated by its built-in function next()
. So, we can keep getting the next value of the generator by calling print(next(eg))
.
data:image/s3,"s3://crabby-images/6b285/6b285a9ade05b45721722d653d939756b7cbc9ad" alt=""
In fact, all these can be done using Generator Comprehension. The only difference between Generator Comprehension and List Comprehension is that the former uses parentheses.
my_gen = (<expression> for <item> in <iterable> if <condition>)
Yes, we even don’t need the yield
keyword! Let’s give it a try.
eg = (int(number/2) for number in my_list if number % 2 == 0)
data:image/s3,"s3://crabby-images/22f76/22f768f7f36443054e3b56d38929d9b2ec33088c" alt=""
That’s it, all done.
5. Nested Comprehension
data:image/s3,"s3://crabby-images/65e7a/65e7af85ddafdc791f12411a8d40774f10b595c6" alt="Photo by JerzyGorecki on Pixabay"
We have explored all the 4 types comprehensions in Python. They are really nice syntax in Python. However, as far as I know, a lot of learners tend to use them too frequently once they understand how to use the comprehensions. This might not be a good thing.
We usually like to use comprehensions to write more concise and readable codes. However, sometimes the logic is relatively complex, it will hurt the readability if we use the comprehensions.
One of the typical cases is nested comprehensions.
For example, we want to print a multiplication table from 1–10. Here is a normal way to implement this. Of course, without any list comprehensions.
for row in range(1, 10):
for col in range(1, row+1):
print(f'{col}x{row}={row*col} t', end='')
print("")
data:image/s3,"s3://crabby-images/eba62/eba622bfdfbb99f8d3d9ed458009684f2bce457a" alt=""
A few things to be noticed, it is fairly easy for us to control the output format because we can have multiple expressions inside the for-loop. For example, we can use the end=''
argument in the print()
function to prevent it output a new line.
If we use List Comprehension to implement this, of course we can, but lots of stuff will be squeezed in one line of code which really damage the readability.
print('n'.join([''.join([f'{col}x{row}={row*col} t' for col in range(1, row + 1)]) for row in range(1, 10)]))
data:image/s3,"s3://crabby-images/f6e47/f6e479fddb39855720cc04ad6455af78859e3436" alt=""
I bet no one can understand the code above very easily.
Basically, I’m not suggesting to abandon nested comprehensions. Sometimes, it is OK if the logic is really simple. However, it worth to think twice whether we really need to use the comprehensions if we meet such a case that needs nested comprehensions.
Summary
data:image/s3,"s3://crabby-images/fdcf9/fdcf97aeb9b0077333f6a83c0e4e6737334bd3b5" alt="Photo by NWimagesbySabrinaEickhoff on Pixabay"
In this article, I have introduced all the 4 types of comprehensions in Python – List, Dictionary, Set and Generator Comprehension. Most of the time, they are very elegant, concise and even can improve the readability of our code. That is the so-called "Pythonic".
However, we also need to pay attention not to abuse it. When the logic is not simple, the comprehension can hurt the readability. In many cases that need nested comprehensions, we need to ask ourselves: Should I write the logics in nested comprehensions?
Hope you enjoy the reading. Happy coding!
If you feel my articles are helpful, please consider joining Medium Membership to support me and thousands of other writers! (Click the link above)