Iteration in Python Flashcards
What makes an object iterable? Give the simple AND Pythonic explanation, and give 3 examples.
Simple: Can be looped (iterated) over
Pythonic: Returns an iterator when passed to iter()
, i.e. the object has dunder methods \_\_iter\_\_()
and/or \_\_getitem\_\_()
defined.
Examples: str, list, dict
What is an iterator? Give the simple AND Pythonic explanation.
Simple: An iterable object that has two main actions:
* Returns data one item at a time
* Keeps track of current and visited items
Pythonic: An iterable object that implements the iterator protocol, i.e. defines \_\_next\_\_()
and returns itself within \_\_iter\_\_()
What is the purpose of \_\_next\_\_()
?
It’s used by an iterator to:
* Define exactly how it will traverse through its stored values
* Return the next value in the iteration
* Keep track of which values have already been called
* Raise a StopIteration
exception when no more values are available
What’s one memory-related benefit of using iterators?
Iterators allow you to process the entire dataset one item at a time, which removes the need to store the entire dataset into memory.
What are the functions of the 3 types of iterators?
- Take stream of data, yield original data (classic iterator)
- Take stream of data, transform original data and yield new data
- Take no input data, yield results from a computation
What does it mean when you get a StopIteration
error when calling an iterator with next()
?
The iterator object is consumed, i.e. it has iterated through all of its values and you’ll need to create another one if you want to perform another iteration.
Implement a classic iterator that takes a sequence of values as an input and yields each value sequentially
create an iterator that takes a sequence of values at instantiation and is
class SequenceIterator: # store the sequence and set the index to 0 at initialization def \_\_init\_\_(self, sequence): self._sequence = sequence self._index = 0 # per iterator protocol, iterators will always return itself in \_\_iter\_\_() def \_\_iter\_\_(self): return self # return the current value of the sequence and increment the index by 1 # if index is incremented past the sequence, raise StopIteration def \_\_next\_\_(self): if self._index < len(self._sequence): item = self._sequence[self._index] self._index += 1 return item raise StopIteration
What does iter()
do?
Calls an iterable object’s \_\_iter\_\_()
method to return an iterator. If \_\_iter\_\_()
doesn’t exist but \_\_getitem\_\_()
does, it will return an iterator that calls \_\_getitem\_\_()
with i = range(inf)
until a StopIteration
exception is raised
T/F: An object isn’t iterable without \_\_iter\_\_()
defined
False, as long as \_\_getitem\_\_()
exists instead. If both don’t exist, then it’s True.
Implement Python’s for
loop using the iterator protocol
iterator = sequence.\_\_iter\_\_() while True: try: # get the next item using \_\_next\_\_() item = iterator.\_\_next\_\_() except StopIteration: break else: # the loop's code goes here print(item)
What does it mean when an iterator is lazy?
It means that an iterator will only calculate and/or return one value at a time. It is hugely efficient in terms of memory consumption because your code will only require memory for a single item at a time.
How do you create an infinite iterator?
Create a normal iterator, but don’t include the StopIteration
exception
Which of Python’s abstract base classes (ABCs) can you inherit to more easily create custom iterators?
collections.abc.Iterator
, which
What is a generator function?
A unique function that returns a generator iterator and keeps track of its state with the yield
keyword
How do generator functions/expressions differ from custom iterable classes?
Write a generator function that yields an infinite sequence of numbers, then state the built-in one from Python
def infinite_sequence(): num = 0 while True: yield num num += 1
itertools.count()
How does the syntax differ between a generator expression and list comprehension?
Generator expression uses ()
, while list comprehension uses []
What’s a downside of using generator expressions in place of list comprehension?
Since generator expressions are equivalent to creating a generator function and then calling it, there is added overhead of calling the function. Because of this, generator expressions are slightly slower than list comprehension
T/F: Generator expressions are more performant than generator functions
False, expressions just allow you to define simple generators in a single line with an assumed yield
at the end of each inner iteration
How does yield
work in a generator function?
When the generator from a generator function is called, the code within the function is executed up to the yield
statement. The program then suspends the function, returns the yielded value to the caller, and then saves the state of the function (i.e. the local variables, instruction pointer, internal stack, and exception handling). When the generator is called again, the function continues off from yield
until it hits another yield
, or when StopIteration
is returned
What happens when you use return
in a generator function?
It will stop the function execution completely. It’s useful if you want more control over function termination than just hitting StopIteration
What are the main differences between yield
and return
?
-
return
stops the function execution completely, whileyield
suspends function execution -
return
returns the entire output of a function, whileyield
returns a generator object -
yield
keeps track of the state of the function execution, whilereturn
does not
Explain what’s happening in the code below, assuming that is_palindrome(num)
returns True or False depending on whether num
is a palendrome:
def infinite_palindromes(): num = 0 while True: if is_palindrome(num): i = (yield num) if i is not None: num = i num += 1
and you execute the code below:
~~~
pal_gen = infinite_palindromes()
for j in pal_gen:
print(f”current palendrome: {i}”)
digits = len(str(j))
pal_gen.send(10 ** (digits))
~~~
-
pal_gen
is the generator returned by theinfinite_palindromes()
gen function. -
pal_gen
will find the next palindrome in the sequence, set variablei
to the yielded result within the gen function withi = (yield num)
, and then yield the result asj
in the for loop. -
digits
stores the number of digits in the current palindrome. -
10 ** digits
is then sent back topal_gen
using.send()
wherei
is now updated. - The gen function resumes execution, now with an updated
num
value.
current palendrome: 11 current palendrome: 111 current palendrome: 1111 current palendrome: 10101 current palendrome: 101101
If pal_gen
is a generator, what’s the difference between the following methods?
pal_gen.throw(ValueError("We don't like large palindromes!"))
and
~~~
pal_gen.close()
~~~
Both methods will cause the generator to finish, however .throw()
will tell pal_gen
to throw the given exception, while .close()
will always tell pal_gen
to throw StopIteration