Lists#
Questions:#
How can I store multiple values in a single variable?
Learning Objectives:#
Explain why programs need collections of values
Write programs that create lists, index them, slice them, and modify them through assignment and method calls
A list stores many values in a single structure#
We have seen how to store values by assigning them to variables
With data sets of any reasonable size, it would rapidly become inefficient and confusing to store each data point with a separate variable name
For example, here are average life expectancies for Canada over 100 years, from the Gapminder data set:
life_exp_1900 = 48.1
life_exp_1920 = 56.6
life_exp_1940 = 64.0
life_exp_1960 = 71.0
life_exp_1980 = 75.2
life_exp_2000 = 79.2
While you can store the data this way, it requires a lot of variable names. As well, although there is structure in the data (they reflect an ordered sequence of dates), this ordering is not reflected in how the data are stored.
We can use a list to store many values together
Lists are defined by one or more items contained within square brackets
[]
, with individual values (list items) separated by commas,
:
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2]
print('Life expectancies:', life_exp)
Life expectancies: [48.1, 56.6, 64.0, 71.0, 75.2, 79.2]
Like strings, we can use the len()
function to get the length of a list
print('length:', len(life_exp))
length: 6
Here, length refers to the number of items in the list
Lists may contain values of different types#
A single list may contain virtually any Python data type, including integers, floats, strings, and even other lists!
personal_info = ['First Name', 'Galena',
'Last Name', 'Jeppesen',
'Age', 41,
'Height', 1.58,
'Children', ['Wayne', 'Miguel']
]
Python allows us to input a list over multiple lines. Jupyter nicely indents the additional lines to help us see that they are continuations of the same list.
(Data generated by https://www.behindthename.com)
Indexing is similar for lists and strings#
We previously saw how we could get single characters from a character string, using indexes in square brackets:
element = 'carbon'
print('zeroth character:', element[0])
print('third character:', element[3])
zeroth character: c
third character: b
Use an item’s index to fetch it from a list#
Indexing works for lists as it does for strings:
print('zeroth item of life_exp:', life_exp[0])
print('fourth item of life_exp:', life_exp[4])
zeroth item of life_exp: 48.1
fourth item of life_exp: 75.2
Slicing is similar for lists and strings#
Recall that we can slice a string to get a range of values, e.g.,
print('First three items:', element[0:3])
We can do the same thing with lists:
print('First four items in life_exp:', life_exp[0:4])
print('Last item in life_exp:', life_exp[-1])
First four items in life_exp: [48.1, 56.6, 64.0, 71.0]
Last item in life_exp: 79.2
To get all the items in a list from a certain index onwards, use :
after the initial index#
# Print all of the values from the fourth item to the end.
print(life_exp[3:])
[71.0, 75.2, 79.2]
Similarly, use :
before the stopping index value to get all preciding elements#
# Print the first four items in life_exp
print("First four items in life_exp:", life_exp[:4])
First four items in life_exp: [48.1, 56.6, 64.0, 71.0]
Indexing can be used to replace a value in a list#
life_exp[0] = 48.3
print(life_exp)
[48.3, 56.6, 64.0, 71.0, 75.2, 79.2]
Indexing beyond the end of the collection is an error.#
Python reports an IndexError
if we attempt to access a value that doesn’t exist.
This is a kind of runtime error, as discussed in the section on built-ins
It’s not a syntax error, because it cannot be detected as the code is parsed before being run; this is because the index might be calculated based on data
print('99th element of life_exp is:', life_exp[99])
---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
Cell In[10], line 1
----> 1 print('99th element of life_exp is:', life_exp[99])
IndexError: list index out of range
Appending items to a list lengthens it#
We can use the .append()
method to add an item to the end of a list
Here we will add the life expectancy for 2020 in Canada to extend our list:
print(life_exp)
life_exp.append(82.3)
print(life_exp)
Methods#
.append()
is a list methodMethods are like functions, but tied to a particular type of object. In other words, the fact that there is a list
.append()
method does not mean that we can use.append()
with other Python objects that aren’t lists.Use object_name.method_name() to call methods.
We will meet other methods of lists as we go along.
Use
help(list)
for a preview of the methods available for objects of typelist
help(list)
Many methods modify a data object in place#
Note in the above example of .append()
that we didn’t need an assignment statement (i.e., a variable name and the =
operator) to modify life_exp
. That is, we didn’t write:
life_exp = life_exp.append(82.3)
We wrote:
life_exp.append(82.3)
In fact, the list .append()
method modifies the list in place and returns None
, so if we tried:
life_exp = life_exp.append(82.3)
print(life_exp)
The resulting value of life_exp
would be:
None
which is definitely not what we want!
Extending a list#
the
.append()
method only allows you to add a single item to the end of a listsee what happens if you try to append a list to an existing list:
life_exp_2000s = [85.1, 87.3, 89.5, 91.6]
life_exp.append(life_exp_2000s)
print(life_exp)
The result is what we call a nested list, where the appended list is stored as a single list item — of type list
— in the original list. This is a consequence of the fact that lists can contain multiple types.
In this case, what we want is a flat list, a single list of floats.
The .extend()
method will do this. It is similar to append
, but it allows you to combine two lists. For example:
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2]
life_exp_2000s = [82.3, 85.1, 87.3, 89.5, 91.6]
life_exp.extend(life_exp_2000s)
print(life_exp)
Insert a new list entry at a specific position#
The result of help(list)
above includes a method .insert()
, which takes two arguments: an index (position) in the list that you want the element inserted before, and the value to insert. For example, to add the value 46.9 to the start of our life_exp
list, we would use:
life_exp.insert(0, 46.9)
print(life_exp)
Removing items from a list#
There are two list methods that you can use to remove an element from a list:
.remove()
will remove the first instance of a specific value you give it as an argument.pop()
will remove an element at a specific position (index) that you give as an argument
In the example below, we extend one list with another, but then we notice that the last value in the first list, is duplicated as the first value in the second list. As well, the last value in the second list is duplicated. So we need to remove one copy of each repeated value.
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2, 82.3]
life_exp_2000s = [82.3, 85.1, 87.3, 89.5, 91.6, 91.6]
life_exp.extend(life_exp_2000s)
print('With duplicated values:', life_exp)
We will remove the value that’s duplicated in the two lists, 82.3
, using .remove()
:
life_exp.remove(82.3)
print('Removed duplicate value:', life_exp)
We will remove the value that’s duplicated at the end using .pop()
:
life_exp.pop(-1)
print('Removed duplicate value:', life_exp)
An empty list contains no values#
We can assign []
to a variable name, to represent a list that doesn’t contain any values.
This can be a good starting point if we want to create a list by appending values as we calculate them
The example below is trivial, but in later lessons we will see better uses for empty lists
x = []
y = 1
x.append(y)
x.append(y + 7)
print(x)
Difference between strings and lists#
Both strings and lists are called collections in Python, so they can be indexed
Although indexing works similarly for strings and lists, the two data types are different in other ways
Importantly, strings are immutable — they cannot be changed after they have been created
Python considers the string to be a single value with parts, not a collection of values
In contrast, lists are mutable — as we saw above, they can be changed such as by:
using indexing to change a single list element
appending items to a list with
.append()
or.extend()
deleting items from a list with
del
So because we defined element
above as a string, we cannot change one letter in the string:
element[0] = 'C'
element[0] = 'C'
Likewise, you cannot .append()
to a string, nor can you del
a string element.
Indexing beyond the end of the collection is an error.#
Python reports an IndexError
if we attempt to access a value that doesn’t exist.
This is a kind of runtime error, as discussed in the section on built-ins
It’s not a syntax error, because it cannot be detected as the code is parsed before being run; this is because the index might be calculated based on data
print('99th element of life_exp is:', life_exp[99])
Exercises#
Fill in the Blanks#
Fill in the blanks so that the program below produces the output shown.
values = ____
values.____(1)
values.____(3)
values.____(5)
print('first time:', values)
values = values[____]
print('second time:', values)
Output:
first time: [1, 3, 5]
second time: [3, 5]
How Large is a Slice?#
Imagine we have a list like life_exp
above. If a
and b
are both non-negative integers (e.g., 2
and 5
), how long is the list life_exp[a:b]
?
Click to show the answer.
Solution
Indexing works similarly for lists as we saw previously for strings: it returns from the first item specified in the range up to, but not including, the last item in the range. For example, `values[2:5]` has the 3 elements `life_exp[2]`, `life_exp[3]`, and `life_exp[4]`. Put another way, the list `life_exp[a:b]` has `b - a` elements.Note that the expression will only work if b
is less than the total length of the list life_exp
.
From Strings to Lists and Back#
What does the list()
function do when applied to a string?
For example:
list('tin')
list('tin')
And what does the .join()
method do?
E.g.:
''.join(['g', 'o', 'l', 'd'])
''.join(['g', 'o', 'l', 'd'])
What happens if you put a character inside the quote marks for .join()
, like this?
'-'.join(['g', 'o', 'l', 'd'])
'-'.join(['g', 'o', 'l', 'd'])
Click the plus to show the answer
Solution
list('some string')
converts a string into a list containing all of its characters.join
returns a string that is the concatenation of each string element in the list and adds the separator between each element in the list. This results inx-y-z
. The separator between the elements is the string that provides this method.
Indexing from the end#
What does the following program print?
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2, 82.3]
print(life_exp[-1])
How does Python interpret a negative index?
If a list or string has N elements, what is the most negative index that can safely be used with it, and what location does that index represent?
What does
del life_exp[-1]
do?How can you display all elements but the last one in a list (or string)? (Hint: you will need to combine slicing and negative indexing.)
Click the plus to show the answer
The program prints 82.3
, which is the last element in the list.
Python interprets a negative index as starting from the end (as opposed to starting from the beginning). The last element is
-1
.The last index that can safely be used with a list of N elements is element
-N
, which represents the first element.del life_exp[-1]
removes the last element from the list.life_exp[:-1]
If we write a slice as
start:stop:step
, what doesstep
do?
Click the plus to show the answer
step
is the step size of the slice
Stepping Through a List#
What does the following program print?
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2, 82.3]
print(life_exp[::2])
print(life_exp[::-1])
If we write a slice as
start:stop:step
, what doesstep
do?What expression would select all of the even-numbered items from a collection?
Click the plus to show the answer
step
is the step size of the sliceThe slice
1::2
selects all even-numbered items from a collection: it starts with element1
(which is the second element, since indexing starts at0
), goes on until the end (since noend
is given), and uses a step size of2
(i.e., selects every second element).
Slice Bounds#
What does the following program print?
life_exp = [48.1, 56.6, 64.0, 71.0, 75.2, 79.2, 82.3]
print(life_exp[0:20])
print(life_exp[-1:3])
Even though we saw above that we get an error if we try to index a single element that is outside the range of items in the list (e.g., print(life_exp[20])
would fail), we can use a value outside of the elements in a list when specifying a range.
What do you predict will be printed to the screen if this program is executed?
# Program A
letters = list('gold')
letters_s = sorted(letters)
print('letters is', letters, 'and letters_s is', letters_s)
# Program B
letters = list('gold')
letters_s = letters.sort()
print('letters is', letters, 'and letters_s is', letters_s)
Click the plus to show the answer
Solution
This example highlights the importance of understanding how individual functions and methods work.
Program A prints
letters is ['g', 'o', 'l', 'd'] and letters_s is ['d', 'g', 'l', 'o']
Program B prints
letters is ['d', 'g', 'l', 'o'] and letters_s is None
sorted()
is a function that returns a sorted copy of the input list letters
; the original
list letters
remains unchanged. In contrast, .sort()
is a method that sorts the list
letters
in-place (it modifies its input) and does not return anything (hence None
).
Copying (or Not)#
What do these two programs print?
In simple terms, explain the difference between x = y
and x = y[:]
.
# Program A
y = list('gold')
x = y
x[0] = 'D'
print('x is', x, 'and y is', y)
# Program B
y = list('gold')
x = y[:]
x[0] = 'D'
print('x is', x, 'and y is', y)
Click the plus to show the answer
Solution
Program A prints
x is ['D', 'o', 'l', 'd'] and y is ['D', 'o', 'l', 'd']
Program B prints
x is ['D', 'o', 'l', 'd'] and y is ['g', 'o', 'l', 'd']
x = y
makes x
a reference to the list y
in memory; the variables x
and y
both point towards the same object. So when we change x
, we also change y
.
In contrast, x = y[:]
creates a new list object x
containing all elements from the list y
; x
and y
are different objects, and so changing one has no effect on the other.
The fact that some operations create copies while others make new pointers to existing objects, is an important lesson to learn in Python. Different functions and methods work in different ways; the important thing for now is to be aware that both results could happen.
Although using x = y[:]
works to create a copy of a list, it’s not the clearest, most explicit, or “pythonic” way to copy a list. A better approach is to use the list method .copy()
. This is preferable because it’s easier to understand what the code is doing:
y = list('gold')
x = y.copy()
x[0] = 'D'
print('x is', x, 'and y is', y)
Prints:
x is ['D', 'o', 'l', 'd'] and y is ['g', 'o', 'l', 'd']
Summary of Key Points:#
A list stores many values in a single structure
Lists may contain values of different types
The empty list contains no values
Use an item’s index to fetch it from a list
Lists’ values can be replaced by assigning to them
Appending items to a list lengthens it
if
.append()
is used to add a collection of items (such as a list), they are added as a single new list itemThe list method
.extend()
can be used with a collection of items (such as another list) to create a flat listThe list methods
.remove()
and.pop()
can be used to delete list items, using either their value or index, respectivelyCharacter strings can be indexed like lists
Character strings are immutable, whereas lists are mutable (changable)
Indexing beyond the end of the collection is an error, but slicing can include indexes that do not occur in the list
This section was adapted from Aaron J. Newman’s Data Science for Psychology and Neuroscience - in Python and Software Carpentry’s Plotting and Programming in Python workshop.