print("Hello World")
Hello World
Python is a general purpose programming language that allows for both simple and complex data analysis. Python is incredibly versatile, allowing analysts, consultants, engineers, and managers to obtain and analyze information for insightful decision-making.
The Jupyter Notebook is an open-source web application that allows for Python code development. Jupyter further allows for inline plotting and provides useful mechanisms for viewing data that make it an excellent resource for a variety of projects and disciplines.
The following section will outline how to install and begin working with Python and Juypter.
Instruction guides for Windows and MacOS are included below. Follow the one that corresponds with your operating system.
Open your browser and go here
Click on your OS and then “Download”
Run the downloaded file found in the downloads section from Step 2
Click through the install prompts
Go to menu (or Windows Explorer), find the Anaconda3 folder, and double-click to run OR Use a Spotlight Search and type “Navigator”, select and run the Anaconda Navigator program. Note that MacOS also comes with Python pre-installed, but you should not use this version, which is used by the system. Anaconda will run a second installation of Python and you should ensure that you only use this second installation for all tasks.
VSCode is a very popular tool for programming in a variety of languages. In can be used in place of Jupyter. It can be downloaded from here. After installation, follow this guide to set up Jupyter in VSCode. You will also need the Python extension and the Jupyter extension.
It is common practice to have a main folder where all projects will be located (e.g. “jupyter_research”). The following are guidelines you can use for Python projects to help keep your code organized and accessible:
You should now be set up and ready to begin coding in Python!
In this case, we will introduce you to more basic Python concepts. If you prefer, you can first go through the more extensive official Python tutorial and/or the W3School Python tutorial and practice more fundamental concepts before proceeding with this case. It is highly recommended that you go through either or both of these tutorials either before or after going through this notebook to solidify and augment your understanding of Python. For a textbook introduction to Python, see this text, from which some of the following material is adapted/taken.
Every program you’ve ever used, no matter how complicated, is made up of instructions that look pretty much like these. So you can think of programming as the process of breaking a large, complex task into smaller and smaller subtasks until the subtasks are simple enough to be performed with one of these basic instructions.
Note that writing a program involves typing out the correct code, and then running, or executing that code. For example, the print()
function allows you to print an object. Placing a python object in the round brackets and running the code makes the computer print the object.
print("Hello World")
Hello World
After printing, the simplest use of Python is as a calculator. We can use the operators +, -, /, and * perform addition, subtraction, division and multiplication. We can use ** for exponentiation. See the following examples:
print(40 + 2)
print(43 - 1)
print(6 * 7)
print(2**2)
42
42
42
4
Note that a Python program contains one or more lines of code, which are executed in a top-down fashion.
Use type()
to determine the type of a value. Example:
print(type(2))
print(type(42.0))
print(type('Hello, World!'))
<class 'int'>
<class 'float'>
<class 'str'>
If you are used to using Microsoft Excel, this is similar to how Excel distinguishes between data types such as Text, Number, Currency, Date, or Scientific. As noted above, some common data types that you will come across in Python are:
int
: 1
float
: 25.5
str
: 'Hello'
There are other data types available in Python; however, these are the three fundamental types that you will see across almost every Python program. Always keep in mind that every value, or object, in Python has a type and some of these “types” can be custom-defined by the user, which is one of the benefits of Python.
We can assign names to objects in python so that they are easy to manipulate, a named object is called a variable. Use the = sign to assign a name to a variable.
my_int
, this can be accomplished by writing the Python statement, my_int = 5
.my_int
is a variable, much like you might find in algebra, but the =
sign is for assignment not equality .my_int = 5
should be taken to mean something more like Let my_int be equal to 5
rather than my_int is equal to 5
.=5
my_intprint(my_int)
5
Here, my_int
is an example of a variable because it can change value (we could later assign a different value to it) and it is of type Integer, known in Python simply as int
. Unlike some other programming languages, Python guesses the type of the variable from the value that we assign to it, so we do not need to specify the type of the variable explicitly. For example,
int
: my_int = 1
float
: my_float = 25.5
str
: my_string = 'Hello'
Note that the names my_int
, my_float
and my_string
are arbitrary here. While it is useful to name your variables so that the names contain some information about what they are used for, this code would be functionally identical to if we had used x
, xrtqp2
and my_very_long_variable_name
, respectively.
=5
my_intprint(my_int)
="Canada"
countryprint(country)
5
Canada
We mentioned before that variables can change value. Let’s take a look at how this works, and also introduce a few more Python operations:
= 4
x print(x)
= 2
y = y + x
x print(x)
4
6
Again, if you’re used to syntax from mathematics “x = y + x” might look very wrong at first glance. However, it simply means “throw out the existing value of x, and assign a new value which is calculated as the sum of y and x”. Here, we can see that the value of x changes, demonstrating why we call them “variables”.
We can also use more operators than just +
and -
. We use *
for multiplication, /
for division, and **
for power. The standard order of operations rules from mathematics also applies and parentheses ()
can be used to override these.
A data structure is a data/value type which organizes and stores multiple values. These are more complicated data types that comprise many single pieces of data, organized in a specific way. Examples include dictionaries, arrays, lists, stacks, queues, trees, and graphs. Each type of data structure is designed to handle specific scenarios.
=
, the assignment operator, to assign a value to the variable.Syntax for creation:
= {'Key1': Value1, 'Key2': Value2, 'Key3': Value3} user_dictionary
Notes:
In Python, the dictionary type has built-in methods to access the dictionary keys and values.
.keys()
or .values()
after the dictionary object..keys()
and .values()
to a list by using the list()
method. Below when we print the unconverted keys, the first thing you see is dict_keys
, indicating the type of the data. Convert it to a list which is a simpler and more common data type. We can do this by passing the data into the list(...)
function.Example:
# Creating a dictionary
= {
person "name": "Alice",
"age": 30,
"city": "New York"
}
# Accessing values
print(person["name"]) # Output: Alice
print(person["age"]) # Output: 30
print(person["city"]) # Output: New York
# Adding a new key-value pair
"job"] = "Engineer"
person[
# Updating an existing value
"age"] = 31
person[
# Deleting a key-value pair
del person["city"]
# Printing the updated dictionary
print(person)
# Getting all keys
= person.keys()
keys
# Converting to list
=list(keys)
keys_listprint(keys) # Output: dict_keys(['name', 'age', 'job'])
print(keys_list) # Output: type list
# Getting all values
= person.values()
values print(values) # Output: dict_values(['Alice', 31, 'Engineer'])
# Converting to list
=list(values)
values_listprint(values_list) # Output: type list
print(type(values_list))
print(type(list(values)))
# Creating a nested dictionary
= {
person "name": "Alice",
"age": 30,
"address": {
"city": "New York",
"zipcode": "10001"
}
}
# Accessing elements in a nested dictionary
print(person["address"]["city"]) # Output: New York
print(person["address"]["zipcode"]) # Output: 10001
Alice
30
New York
{'name': 'Alice', 'age': 31, 'job': 'Engineer'}
dict_keys(['name', 'age', 'job'])
['name', 'age', 'job']
dict_values(['Alice', 31, 'Engineer'])
['Alice', 31, 'Engineer']
<class 'list'>
<class 'list'>
New York
10001
A list is an incredibly useful data structure in Python that can store any number of Python objects. Lists are denoted by the use of square brackets []
:
Syntax:
= [Value1, Value2, Value3] user_list
Example:
# Creating a list
= ["apple", "banana", "cherry", "date"]
fruits
# Adding a new element
"orange")
fruits.append(
# Updating an existing element
1] = "blueberry"
fruits[
# Deleting an element
del fruits[2]
# Printing the updated list
print(fruits)
# Getting the length
print(len(fruits))
= ["apple", "banana", "cherry", "date"]
fruits
# Accessing elements
print(fruits[0]) # Output: apple
print(fruits[1]) # Output: banana
print(fruits[2]) # Output: cherry
# Accessing elements by negative index
print(fruits[-1]) # Output: date
print(fruits[-2]) # Output: cherry
print(fruits[-3]) # Output: banana
print(fruits[-4]) # Output: apple
# Accessing a range of elements
print(fruits[1:3]) # Output: ['banana', 'cherry']
print(fruits[:2]) # Output: ['apple', 'banana']
print(fruits[2:]) # Output: ['cherry', 'date']
print(fruits[:]) # Output: ['apple', 'banana', 'cherry', 'date']
# Creating a nested list
= [["apple", "banana"], ["cherry", "date"]]
nested_list
# Accessing elements in a nested list
print(nested_list[0][0]) # Output: apple
print(nested_list[0][1]) # Output: banana
print(nested_list[1][0]) # Output: cherry
print(nested_list[1][1]) # Output: date
['apple', 'blueberry', 'date', 'orange']
4
apple
banana
cherry
date
cherry
banana
apple
['banana', 'cherry']
['apple', 'banana']
['cherry', 'date']
['apple', 'banana', 'cherry', 'date']
apple
banana
cherry
date
List computations example:
# Creating a list of numbers
= [3.5, 1.2, 6.8, 2.4, 5.1]
numbers
# Finding the minimum value
= min(numbers)
min_value print("Minimum value:", min_value) # Output: Minimum value: 1.2
# Finding the maximum value
= max(numbers)
max_value print("Maximum value:", max_value) # Output: Maximum value: 6.8
# Summing all elements in the list
= sum(numbers)
total_sum print("Sum of all elements:", total_sum) # Output: Sum of all elements: 19.0
# A more complicated example - this is a list comprehension - more on this later.
# Rounding each element to the nearest integer
= [round(num) for num in numbers]
rounded_numbers print("Rounded numbers:", rounded_numbers) # Output: Rounded numbers: [4, 1, 7, 2, 5]
Minimum value: 1.2
Maximum value: 6.8
Sum of all elements: 19.0
Rounded numbers: [4, 1, 7, 2, 5]
in
operatorThe in
operator in Python is used to check for the presence of an element within a collection, such as a list, tuple, set, or dictionary. (Tuples and sets are other data structures we have not learnt about yet.) When used with lists or other sequences, it checks if a specific value is contained in the sequence and returns True
if it is, and False
otherwise. When used with dictionaries, the in
operator checks for the presence of a specified key. If the key exists in the dictionary, it returns True
; otherwise, it returns False
. This operator provides a simple and readable way to perform membership tests in various data structures.
Example:
# Creating a dictionary
= {
person "name": "Alice",
"age": 30,
"city": "New York"
}
# Using the 'in' operator to check for a key
print("name" in person) # Output: True
print("job" in person) # Output: False
# Using the 'in' operator to check for a value
print("Alice" in person.values()) # Output: True
print("Engineer" in person.values()) # Output: False
# Creating a list
= ["apple", "banana", "cherry"]
fruits
# Using the 'in' operator to check for an element
print("banana" in fruits) # Output: True
print("orange" in fruits) # Output: False
True
False
True
False
True
False
One control flow element in Python is the for loop.
for
loop allows one to execute the same statements over and over again (i.e. looping).Syntax:
for iterator_variable in some_sequence:
statements(s)
The for
loop iterates over some_sequence
and performs statements(s)
at each iteration.
iterator_variable
is updated to the next value in some_sequence
.Example:
for i in [1,2,3,4]:
print(i*i)
1
4
9
16
for
loop will print to the screen four times; that is it will print 1
on the first iteration of the loop, 4
on the second iteration, 9
on the third, and 16
on the fourth.for
loop statement will iterate over all the elements of the list [1,2,3,4]
, and at each iteration it updates the iterator variable i
to the next value in the list [1,2,3,4]
.for
loops, it is an extremely good habit to choose an iterator variable that provides context rather than a random letter.i
whenever possible for ease of communication.A list comprehension is a concise way to create a new list by applying an expression to each element of an existing list while optionally filtering elements based on a condition. It combines loops and conditional statements into a single line of code, making it efficient and readable for creating lists with specific transformations or filters. It can be used to replace a for loop with shorter code.
Example:
# Using a for loop
= [1, 2, 3, 4, 5]
numbers = []
squared_numbers for num in numbers:
** 2)
squared_numbers.append(num print("Squared numbers (using for loop):", squared_numbers)
# Using list comprehension
= [num ** 2 for num in numbers]
squared_numbers print("Squared numbers (using list comprehension):", squared_numbers)
# Using a for loop
= [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
numbers = []
even_numbers for num in numbers:
if num % 2 == 0:
even_numbers.append(num)print("Even numbers (using for loop):", even_numbers)
# Using list comprehension
= [num for num in numbers if num % 2 == 0]
even_numbers print("Even numbers (using list comprehension):", even_numbers)
Squared numbers (using for loop): [1, 4, 9, 16, 25]
Squared numbers (using list comprehension): [1, 4, 9, 16, 25]
Even numbers (using for loop): [2, 4, 6, 8, 10]
Even numbers (using list comprehension): [2, 4, 6, 8, 10]
A boolean is a data type that represents one of two possible states: True
or False
. Booleans are crucial for controlling the flow of programs, making decisions, and executing code based on specific conditions being met or not. Booleans are used extensively for making decisions and comparisons. They are often the result of logical operations, such as comparisons (e.g., greater than, less than) or boolean operations (e.g., and, or, not).
As stated above, a boolean object can take on two values:
If statements are conditional statements that allow you to execute certain blocks of code based on specified conditions. They form the foundation of decision-making in code, enabling programs to make choices and take different actions depending on whether certain conditions are True or False.
Syntax:
if test_expression_1:
block1_statement(s)elif test_expression_2:
block2_statement2(s)else:
block3_statement(s)
Example:
# Example 0: A boolean
=2
x=7
yprint(True)
print(x==y)
print(x > 5)
# Example 1: Simple if statement
= 10
x
if x > 5:
print("x is greater than 5")
# Example 2: if-else statement
= 3
y
if y % 2 == 0:
print("y is even")
else:
print("y is odd")
# Example 3: if-elif-else statement
= 75
grade
if grade >= 90:
print("Grade is A")
elif grade >= 80:
print("Grade is B")
elif grade >= 70:
print("Grade is C")
elif grade >= 60:
print("Grade is D")
else:
print("Grade is F")
True
False
False
x is greater than 5
y is odd
Grade is C
Explanation:
Example 0: Creates different types of boolean variables.
Example 1: Checks if x
is greater than 5. If true, it prints “x is greater than 5”.
Example 2: Checks if y
is even (remainder of division by 2 is zero). If true, it prints “y is even”; otherwise, it prints “y is odd”.
Example 3: Evaluates the value of grade and prints a corresponding grade based on the ranges specified using if, elif (else if), and else statements. This demonstrates chaining conditions to determine a grade based on numerical thresholds.
String formatting refers to the various techniques used to insert dynamic values into strings in a controlled and formatted manner. We cover .format
and f-strings
.
Syntax:
f"some text {expression1} more text {expression2} ..."
Syntax:
"some text {} more text {}".format(value1, value2)
Differences: - f-strings are more concise and readable. - .format() method offers more flexibility, such as specifying formatting options or reusing values.
Examples:
# Example using .format() method
= "Bob"
name = 25
age = "Hello, {}! You are {} years old.".format(name, age)
formatted_string print(formatted_string)
# Example using f-strings
= "Charlie"
name = 35
age = f"Hello, {name}! You are {age} years old."
formatted_string print(formatted_string)
Hello, Bob! You are 25 years old.
Hello, Charlie! You are 35 years old.
1.8 Comments and debugging
Comments are lines of code which begin with the
#
symbol. Nothing happens when you run these lines. Their purpose is to describe the code you have written, especially if it would be unclear to someone else reading it. You should be commenting your code as you go along. For example “these lines compute the average” or “These lines remove missing data” etc.We have seen that programs may have errors or bugs. The process of resolving bugs is known as debugging. Debugging can be a very frustrating activity. Please be prepared for this. Following, this Python textbook, there are three kinds of errors you may encouter:
print
feature.Debugging means to fix errors in your program or code. Debugging is a constant and necessary part of writing code, do not be surprised if you are spending most of your time debugging. In order to debug syntax and runtime errors, the first step is to read the error message.
For example if you run the following:
You will get the following error message:
You can see that there is an error pointing to the second line
fruits[3]
. That is the line where the program encountered an error. The error it encountered is displayed at the bottom. “IndexError: list index out of range.”We can use the error for clues as to what could be going wrong. In this case, out of range means the index is too large. Inspecting the code further, you can see that Python indexing starts at 0, and so to access the third element of the list, we use the index 2.
If you do not understand the error message, you should Google search the error message to find out what it means. You can also use tools such as StackOverflow threads and ChatGPT to help you debug your code. When you do not know what is causing the error, it can be helpful to go through the code line by line, printing the output of each variable as you go, to see where the program is going wrong.