Machine Learning¶

1. Introduction to Machine Learning¶

Prof. Iacopo Masi

Introduction and administrative stuff¶

👨🏼‍🏫 About Me¶

Associate Professor at Sapienza since late 2020
Adjunct Research Assistant Professor at the University of Southern California (USC), Los Angeles, until August 2022
Worked as a Research Scientist on large DARPA projects (U.S. Department of Defense)

My Background:
- Computer Vision
- Machine Learning

⏰ Course Schedule¶

- Tuesday, 1pm - 3pm (2 hours)¶

- Thursday, 1pm - 4pm (3 hours)¶

From February 27 to May 30 (one-week break for the Easter vacation)¶

📽 Lecture Modality¶

Lectures are in person only
No recordings
Content:
- Theoretical sessions (yes, you have to know the math behind it!)
- Interleaved with practical sessions (including how to make it computable!)
- With some cool applications (have fun!)

📽 Lecture Modality¶

Where: Aula 3 De Lollis
Forum: We will use this Google Classroom

📝 Course Material & Interaction¶

Google Classroom (Very Important):¶

Material uploaded before every lecture (if time permits)
Use Google Classroom for most communication with course staff, including private messages
Ask questions about logistics, homework, etc.
Very important: write it down now!

Code to enter classroom: nr6h4g7

classroom.google.com/c/MjEzMzg1MTcyMjda?cjc=nr6h4g7

📝 Course Material & Interaction¶

Github Website (Public for everyone)
Our private classroom: I will mainly use it to send you notifications

📖 Course Material & Textbook¶

Slides and material will be uploaded before every lecture on Google Classroom.
- A good starting point, but they may not be enough.
- Textbooks are required.

Topic	Authors	Book	Difficulty
Generic ML	H. Daumé III	"A Course in Machine Learning", download the book	Easy
Generic ML	Christopher M. Bishop	“Pattern Recognition and Machine Learning” download the book	Difficult

The course is inspired by, and largely follows, Stanford's CS229; other material is inspired by other courses

📚 Textbooks¶

There is no single textbook, but the suggested ones are:

Topic	Authors	Book
Generic ML	H. Daumé III	"A Course in Machine Learning", download the book
Generic ML	Christopher M. Bishop	“Pattern Recognition and Machine Learning” download the book
Generic ML	Kevin P. Murphy	“Probabilistic Machine Learning: An Introduction”, MIT Press, 2021
Deep Learning	Ian Goodfellow, Yoshua Bengio, and Aaron Courville	“Deep Learning”, MIT Press, 2016
Deep Learning	Aston Zhang, Zack C. Lipton, Mu Li, Alex J. Smola	“Dive into Deep Learning”
Deep Learning	Simone Scardapane (uniroma1!)	Alice's Adventures in a Differentiable Wonderland

You can find most of these, or parts of them, online.

Yet Another Text Book¶

The authors keep the PDF freely available. Check the license to see whether you can print it, though!

Scardapane's Book¶

📚 How to study¶

Use my slides! Most of the questions/answers in the exam will be coming out from my slides or a remix of them.
If you do not understand the slide, search for a matching chapter in one of the books I mentioned.
Watch again and again the lecture in the part that is not clear.

🙏🏼 Credits¶

Credits: This program and material was inspired by the following courses:

💰 Exam (your payback) - Warning: changed from last year!¶

Written exam (a quiz, designed to simulate open questions)
- Grade range $\in [0,\ldots,17]$
- 17 points = 15 points + 2 bonus points.
The other 17 points come from the Unit I exam
The final grade can reach up to 34
We cannot register the grade of a single Unit (AI&ML is one exam as a whole; the final score is inseparable).
Note: a Unit is passed if score >= 8.5 (18/30)

💰 Cum Laude¶

You attain 30+L if:

AND(round(Unit1) >= 15,round(Unit2) >= 15,
    round(Unit1+Unit2 grade) >= 32)

round means 0.3 is 0 and 0.9 is 1 (the threshold is at 0.5)

So for example if you attain 14.5 at Unit 1, you have to score 17/17 to Unit 2 hence you get 14.5+17 = 31.5 --> round(31.5)=32
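
The rule above can be sketched in a few lines of Python. This is only an illustration of the stated conditions, not an official grading script; the helper names `round_half_up` and `cum_laude` are made up for this example. Note that Python's built-in `round()` rounds halves to the nearest even integer, so a custom rounding with the threshold at 0.5 is used instead.

```python
import math

def round_half_up(score: float) -> int:
    """Round with the threshold at 0.5 (0.3 -> 0, 0.5 -> 1, 0.9 -> 1)."""
    # Python's built-in round() rounds halves to the nearest even integer
    # (round(14.5) == 14), which would not match the rule on the slide.
    return math.floor(score + 0.5)

def cum_laude(unit1: float, unit2: float) -> bool:
    """AND(round(Unit1) >= 15, round(Unit2) >= 15, round(Unit1 + Unit2) >= 32)."""
    return (
        round_half_up(unit1) >= 15
        and round_half_up(unit2) >= 15
        and round_half_up(unit1 + unit2) >= 32
    )

print(cum_laude(14.5, 17.0))  # the worked example above: 31.5 rounds to 32
print(cum_laude(15.0, 16.0))  # both units reach 15, but the sum is only 31
```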

Note: a Unit is passed if score >= 8.5 (18/30)

💰 Exam (your payback)¶

Sum of the grade of Unit I with the grade of Unit II

**Advice:** ML is widespread now. **Do not study this course just to pass the exam.** **Find internal motivation to do it.**

*Establish me as a scientist in AI, help neural scientists to understand how the brain works using AI.*

💰 Exam: Caveat - [especially for Erasmus students]¶

- Sum of the grade of Unit I with the grade of Unit II
- We **CANNOT** record just a single Unit on Infostud!

🎯Course Objectives¶

Introducing you to the basic principles of Machine Learning
Knowledge of the main learning modalities (supervised, unsupervised, parametric/non-parametric)
Knowledge of the strengths and weaknesses of the main ML algorithms (no free lunch theorem)
Develop awareness of the mathematical tools behind them.
Setting strong foundations for more advanced courses (e.g., Deep Learning)
Develop critical thinking/raise the next generation of scientists
Show a few cool, practical applications

Good to know¶

No mandatory requirements, but these math tools come in handy:

Linear algebra: vector/matrix manipulations (geometry in high dimensions)
Calculus: partial derivatives (cost function, gradients)
Probability: common distributions; Bayes' rule (learn NOT to think deterministically)
Statistics: mean/median/mode; maximum likelihood

We will review these in the first lectures¶

👩🏾‍💻Technology is power (toolsets to use)¶

Toolsets:¶

Python (widely used in ML)
NumPy (matrix manipulation and linear algebra): I will cover the basics in the course
scikit-learn (basic ML): we will try to avoid it and use our own code as much as possible
PyTorch (automatic differentiation and neural nets): basic concepts

You may be covering these in the AI Lab class, so I will not go into much detail.

👩🏾‍💻Technology is power (toolset to use)¶

Install a Python 3.8 environment 🐍 with:¶

python 3.8+
numpy
scikit-learn
matplotlib
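
Once the environment is set up, a quick sanity check along these lines may help. It is a minimal sketch: the function name `check_environment` is made up for this example, and it only tests whether each package can be found, not whether its version is recent enough.

```python
import sys
from importlib import util

# Import names of the packages listed above (scikit-learn is imported as "sklearn")
REQUIRED = ["numpy", "sklearn", "matplotlib"]

def check_environment(min_python=(3, 8)):
    """Return a dict mapping each requirement to True/False (found or not)."""
    report = {"python": sys.version_info >= min_python}
    for name in REQUIRED:
        # find_spec returns None when the package cannot be found
        report[name] = util.find_spec(name) is not None
    return report

for name, ok in check_environment().items():
    print(f"{name:<12} {'OK' if ok else 'MISSING'}")
```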

ℹ️ Provisional Course Agenda at a glance¶

Topic	Hours
Intro to ML, Correlation and Learning Paradigms	6
Linear Algebra Foundations: Geometry of Linear Maps, NumPy Tensors, PCA	9
Dimensionality Reduction & Generative Models: PCA, SVD, Linear Generative Modeling, 3DMM	9
Probabilistic Modeling & Regression: MLE, Gaussian Mixture Models, Linear Regression	15
📝 Self-Assessment (1st part) 🤓	3
Easter Vacation	
📝 Mid-Term (post vacation)	3
Regularization & Optimization: L2 Regularization, Logistic Regression, Gradient Descent	9
Neural Networks & AutoDiff: MLP, AutoDiff (PyTorch/JAX), Signed Distance Functions	9
Implicit Representations & Diffusion Models: NeRFs, Flow, Score Matching, Diffusion	6
Exam Preparation and Final: Ask Me Anything + 📝 Final-Term	6
Total	66

Why use Machine Learning?¶

Everyone is using it now.... (Impact in applications)...¶

...but this is not a good answer.¶

We will get back to the answer later

Rise of AI¶

AI Job Landscape¶

AI Job Landscape - An example¶

An AI&ML student from a previous year, having studied hard for AI&ML (along with Unit I and AI Lab), was able to secure an internship with Hewlett-Packard Enterprise (HPE). The student told me that:

they were selected among 40 candidates (1:40)
they were preferred over graduates (students who had already obtained their master's degree)
they will work with an international team

Quick History¶

AI in Science Fiction¶

Turing Test¶

The imitation game (based on language):

The interrogator (C) is unable to see players (A, B) and can communicate with them only through written notes
The interrogator tries to determine which player is a computer and which is a human

Let's do a VISUAL Turing test¶

Who believes this image is real?

What is AI (Informal)¶

J. McCarthy, who coined the term in 1956, defines AI as:

"the science and engineering of making intelligent machines"

A modern definition of AI:

"The ability of a digital computer or computer-controlled robot to perform tasks commonly associated with intelligent beings"

What is ML (Informal)¶

First definition in 1959 by Arthur Lee Samuel:

ML is the field of study that gives computers the ability to learn without being explicitly programmed.

Common definition (by Tom Mitchell):

ML is the study of computer algorithms that improve automatically through experience.

AI vs Machine Learning vs Deep Learning¶

Deep Learning $\subset$ Machine Learning $\subset$ AI

AI and beyond¶

Computer Vision, Robotics, NLP are all, in some sense, applications of AI to a domain.
vision = let machine see the world

Machine Learning¶

2. Correlation and Learning Paradigm¶

Iacopo Masi

Yes, but why use it?¶

To solve problems, but what kind of problems?¶

There are two types of problems.

1) Problems solvable using algorithms developed by humans with a set of rules:

As computer scientists (or mathematicians) we design an algorithm and write a program that encodes a set of rules that is useful to solve the problem

Algorithmic approach¶

An Example: Self Driving Cars¶

A self-driving car system uses dozens of components that include detection of cars, pedestrians, and other objects.

Credits: Cornell CS5785

Self Driving Cars: A Rule-Based Algorithm¶

One way to build a detection system is to write down rules.

Credits: Cornell CS5785

Pseudocode example for a rule-based classification system¶

def classify(obj):
    """Rule-based classifier; `obj` comes from a hypothetical camera API."""
    if obj.has_wheels():  # does the object have wheels?
        if len(obj.wheels) == 4:
            return "Car"  # four wheels => car
        elif len(obj.wheels) == 2:
            if obj.seen_from_back():
                return "Car"  # viewed from the back, a car shows 2 wheels
            else:
                return "Bicycle"  # normally, 2 wheels => bicycle
    return "Unknown"  # no wheels (or an unhandled count)? we don't know what it is

label = classify(camera.get_object())

Credits: Cornell CS5785

In practice, it's almost impossible for a human to specify all the edge cases.

Self Driving Cars: An ML Approach¶

The machine learning approach is to teach a computer how to do detection by showing it many examples of different objects.

No manual programming is needed: the computer learns what defines a pedestrian or a car on its own!

Credits: Cornell CS5785

2) Problems that are very hard to solve with a set of rules

As ML engineers/data scientists/research scientists, we design and optimize a model that learns patterns and extracts "rules" from data that are useful to solve the problem

The big difference is: instead of writing the algorithm, we write the optimization for the hypothesis.

ML approach¶

Machine Learning¶

To apply supervised learning, we define a dataset and a learning algorithm.

$$ \underbrace{\text{Dataset}}_\text{Features, Attributes, Targets} + \underbrace{\text{Learning Algorithm}}_\text{Model Class + Objective + Optimizer } \to \text{Predictive Model} $$

The output is a predictive model that maps inputs to targets. For instance, it can predict targets on new inputs.
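
As a rough illustration of the recipe "dataset + (model class + objective + optimizer) → predictive model", here is a minimal NumPy sketch. Everything in it is an assumption made for the example: the synthetic data generated from $y = 2x + 1$ plus noise, the choice of a linear model, the mean squared error objective, and plain gradient descent with a hand-picked learning rate.

```python
import numpy as np

rng = np.random.default_rng(0)

# Dataset: inputs x and targets y (synthetic, generated from y = 2x + 1 plus noise)
x = rng.uniform(-1.0, 1.0, size=200)
y = 2.0 * x + 1.0 + 0.1 * rng.normal(size=200)

# Model class: lines y_hat = w * x + b, parameterized by (w, b)
w, b = 0.0, 0.0

# Objective: mean squared error; optimizer: plain gradient descent
learning_rate = 0.1
for _ in range(2000):
    error = (w * x + b) - y
    grad_w = 2.0 * np.mean(error * x)   # d(MSE)/dw
    grad_b = 2.0 * np.mean(error)       # d(MSE)/db
    w -= learning_rate * grad_w
    b -= learning_rate * grad_b

def predict(x_new):
    """The predictive model: maps new inputs to predicted targets."""
    return w * x_new + b

print(f"learned w={w:.2f}, b={b:.2f}")
```

The learned parameters should land close to the values used to generate the data, which is the sense in which the model has "extracted the rule" from examples.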

Why not use a traditional algorithmic approach?¶

Impossibility to exactly formalize the problem (and so to give an algorithmic solution)
Presence of noise, uncertainty, too many variations in the data
High complexity in formulating a solution, i.e. it cannot be done manually
Lack of compiled knowledge with respect to the problem to be solved

Example: Write a program that recognizes faces (face recognition) over a closed set of identities¶

Very hard to exactly formalize the problem
Noise may be present and data may be ambiguous
Algorithmic approach: Store predefined image templates of the faces for the closed set of identities, then compare the pixels at each position (x,y) through a long chain of if/then/else rules...
ML approach: Learn a function that maps input images to an identity using prior data. We will soon see that learning $\approx$ optimizing.

Example: Face Recognition. Humans can do it, so why is it hard for machines?¶

No one trained humans (maybe "God"/evolution/X did...)
Can you recognize this face?
- ...but let's do it like the computer does it
- right, I forgot to zoom in

ML is widespread¶

You probably use ML dozens of times a day without even knowing it:

[Information Retrieval] A web search on Google works well because ML-based software has figured out how to rank pages
[Spam Filter/Classifier] Each time you check your e-mail, a spam filter that has learned to distinguish spam from non-spam e-mails is at work
[Face Recognition] When Facebook or Apple's photo application recognizes your friends in your pictures, that's also because of ML

...and useful in many tasks¶

Image/Text Retrieval¶

Recommendation Systems¶

Classification/Recognition¶

Is this a dog?¶

What about this?¶

Applications¶

Classification: Determine which discrete category the example belongs to
Recognizing patterns: Speech Recognition, Facial identity, etc
Recommender Systems: Noisy data, commercial pay-off (e.g., Amazon, Netflix).
Information retrieval: Find documents or images with similar content
Computer vision: detection, segmentation, depth estimation, optical flow, etc.
Robotics: perception, planning, autonomous driving (Tesla)
Learning to play games: AlphaGo, IBM Deep Blue
Recognizing anomalies: Unusual sequences of credit card transactions, the panic situation at an airport

Limits of Machine Learning¶

Causality vs Correlation
Noise in the data or in the labels
Datasets could have historical bias
In some cases, ML = blackbox that cannot explain why a prediction was made

Correlation¶

Graphics from Wikipedia

Graphics from [this link](https://wtmaths.com/correlation.html)

Measuring Correlation¶

X1	X2
0.1	45
0.1	65
0.2	28
0.3	76
0.5	55
0.6	48
0.9	64
1.1	41
1.5	30
1.8	52
1.8	75
1.9	35
2.1	42
2.2	65
3.0	30
3.6	71

Pearson Correlation Coefficient¶

$ \rho_{X,Y}= \frac{\operatorname{cov}(X,Y)}{\sigma_X \sigma_Y}$

where:

$ \operatorname{cov} $ is the covariance of the two series
$ \sigma_X $ is the standard deviation of $X $
$ \sigma_Y $ is the standard deviation of $ Y $

Covariance of two series¶

The formula for $\rho$ can be expressed in terms of mean and expectation.

$$\operatorname{cov}(X,Y) = \mathbb{E}[(X-\mu_X)(Y-\mu_Y)]$$

So Pearson correlation $\rho$ can also be written as:

$$\rho_{X,Y}=\frac{\mathbb{E}[(X-\mu_X)(Y-\mu_Y)]}{\sigma_X\sigma_Y}$$

Normalized Measure of the Covariance
Takes values in [-1,+1]
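
The definition can be applied directly to the sample table from the "Measuring Correlation" slide. The sketch below estimates the expectation with a sample mean and uses population standard deviations; the function name `pearson` is chosen for this example, and the result is cross-checked against NumPy's `np.corrcoef`.

```python
import numpy as np

# The (X1, X2) samples from the "Measuring Correlation" table
x1 = np.array([0.1, 0.1, 0.2, 0.3, 0.5, 0.6, 0.9, 1.1,
               1.5, 1.8, 1.8, 1.9, 2.1, 2.2, 3.0, 3.6])
x2 = np.array([45, 65, 28, 76, 55, 48, 64, 41,
               30, 52, 75, 35, 42, 65, 30, 71], dtype=float)

def pearson(x, y):
    """rho = E[(X - mu_X)(Y - mu_Y)] / (sigma_X * sigma_Y), estimated from samples."""
    cov = np.mean((x - x.mean()) * (y - y.mean()))
    return cov / (x.std() * y.std())   # np.std defaults to the population formula

rho = pearson(x1, x2)
print(f"rho = {rho:.3f}")
# Cross-check against NumPy's built-in estimate
print(np.isclose(rho, np.corrcoef(x1, x2)[0, 1]))
```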

Pearson Correlation Coefficient¶

The correlation coefficient ranges from −1 to 1.
A value of exactly $\pm 1$ implies that a linear equation describes the relationship between X and Y perfectly, with all data points lying on a line.
The correlation sign is determined by the regression slope: a value of +1 implies that all data points lie on a line for which Y increases as X increases, and vice versa for −1.
0 means that there is no linear dependency between variables
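
These three cases can be checked numerically. The sketch below uses hand-picked toy relationships (two straight lines and a parabola on a symmetric grid); the last case shows that a correlation near 0 rules out a linear dependency, not every dependency.

```python
import numpy as np

x = np.linspace(-1.0, 1.0, 101)

def rho(a, b):
    """Pearson correlation coefficient of two 1D arrays."""
    return np.corrcoef(a, b)[0, 1]

print(rho(x, 3.0 * x + 2.0))    # increasing line: close to +1
print(rho(x, -0.5 * x + 1.0))   # decreasing line: close to -1
print(rho(x, x ** 2))           # y depends on x, but not linearly: close to 0
```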

Pearson Correlation Coefficient Geometry¶

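$|\rho|$ reaches its maximum value of 1 when the absolute value of the numerator equals the denominator, i.e. $|\operatorname{cov}(X,Y)| = \sigma_X \sigma_Y$. Otherwise the absolute value of the covariance is always smaller than the product of the standard deviations.
The sign of the covariance tells you whether the data are correlated or anticorrelated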
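
One way to see the geometry, sketched here for $N$ samples under the assumption that the expectation is estimated by a sample mean: stack the centered samples into vectors $\tilde{\mathbf{x}} = \mathbf{x} - \mu_X \mathbf{1}$ and $\tilde{\mathbf{y}} = \mathbf{y} - \mu_Y \mathbf{1}$ in $\mathbb{R}^N$ (the tilde notation is introduced only for this note). Then

$$\rho_{X,Y} = \frac{\frac{1}{N}\tilde{\mathbf{x}}^\top \tilde{\mathbf{y}}}{\sqrt{\frac{1}{N}\|\tilde{\mathbf{x}}\|^2}\,\sqrt{\frac{1}{N}\|\tilde{\mathbf{y}}\|^2}} = \frac{\tilde{\mathbf{x}}^\top \tilde{\mathbf{y}}}{\|\tilde{\mathbf{x}}\|\,\|\tilde{\mathbf{y}}\|} = \cos\theta$$

where $\theta$ is the angle between the two centered vectors. By the Cauchy-Schwarz inequality, $|\tilde{\mathbf{x}}^\top \tilde{\mathbf{y}}| \le \|\tilde{\mathbf{x}}\|\,\|\tilde{\mathbf{y}}\|$, so $|\rho_{X,Y}| \le 1$, with equality exactly when the two centered vectors are collinear, i.e. when the points lie on a line.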

Now Interpret again the plot¶

Graphics from Wikipedia

Final Note: Estimation $\rightarrow$ Predictive Power for Future data¶

...but we have to be careful when predicting....

Final Note: The more Samples you have, the better you predict!¶

We will see what happens with ML when you have a low number of samples for training.
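
The effect of the sample size on an estimate can be simulated. The sketch below is a toy experiment under assumptions chosen for illustration: a 2D Gaussian with true correlation 0.6, and the average estimation error of $\rho$ over 200 repeated draws per sample size.

```python
import numpy as np

rng = np.random.default_rng(0)
true_rho = 0.6
cov = np.array([[1.0, true_rho], [true_rho, 1.0]])

def mean_abs_error(n_samples, n_trials=200):
    """Average |estimated rho - true rho| over repeated draws of n_samples points."""
    errors = []
    for _ in range(n_trials):
        data = rng.multivariate_normal(mean=[0.0, 0.0], cov=cov, size=n_samples)
        estimate = np.corrcoef(data[:, 0], data[:, 1])[0, 1]
        errors.append(abs(estimate - true_rho))
    return float(np.mean(errors))

for n in (10, 100, 1000):
    print(n, round(mean_abs_error(n), 3))
```

With few samples the estimate fluctuates widely from one draw to the next; the average error shrinks as the number of samples grows.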

Correlation DOES NOT imply Causation¶

Given two variables $A$ and $B$, suppose we see that as $A$ increases, $B$ increases as well:

they are positively correlated (the correlation could be spurious)
This is **NOT** a sufficient condition for causality: a causal link may or may not exist.
It could be that $B \rightarrow A$ or $A \rightarrow B$ (or even that each causes the other)
It could also be that another, unknown variable $C$ causes both: $C \rightarrow A$ and $C \rightarrow B$.
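
The common-cause case can be simulated. In this sketch, all modeling choices are assumptions made for illustration: $C$ is standard Gaussian, and $A$ and $B$ are each $C$ plus independent noise, so neither causes the other.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000

# Hidden common cause C; A and B each depend on C but not on each other
c = rng.normal(size=n)
a = c + 0.5 * rng.normal(size=n)
b = c + 0.5 * rng.normal(size=n)

corr_ab = np.corrcoef(a, b)[0, 1]
print(f"corr(A, B) = {corr_ab:.2f}")                 # strongly positive

# Remove the (here, known) contribution of C: what is left is unrelated noise
corr_residual = np.corrcoef(a - c, b - c)[0, 1]
print(f"corr(A - C, B - C) = {corr_residual:.2f}")   # near zero
```

In real data $C$ is usually unobserved, which is exactly why the correlation between $A$ and $B$ alone cannot settle the causal question.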

Graphics from [this link](https://sundaskhalid.medium.com/correlation-vs-causation-in-data-science-66b6cfa702f0)

Graphics: xkcd comic

Spurious Correlations 🤷🏽‍♀️¶

Check this link out

Inductive Bias: What We Know Before the Data Arrives¶

Let's play a learning "game"¶

Training data¶

Class A	Class B

Classify these images with A or B from left to right, top to bottom¶

Write down your answer, then I will ask a few answers¶

Test data¶

Answers?¶

parrot	squirrel	cat	penguin
A	B	B	A
A	A	B	B

~70% ABBA prediction (Inferred bird vs non bird)
~30% AABB (Inferred fly vs not fly)

This preference for one distinction (bird/non-bird) over another (fly/no-fly) is a bias that different human learners have.

In the context of machine learning, it is called inductive bias: in the absence of data that narrow down the relevant concept, what type of solutions are we more likely to prefer?

Inductive Reasoning vs Deductive Reasoning¶

Inductive reasoning is a method of reasoning in which a general principle is derived from a body of observations. It consists of making broad generalizations based on specific observations. The truth of the conclusion of an inductive argument is probable, based upon the evidence given ✅ (Unit II)
Deductive reasoning is the mental process of drawing deductive inferences. An inference is deductively valid if its conclusion follows logically from its premises (Unit I)

Inductive Learning¶

Most of the methods covered in this course are "inductive", as opposed to transductive.

Inductive Learning¶

Learn a model $\boldsymbol{\theta}$ on the training set (fix $\boldsymbol{\theta}$, throw away the training set)
Now, given a new unseen sample $\mathbf{x}^{\prime}$, use $\boldsymbol{\theta}$ to predict your result
Note: if you have multiple samples to test, each $\mathbf{x}^{\prime}$ is processed independently, one by one.
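
The three steps above can be sketched with a very simple model. The nearest-centroid classifier used here is chosen only for illustration (it is not singled out in the slides), and the toy data and the function names `fit` and `predict` are made up for the example.

```python
import numpy as np

def fit(X_train, y_train):
    """Learn theta: one mean vector (centroid) per class."""
    classes = np.unique(y_train)
    centroids = np.stack([X_train[y_train == c].mean(axis=0) for c in classes])
    return classes, centroids          # this pair plays the role of theta

def predict(theta, x_new):
    """Predict one unseen sample using theta only (no training data needed)."""
    classes, centroids = theta
    distances = np.linalg.norm(centroids - x_new, axis=1)
    return classes[np.argmin(distances)]

X_train = np.array([[0.0, 0.0], [0.2, 0.1], [1.0, 1.0], [0.9, 1.2]])
y_train = np.array([0, 0, 1, 1])

theta = fit(X_train, y_train)
del X_train, y_train                   # theta is fixed; the training set can go

# Each test sample is processed independently, one by one
for x_new in np.array([[0.1, 0.0], [1.1, 0.9]]):
    print(x_new, "->", predict(theta, x_new))
```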

Transductive Learning¶

Vapnik'98 - Learning by Transduction

Learning Paradigms¶

1. Supervised Learning (we have labels)¶

2. Unsupervised Learning (we do NOT have labels)¶

There are others: Reinforcement Learning/Active/Self Supervised Learning (not covered in this course)¶

Introduction to Supervised Learning¶

Assume that there is an unknown and complex generator $\mathcal{D}$ that provides input-output pairs $(\mathbf{x},y)$.

We model this unknown generating process as an unknown probability distribution $\mathcal{D}$ over input-output pairs $(\mathbf{x},y) \in \mathcal{X}\times \mathcal{Y}$.
Example: Pairs of images and a label as in the case of bird/non-bird
- $\mathbf{x}$ corresponds to the image;
- $y$ to the label

Supervised Learning¶

The most common approach to machine learning is supervised learning.

Image credit: DataFlair

Supervised Learning: Object Detection¶

We previously saw an example of supervised learning: object detection.

We start by collecting a dataset of labeled objects.
We train a model to output accurate predictions on this dataset.
If new data is similar to the training data, the model should be accurate on it as well.

Credits: Cornell CS5785

Applications of Supervised Learning¶

Many important applications of machine learning are supervised:

Classifying medical images.
Translating between pairs of languages.
Detecting objects in autonomous driving.

Supervised Learning¶

Given pairs $(\mathbf{x},y)$, we learn to predict the label of unseen input data.
- Classification: the output is a discrete value (category)
  - Binary Classification (0/1)
  - Multi-Class Classification (1...N)
- Regression: the output is a continuous value (real-valued output)

In practice, in a real-world problem no one has access to $\mathcal{D}$ because problems are too complex

Try to write a computer program to generate all possible natural images that you can find in the world. Is it easy?

Let's assume here that we have access to $\mathcal{D}$ as a python function get_prob_under_D(x,y) that takes as input a pair (x,y) and returns the probability of the pair under $\mathcal{D}$.

If so, we can define the Bayes optimal classifier as the classifier that:

for any test input $\mathbf{x}^{\prime}$, simply returns the $y^{\prime}$ that maximizes get_prob_under_D(x', y')
In other words, try all possible labels and return the label that yields the maximum probability.

\begin{equation} h(\mathbf{x}^{\prime}) = \arg\max_{y^{\prime} \in \mathcal{Y} } \mathcal{D}(\mathbf{x}^{\prime},y^{\prime}) \end{equation}
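
A minimal sketch of this classifier, assuming we really had `get_prob_under_D`. The toy distribution `TOY_D`, its probabilities, and the label list are invented purely for illustration; in a real problem this table is exactly what nobody has.

```python
# Toy stand-in for the unknown distribution D over (x, y) pairs.
# These probabilities are made up for illustration and sum to 1.
TOY_D = {
    ("has_feathers", "bird"): 0.40,
    ("has_feathers", "non-bird"): 0.05,
    ("has_fur", "bird"): 0.01,
    ("has_fur", "non-bird"): 0.54,
}
LABELS = ["bird", "non-bird"]

def get_prob_under_D(x, y):
    """Probability of the pair (x, y) under D (zero if the pair never occurs)."""
    return TOY_D.get((x, y), 0.0)

def bayes_optimal_classifier(x_new):
    """Try every label and return the one with maximum probability under D."""
    return max(LABELS, key=lambda y: get_prob_under_D(x_new, y))

print(bayes_optimal_classifier("has_feathers"))
print(bayes_optimal_classifier("has_fur"))
```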

Take away¶

The take-home message is that if someone gave you access to the "data distribution", forming an optimal classifier would be trivial.

Real world¶

Unfortunately, no one gave you the implementation of this distribution.

We need to figure out ways of learning the mapping from x to y given only access to a training set sampled from $\mathcal{D}$, rather than $\mathcal{D}$ itself.

Training set¶

\begin{equation} \underbrace{\{\mathbf{x}_i,y_i\}_{i=1}^N}_{\text{known}} \sim \underbrace{\mathcal{D}}_{\text{unknown}} \end{equation}

where:

$N$ is the number of training samples
the vector $\mathbf{x}$ is the input data
$y$ is the associated (scalar) label

Supervised Learning¶

Goal: given a training set with labels, learn a function chosen from a set of possible functions (a hypothesis $h$ from a hypothesis set $\mathcal{H}$)

$$h \in \mathcal{H}\text{ so that }h : \mathbf{x} \mapsto y$$

Output of the learning is $h(\cdot)$ that can be used to do prediction at test-time.

Prediction: Classification (discrete-valued) vs Regression (real-valued output)
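
To make "choosing $h$ from $\mathcal{H}$" concrete, here is a small sketch. The hypothesis set is assumed to be a grid of threshold classifiers on a scalar input, and the selection criterion is the training error; both choices, as well as the toy data, are made up for this example.

```python
import numpy as np

# Training set: scalar inputs x with binary labels y
x_train = np.array([0.5, 1.0, 1.5, 2.0, 3.0, 3.5, 4.0, 4.5])
y_train = np.array([0, 0, 0, 0, 1, 1, 1, 1])

def make_hypothesis(threshold):
    """One member of the hypothesis set H: predict 1 when x > threshold."""
    return lambda x: (np.asarray(x) > threshold).astype(int)

# Hypothesis set H: a finite grid of candidate thresholds
candidate_thresholds = np.linspace(0.0, 5.0, 51)

def training_error(threshold):
    """Fraction of training samples that the hypothesis gets wrong."""
    return np.mean(make_hypothesis(threshold)(x_train) != y_train)

# "Learning" = picking the hypothesis in H with the lowest training error
best_threshold = min(candidate_thresholds, key=training_error)
h = make_hypothesis(best_threshold)

print(h([1.2, 3.8]))   # predictions at test time
```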

Supervised Learning for our game¶

Cardinal Rule of Machine Learning¶

The cardinal rule of machine learning is: never touch your test data.

Ever! If that’s not clear enough:

Never ever touch your test data!

There is a specific validation set for that.

From the CIML book (H. Daumé III, "A Course in Machine Learning"):

Do not look at your test data. Even once. Even a tiny peek. Once you do that, it is not test data any more. Yes, perhaps your algorithm hasn’t seen it. But you have. And you are likely a better learner than your learning algorithm. Consciously or otherwise, you might make decisions based on whatever you might have seen. Once you look at the test data, your model’s performance on it is no longer indicative of its performance on future unseen data. This is simply because future data is unseen, but your “test” data no longer is.
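
In practice, the rule translates into splitting the data once, up front. The sketch below is one possible way to do it with NumPy; the function name `split_indices` and the 60/20/20 proportions are assumptions for the example, not a prescription from the course.

```python
import numpy as np

def split_indices(n_samples, val_fraction=0.2, test_fraction=0.2, seed=0):
    """Shuffle sample indices once and cut them into train/validation/test."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_samples)
    n_test = int(n_samples * test_fraction)
    n_val = int(n_samples * val_fraction)
    test_idx = order[:n_test]                  # set aside; used once, at the very end
    val_idx = order[n_test:n_test + n_val]     # used for model selection and tuning
    train_idx = order[n_test + n_val:]         # used to fit the model
    return train_idx, val_idx, test_idx

train_idx, val_idx, test_idx = split_indices(100)
print(len(train_idx), len(val_idx), len(test_idx))
```

All decisions (which model, which hyperparameters) are made by looking at the validation indices only; the test indices are consulted a single time to report the final performance.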

Unsupervised Learning¶

\begin{equation} \underbrace{\{\mathbf{x}_i\}_{i=1}^N}_{\text{known yet no labels}} \sim \underbrace{\mathcal{D}}_{\text{unknown}} \end{equation}

We do not have any labels paired with the data.
Create an internal representation of the input, capturing regularities/structure in data
- Examples: form clusters; extract features
- How do we know if a representation is good?

Unsupervised Learning¶

Here, we have a dataset without labels. Our goal is to learn something interesting about the structure of the data.

Image credit: DataFlair

Clustering (unsupervised)¶

Each column is the result of a clustering algorithm
The input data lives in a 2D space
Colors indicate the clustering results (which points should be considered together)

Image from [scikit-learn](https://scikit-learn.org/stable/modules/clustering.html#clustering)
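
As a hedged illustration of what one such clustering algorithm does, here is a bare-bones k-means (Lloyd's algorithm) written in NumPy. It is a sketch, not the scikit-learn implementation: the initialization is a plain random pick of data points, and the two-blob dataset is synthetic.

```python
import numpy as np

def kmeans(X, k, n_iters=50, seed=0):
    """Plain k-means (Lloyd's algorithm): returns (centroids, cluster index per point)."""
    rng = np.random.default_rng(seed)
    # Initialize the centroids with k distinct data points
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iters):
        # Assignment step: each point goes to its nearest centroid
        distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = distances.argmin(axis=1)
        # Update step: each centroid moves to the mean of its points
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels

# Two well-separated 2D blobs, no labels given
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.3, size=(50, 2)),
               rng.normal(5.0, 0.3, size=(50, 2))])
centroids, labels = kmeans(X, k=2)
print(np.round(centroids, 1))
```

The cluster indices play the role of the colors in the figure: they say which points should be considered together, without ever using a label.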

Classification as an example¶

Tools¶

We are going to use tools such as:

Base programming: Python

Matrix and array manipulation: NumPy

Basic ML methods: scikit-learn

Plotting and visualization: Matplotlib

The End¶

Thank you for your attention
