Professional Documents
Culture Documents
“Just as electricity transformed almost everything 100 years ago, today I actually have a hard time
thinking of an industry that I don’t think AI (Artificial Intelligence) will transform in the next
few years.”
-Andrew NG
Machine learning has evolved from the study of computational learning theory
and pattern recognition. It is the most effective method used in the field of data
analytics to predict something by devising some models and algorithms. These
analytical models allow researchers, engineers, data scientists and analysts to
produce reliable and valid results and decisions. It also helps to discover some
hidden patterns or features through historical learning's and trends in data.
Feature selection is the most important task of machine learning. Model is
created based on the results gathered from the training data that is why machine-
learning algorithms are non-interactive. It studies the past observations to make
precise predictions. It is a very difficult task to make an accurate prediction rule
based on which algorithm can be developed.
Mathematical Notations Used
Vectors are denoted by lower case bold Roman letters such as x, and all vectors are
assumed to be column vectors. A superscript T denotes the transpose of a matrix or
vector, so that 𝑥 𝑇 will be a row vector. Uppercase bold roman letters, such as M,
denote matrices. The notation (w1, . . ., wM) denotes a row vector with M elements,
while the corresponding column vector is written as w = (w1, . . ., wM)T.
The notation [a, b] is used to denote the closed interval from a to b, that is the interval
including the values a and b themselves, while (a, b) denotes the corresponding open
interval, that is the interval excluding a and b. Similarly, [a, b) denotes an interval
that includes a but excludes b. For the most part, however, there will be little need to
dwell on such refinements as whether the end points of an interval are included or
not.
The M × M identity matrix (also known as the unit matrix) is denoted IM, which will
be abbreviated to I where there is no ambiguity about it dimensionality. It has
elements Iij that equal 1 if i = j and 0 if i ≠ j.
The notation g(x) = O(f(x)) denotes that |f(x)/g(x)| is bounded as x→∞. For instance,
if g(x) = 3x2 + 2, then g(x) = O(x2).
The science of learning plays a key role in the fields of machine learning, statistics,
datamining and artificial intelligence, intersecting with areas of engineering and other
disciplines. Here are some examples of learning problems:
• Predict whether a patient, hospitalized due to a heart attack, will have a second
heart attack. The prediction is to be based on demographic, diet and clinical
measurements for that patient.
• Predict the price of a stock in 6 months from now, based on company
performance measures and economic data.
• Identify the numbers in a handwritten ZIP code, from a digitized image.
• Estimate the amount of glucose in the blood of a diabetic person, from the
infrared absorption spectrum of that person’s blood.
• Identify the risk factors for prostate cancer, based on clinical and
demographic variables.
2
Supervised Learning
In supervised learning, we are given a data set and already know what our correct
output should look like, having the idea that there is a relationship between the input
and the output.
Supervised learning problems are categorized into "regression" and "classification"
problems. In a regression problem, we are trying to predict results within a
continuous output, meaning that we are trying to map input variables to some
continuous function. In a classification problem, we are instead trying to predict
results in a discrete output. In other words, we are trying to map input variables into
discrete categories.
Example:
(a) Regression - Given a picture of a person, we have to predict their age on the basis of the
given picture,
(b) Classification - Given a patient with a tumor, we have to predict whether the tumor is
malignant or benign.