尊敬的 微信汇率:1円 ≈ 0.046089 元 支付宝汇率:1円 ≈ 0.04618元 [退出登录]
SlideShare a Scribd company logo
HANDS-ON ONLINE TRAINING
ON DATA SCIENCE
KNOWLEDGE AND SKILLS FORUM
INTRODUCTION TO
MACHINE LEARNING
FAITHFUL ONWUEGBUCHE
OLUWASEUN ODEYEMI
AUGUSTINE OKOLIE
WHAT IS MACHINE
LEARNING
• Machine learning is a type of artificial
intelligence (AI) that enables computers to
learn and make decisions without being
explicitly programmed.
• Using algorithms that iteratively learn
from data, machine learning allows
computers to find hidden insights without
being explicitly programmed where to
look.
THINK AND LEARN LIKE A BABY
MACHINE
LEARNING VS.
STATISTICS
VS.
COMPUTER
SCIENCE
Machine Learning Statistics Computer Science
Objective
Focuses on learning from
data to make predictions or
decisions without being
explicitly programmed. It
prioritizes prediction
accuracy and
generalizability.
Aims to infer properties of
an underlying distribution
from a data sample. It
emphasizes understanding
and interpreting data and
probabilistic models.
Focuses on the
creation and
application of
algorithms to
manipulate, store, and
communicate digital
information.
Methodologie
s
Typically uses complex
models (like neural
networks) and large
amounts of data to train
models for prediction.
Utilizes both supervised
and unsupervised learning
methods.
Often employs simpler,
more interpretable
models. Focuses on
hypothesis testing,
experimental design,
estimation, and
mathematical analysis.
Involves algorithm
design, data
structures,
computation theory,
computer architecture,
software development,
and more.
Validation
Measures model
performance through
methods like cross-
validation and seeks to
improve generalization to
unseen data.
Validates models using
methods such as
confidence intervals, p-
values, and hypothesis
tests to quantify
uncertainty.
Uses formal methods
for verifying
correctness, analyzing
computational
complexity, and
proving algorithmic
bounds.
Primary
Concern
Creating models that can
learn from and make
decisions or predictions
based on data.
Drawing valid conclusions
and quantifying
uncertainty about
observed data and
underlying distributions.
Creating efficient
algorithms and data
structures to solve
computational
problems.
TYPES OF MACHINE LEARNING
Three Types of Problems
• Supervised
• Unsupervised
• Reinforcement
SUPERVISE
D
• Trained using labeled examples
• Desired output is known
• Methods include classification, regression, etc.
• Uses patterns to predict the values of the label on
additional unlabeled data
• Algorithms:
• Linear regression
• Logistic regression
• K-Nearest Neighbors (KNN)
• Decision Trees and Random Forests
• Support Vector Machines
• Naive Bayes
• Neural Networks
UNSUPERVIS
ED
• Used against data that has no historical labels
• Desired output is unknown
• Goal is to explore the data and find some
structure within the data
• Algorithms
• Anomaly detection
• K-means clustering
• Hierarchical clustering
• DBSCAN
• Principal Component Analysis (PCA)
• Neural Networks
REINFORCEMEN
T
• Algorithm discovers through trial and error
which actions yield the greatest rewards.
• Three primary components:
• the agent (the learner or decision maker),
• the environment (everything the agent
interacts with)
• actions (what the agent can do).
• Objective: the agent chooses actions that
maximize the expected reward over a given
amount of time.
• Algorithms
• Markov Decision Process
• Q-Learning
• Deep Q Network (DQN)
WHY USE IT?
• Machine learning based models can extract patterns from massive
amounts of data which humans cannot do because
• We cannot retain everything in memory or we cannot perform
obvious/redundant computations for hours and days to come up
with interesting patterns.
• “Humans can typically create one or two good models in a week;
machine learning can create thousands of models in a week” (Thomas
H. Davenport)
• Solve problems we simply could not before
USE CASES
• Email spam filter
• Recommendation systems
• Self driving car
• Finance
• Image Recognition
• Competitive machines
TYPICAL MACHINE LEARNING PROCESS
Source: INTRODUCING AZURE MACHINE
LEARNING, pg. 5
TO GIVE CREDIT, OR NOT TO GIVE CREDIT
• You are asked by your boss while working at Big Bank Inc. to
develop an automated decision maker on whether to give a
potential client credit or not.
WHAT IS THE QUESTION?
• What question are we trying to answer here? What problem are
we looking to solve?
SELECTING DATA
Feature
• An individual measurable property of a phenomenon being
observed
• Best found through industry experts
SELECTING DATA
Feature Extraction
• Feature extraction is a general term for methods of
constructing combinations of the variables to get around
certain problems while still describing the data with sufficient
accuracy.
• Analysis with a large number of variables generally requires a
large amount of memory and computation.
• Reducing the amount of resources required to describe a
large set of data
SELECTING DATA
PCA (Principal ComponentAnalysis)
• We have a huge list of different features
• Many of them will measure related properties and so will be
redundant
• Summarize with less features
PREPARING DATA
• Cleaning
• Units
• Missing Values
• Metadata
DEVELOPING MODEL
• What is the problem being solved?
• What is the goal of the model?
• Minimize error on the “training” data
• Training data is the data used to train the model (all of it but the part we
removed)
DEVELOPING MODEL
Linear Model
• Relationships are modeled using linear predictor functions
whose unknown model parameters are estimated from the data
• Complex way of saying the model draws a line between two
categories (classification) or to estimate a value (regression)
• Linear regression the most common form of linear model
DEVELOPING MODEL
Non-linear Model
• A nonlinear model describes nonlinear relationships in
experimental data
• The parameters can take the form of an exponential,
trigonometric, power, or any other nonlinear function
DEVELOPING
MODEL
Overfitting vs. Bias in Machine
Learning
Overfitting​ Bias​
Definition​
Overfitting occurs when a
model fits the data more
than is warranted. It
captures the noise along
with the underlying
pattern in the data.​
Bias is error from
erroneous assumptions in
the learning algorithm.
High bias can cause
an algorithm to miss
relevant relations between
features and
target outputs.​
Consequence​
Overfitting leads to a
smaller error on the
training data set but
a larger one on unseen
data, reducing the model's
ability to generalize.​
High bias often leads
to underfitting, where the
model oversimplifies the
data and doesn't capture
its complexity.​
Example​
Creating a very complex
decision tree that classifies
each
training instance perfectly,
but performs poorly on
unseen data.​
Fitting a quadratic dataset
using a linear model – the
model will consistently fail
to capture the
true relationship and make
errors.​
DEVELOPING MODEL
Overfitting vs. Bias in Machine
Learning
COMBINATIONS OF BIAS-VARIANCE
DEVELOPING MODEL
Rule Addition
• Minimize error on the “training” data
• AND make sure that error in the “unseen” data is close to error in
the “training” data
DEVELOPING A MODEL
• Keep it Simple
• Go for simpler models over more complicated models
• Generally, the fewer parameters that you have to tune the better
• Cross-Validation
• K-fold cross validation is a great way to estimate error on training data
• Regularization
• Can sometimes help penalize certain sources of overfitting.
• LASSO
• Forces the sum of the absolute value of coefficients to be less than a
fixed value
• Effectively choosing a simpler model
DEVELOPIN
G A MODEL
Data Snooping or Data Dredging
• It's a form of bias that arises when you make decisions based on
the same data you've used to train and test your model.
• “If a data set has affected any step in the learning process, its
ability to access the outcome has been compromised”
• Experimenting
• Reuse of the same data set to determine quality of model
• Once a data set has been used to test the performance of a
data set, it should be considered contaminated
Source: Learning From Data, pg. 1
INTERPRETING RESULTS
• Validation
• Cross validation
• Test set
• Once the test set has been used, you must find new data!

More Related Content

What's hot

Generative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdfGenerative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdf
Saeed Al Dhaheri
 
Programming with Python
Programming with PythonProgramming with Python
Programming with Python
Rasan Samarasinghe
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
Bill Liu
 
Python ppt
Python pptPython ppt
Python ppt
Mohita Pandey
 
Generative AI
Generative AIGenerative AI
Generative AI
Carlos J. Costa
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlow
Gene Leybzon
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
Loic Merckel
 
A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3
Ishan Jain
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
Keymate.AI
 
GenAI in Research with Responsible AI
GenAI in Researchwith Responsible AIGenAI in Researchwith Responsible AI
GenAI in Research with Responsible AI
Liming Zhu
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
David Rostcheck
 
Reproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorchReproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorch
Databricks
 
ChatGPT Use- Cases
ChatGPT Use- Cases ChatGPT Use- Cases
ChatGPT Use- Cases
Bluechip Technologies
 
Python PPT
Python PPTPython PPT
Python PPT
Edureka!
 
Python Class | Python Programming | Python Tutorial | Edureka
Python Class | Python Programming | Python Tutorial | EdurekaPython Class | Python Programming | Python Tutorial | Edureka
Python Class | Python Programming | Python Tutorial | Edureka
Edureka!
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
AdventureWorld5
 
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
Gabriel Rojas
 
What Are the Problems Associated with ChatGPT?
What Are the Problems Associated with ChatGPT?What Are the Problems Associated with ChatGPT?
What Are the Problems Associated with ChatGPT?
Windzoon Technologies
 
Best Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI Service
Kumton Suttiraksiri
 
Python Developer Roadmap 2023
Python Developer Roadmap 2023Python Developer Roadmap 2023
Python Developer Roadmap 2023
Simplilearn
 

What's hot (20)

Generative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdfGenerative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdf
 
Programming with Python
Programming with PythonProgramming with Python
Programming with Python
 
Feature Engineering for NLP
Feature Engineering for NLPFeature Engineering for NLP
Feature Engineering for NLP
 
Python ppt
Python pptPython ppt
Python ppt
 
Generative AI
Generative AIGenerative AI
Generative AI
 
Generative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlowGenerative AI Application Development using LangChain and LangFlow
Generative AI Application Development using LangChain and LangFlow
 
Introduction to LLMs
Introduction to LLMsIntroduction to LLMs
Introduction to LLMs
 
A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3A brief primer on OpenAI's GPT-3
A brief primer on OpenAI's GPT-3
 
LanGCHAIN Framework
LanGCHAIN FrameworkLanGCHAIN Framework
LanGCHAIN Framework
 
GenAI in Research with Responsible AI
GenAI in Researchwith Responsible AIGenAI in Researchwith Responsible AI
GenAI in Research with Responsible AI
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
 
Reproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorchReproducible AI using MLflow and PyTorch
Reproducible AI using MLflow and PyTorch
 
ChatGPT Use- Cases
ChatGPT Use- Cases ChatGPT Use- Cases
ChatGPT Use- Cases
 
Python PPT
Python PPTPython PPT
Python PPT
 
Python Class | Python Programming | Python Tutorial | Edureka
Python Class | Python Programming | Python Tutorial | EdurekaPython Class | Python Programming | Python Tutorial | Edureka
Python Class | Python Programming | Python Tutorial | Edureka
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
 
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
Prompt Engineering for AI and ML Technical Product Managers - 2023-08-21 00.5...
 
What Are the Problems Associated with ChatGPT?
What Are the Problems Associated with ChatGPT?What Are the Problems Associated with ChatGPT?
What Are the Problems Associated with ChatGPT?
 
Best Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI Service
 
Python Developer Roadmap 2023
Python Developer Roadmap 2023Python Developer Roadmap 2023
Python Developer Roadmap 2023
 

Similar to Introduction to Machine Learning

Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
GibDevs
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Niko Vuokko
 
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MAHIRA
 
Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning
CCG
 
Artificial Intelligence Approaches
Artificial Intelligence  ApproachesArtificial Intelligence  Approaches
Artificial Intelligence Approaches
Jincy Nelson
 
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
Improving AI Development - Dave Litwiller - Jan 11 2022 - PublicImproving AI Development - Dave Litwiller - Jan 11 2022 - Public
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
Dave Litwiller
 
Informs presentation new ppt
Informs presentation new pptInforms presentation new ppt
Informs presentation new ppt
Salford Systems
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
Roger Barga
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
Dr.DHANALAKSHMI SENTHILKUMAR
 
ML_Module_1.pdf
ML_Module_1.pdfML_Module_1.pdf
ML_Module_1.pdf
JafarHussain48
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
Subrat Panda, PhD
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
Si Krishan
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
SwatiTripathi44
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
Lucas Jellema
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptx
MurindanyiSudi1
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
Sri Ambati
 
lec1.ppt
lec1.pptlec1.ppt
lec1.ppt
SVasuKrishna1
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
Aun Akbar
 
02 Related Concepts
02 Related Concepts02 Related Concepts
02 Related Concepts
Valerii Klymchuk
 
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
DurgaDevi310087
 

Similar to Introduction to Machine Learning (20)

Choosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your needChoosing a Machine Learning technique to solve your need
Choosing a Machine Learning technique to solve your need
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
MACHINE LEARNING PRESENTATION (ARTIFICIAL INTELLIGENCE)
 
Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning Afternoons with Azure - Azure Machine Learning
Afternoons with Azure - Azure Machine Learning
 
Artificial Intelligence Approaches
Artificial Intelligence  ApproachesArtificial Intelligence  Approaches
Artificial Intelligence Approaches
 
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
Improving AI Development - Dave Litwiller - Jan 11 2022 - PublicImproving AI Development - Dave Litwiller - Jan 11 2022 - Public
Improving AI Development - Dave Litwiller - Jan 11 2022 - Public
 
Informs presentation new ppt
Informs presentation new pptInforms presentation new ppt
Informs presentation new ppt
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Machine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdfMachine Learning_Unit 2_Full.ppt.pdf
Machine Learning_Unit 2_Full.ppt.pdf
 
ML_Module_1.pdf
ML_Module_1.pdfML_Module_1.pdf
ML_Module_1.pdf
 
Machine Learning in the Financial Industry
Machine Learning in the Financial IndustryMachine Learning in the Financial Industry
Machine Learning in the Financial Industry
 
Machine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An IntroMachine Learning 2 deep Learning: An Intro
Machine Learning 2 deep Learning: An Intro
 
Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)Introduction to ML (Machine Learning)
Introduction to ML (Machine Learning)
 
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
The Art of Intelligence – A Practical Introduction Machine Learning for Orac...
 
Optimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptxOptimal Model Complexity (1).pptx
Optimal Model Complexity (1).pptx
 
H2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin LedellH2O World - Intro to Data Science with Erin Ledell
H2O World - Intro to Data Science with Erin Ledell
 
lec1.ppt
lec1.pptlec1.ppt
lec1.ppt
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
02 Related Concepts
02 Related Concepts02 Related Concepts
02 Related Concepts
 
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
MACHINE LEARNING INTRODUCTION DIFFERENCE BETWEEN SUOERVISED , UNSUPERVISED AN...
 

Recently uploaded

🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
rukmnaikaseen
 
Call Girls Hyderabad (india) ☎️ +91-7426014248 Hyderabad Call Girl
Call Girls Hyderabad  (india) ☎️ +91-7426014248 Hyderabad  Call GirlCall Girls Hyderabad  (india) ☎️ +91-7426014248 Hyderabad  Call Girl
Call Girls Hyderabad (india) ☎️ +91-7426014248 Hyderabad Call Girl
sapna sharmap11
 
Product Cluster Analysis: Unveiling Hidden Customer Preferences
Product Cluster Analysis: Unveiling Hidden Customer PreferencesProduct Cluster Analysis: Unveiling Hidden Customer Preferences
Product Cluster Analysis: Unveiling Hidden Customer Preferences
Boston Institute of Analytics
 
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
PsychoTech Services
 
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
Ak47
 
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
#kalyanmatkaresult #dpboss #kalyanmatka #satta #matka #sattamatka
 
Classifying Shooting Incident Fatality in New York project presentation
Classifying Shooting Incident Fatality in New York project presentationClassifying Shooting Incident Fatality in New York project presentation
Classifying Shooting Incident Fatality in New York project presentation
Boston Institute of Analytics
 
MySQL Notes For Professionals sttudy.pdf
MySQL Notes For Professionals sttudy.pdfMySQL Notes For Professionals sttudy.pdf
MySQL Notes For Professionals sttudy.pdf
Ananta Patil
 
A review of I_O behavior on Oracle database in ASM
A review of I_O behavior on Oracle database in ASMA review of I_O behavior on Oracle database in ASM
A review of I_O behavior on Oracle database in ASM
Alireza Kamrani
 
CAP Excel Formulas & Functions July - Copy (4).pdf
CAP Excel Formulas & Functions July - Copy (4).pdfCAP Excel Formulas & Functions July - Copy (4).pdf
CAP Excel Formulas & Functions July - Copy (4).pdf
frp60658
 
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
Douglas Day
 
Call Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
Call Girls Lucknow 8923113531 Independent Call Girl Service in LucknowCall Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
Call Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
hiju9823
 
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
AK47
 
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your DoorHyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
Russian Escorts in Delhi 9711199171 with low rate Book online
 
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance PaymentCall Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
prijesh mathew
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
davidpietrzykowski1
 
Do People Really Know Their Fertility Intentions? Correspondence between Sel...
Do People Really Know Their Fertility Intentions?  Correspondence between Sel...Do People Really Know Their Fertility Intentions?  Correspondence between Sel...
Do People Really Know Their Fertility Intentions? Correspondence between Sel...
Xiao Xu
 
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
PsychoTech Services
 
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENTHigh Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
ranjeet3341
 
PCI-DSS-Data Security Standard v4.0.1.pdf
PCI-DSS-Data Security Standard v4.0.1.pdfPCI-DSS-Data Security Standard v4.0.1.pdf
PCI-DSS-Data Security Standard v4.0.1.pdf
incitbe
 

Recently uploaded (20)

🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
🔥College Call Girls Kolkata 💯Call Us 🔝 8094342248 🔝💃Top Class Call Girl Servi...
 
Call Girls Hyderabad (india) ☎️ +91-7426014248 Hyderabad Call Girl
Call Girls Hyderabad  (india) ☎️ +91-7426014248 Hyderabad  Call GirlCall Girls Hyderabad  (india) ☎️ +91-7426014248 Hyderabad  Call Girl
Call Girls Hyderabad (india) ☎️ +91-7426014248 Hyderabad Call Girl
 
Product Cluster Analysis: Unveiling Hidden Customer Preferences
Product Cluster Analysis: Unveiling Hidden Customer PreferencesProduct Cluster Analysis: Unveiling Hidden Customer Preferences
Product Cluster Analysis: Unveiling Hidden Customer Preferences
 
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
Interview Methods - Marital and Family Therapy and Counselling - Psychology S...
 
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
9711199012⎷❤✨ Call Girls RK Puram Special Price with a special young
 
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
❻❸❼⓿❽❻❷⓿⓿❼KALYAN MATKA CHART FINAL OPEN JODI PANNA FIXXX DPBOSS MATKA RESULT ...
 
Classifying Shooting Incident Fatality in New York project presentation
Classifying Shooting Incident Fatality in New York project presentationClassifying Shooting Incident Fatality in New York project presentation
Classifying Shooting Incident Fatality in New York project presentation
 
MySQL Notes For Professionals sttudy.pdf
MySQL Notes For Professionals sttudy.pdfMySQL Notes For Professionals sttudy.pdf
MySQL Notes For Professionals sttudy.pdf
 
A review of I_O behavior on Oracle database in ASM
A review of I_O behavior on Oracle database in ASMA review of I_O behavior on Oracle database in ASM
A review of I_O behavior on Oracle database in ASM
 
CAP Excel Formulas & Functions July - Copy (4).pdf
CAP Excel Formulas & Functions July - Copy (4).pdfCAP Excel Formulas & Functions July - Copy (4).pdf
CAP Excel Formulas & Functions July - Copy (4).pdf
 
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
202406 - Cape Town Snowflake User Group - LLM & RAG.pdf
 
Call Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
Call Girls Lucknow 8923113531 Independent Call Girl Service in LucknowCall Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
Call Girls Lucknow 8923113531 Independent Call Girl Service in Lucknow
 
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
🔥Book Call Girls Lucknow 💯Call Us 🔝 6350257716 🔝💃Independent Lucknow Escorts ...
 
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your DoorHyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
Hyderabad Call Girls 7339748667 With Free Home Delivery At Your Door
 
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance PaymentCall Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
Call Girls Hyderabad ❤️ 7339748667 ❤️ With No Advance Payment
 
Salesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - CanariasSalesforce AI + Data Community Tour Slides - Canarias
Salesforce AI + Data Community Tour Slides - Canarias
 
Do People Really Know Their Fertility Intentions? Correspondence between Sel...
Do People Really Know Their Fertility Intentions?  Correspondence between Sel...Do People Really Know Their Fertility Intentions?  Correspondence between Sel...
Do People Really Know Their Fertility Intentions? Correspondence between Sel...
 
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
Essential Skills for Family Assessment - Marital and Family Therapy and Couns...
 
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENTHigh Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
High Profile Call Girls Navi Mumbai ✅ 9833363713 FULL CASH PAYMENT
 
PCI-DSS-Data Security Standard v4.0.1.pdf
PCI-DSS-Data Security Standard v4.0.1.pdfPCI-DSS-Data Security Standard v4.0.1.pdf
PCI-DSS-Data Security Standard v4.0.1.pdf
 

Introduction to Machine Learning

  • 1. HANDS-ON ONLINE TRAINING ON DATA SCIENCE KNOWLEDGE AND SKILLS FORUM INTRODUCTION TO MACHINE LEARNING FAITHFUL ONWUEGBUCHE OLUWASEUN ODEYEMI AUGUSTINE OKOLIE
  • 2. WHAT IS MACHINE LEARNING • Machine learning is a type of artificial intelligence (AI) that enables computers to learn and make decisions without being explicitly programmed. • Using algorithms that iteratively learn from data, machine learning allows computers to find hidden insights without being explicitly programmed where to look.
  • 3. THINK AND LEARN LIKE A BABY
  • 4. MACHINE LEARNING VS. STATISTICS VS. COMPUTER SCIENCE Machine Learning Statistics Computer Science Objective Focuses on learning from data to make predictions or decisions without being explicitly programmed. It prioritizes prediction accuracy and generalizability. Aims to infer properties of an underlying distribution from a data sample. It emphasizes understanding and interpreting data and probabilistic models. Focuses on the creation and application of algorithms to manipulate, store, and communicate digital information. Methodologie s Typically uses complex models (like neural networks) and large amounts of data to train models for prediction. Utilizes both supervised and unsupervised learning methods. Often employs simpler, more interpretable models. Focuses on hypothesis testing, experimental design, estimation, and mathematical analysis. Involves algorithm design, data structures, computation theory, computer architecture, software development, and more. Validation Measures model performance through methods like cross- validation and seeks to improve generalization to unseen data. Validates models using methods such as confidence intervals, p- values, and hypothesis tests to quantify uncertainty. Uses formal methods for verifying correctness, analyzing computational complexity, and proving algorithmic bounds. Primary Concern Creating models that can learn from and make decisions or predictions based on data. Drawing valid conclusions and quantifying uncertainty about observed data and underlying distributions. Creating efficient algorithms and data structures to solve computational problems.
  • 5. TYPES OF MACHINE LEARNING Three Types of Problems • Supervised • Unsupervised • Reinforcement
  • 6. SUPERVISE D • Trained using labeled examples • Desired output is known • Methods include classification, regression, etc. • Uses patterns to predict the values of the label on additional unlabeled data • Algorithms: • Linear regression • Logistic regression • K-Nearest Neighbors (KNN) • Decision Trees and Random Forests • Support Vector Machines • Naive Bayes • Neural Networks
  • 7. UNSUPERVIS ED • Used against data that has no historical labels • Desired output is unknown • Goal is to explore the data and find some structure within the data • Algorithms • Anomaly detection • K-means clustering • Hierarchical clustering • DBSCAN • Principal Component Analysis (PCA) • Neural Networks
  • 8. REINFORCEMEN T • Algorithm discovers through trial and error which actions yield the greatest rewards. • Three primary components: • the agent (the learner or decision maker), • the environment (everything the agent interacts with) • actions (what the agent can do). • Objective: the agent chooses actions that maximize the expected reward over a given amount of time. • Algorithms • Markov Decision Process • Q-Learning • Deep Q Network (DQN)
  • 9. WHY USE IT? • Machine learning based models can extract patterns from massive amounts of data which humans cannot do because • We cannot retain everything in memory or we cannot perform obvious/redundant computations for hours and days to come up with interesting patterns. • “Humans can typically create one or two good models in a week; machine learning can create thousands of models in a week” (Thomas H. Davenport) • Solve problems we simply could not before
  • 10. USE CASES • Email spam filter • Recommendation systems • Self driving car • Finance • Image Recognition • Competitive machines
  • 11. TYPICAL MACHINE LEARNING PROCESS Source: INTRODUCING AZURE MACHINE LEARNING, pg. 5
  • 12. TO GIVE CREDIT, OR NOT TO GIVE CREDIT • You are asked by your boss while working at Big Bank Inc. to develop an automated decision maker on whether to give a potential client credit or not.
  • 13. WHAT IS THE QUESTION? • What question are we trying to answer here? What problem are we looking to solve?
  • 14. SELECTING DATA Feature • An individual measurable property of a phenomenon being observed • Best found through industry experts
  • 15. SELECTING DATA Feature Extraction • Feature extraction is a general term for methods of constructing combinations of the variables to get around certain problems while still describing the data with sufficient accuracy. • Analysis with a large number of variables generally requires a large amount of memory and computation. • Reducing the amount of resources required to describe a large set of data
  • 16. SELECTING DATA PCA (Principal ComponentAnalysis) • We have a huge list of different features • Many of them will measure related properties and so will be redundant • Summarize with less features
  • 17. PREPARING DATA • Cleaning • Units • Missing Values • Metadata
  • 18. DEVELOPING MODEL • What is the problem being solved? • What is the goal of the model? • Minimize error on the “training” data • Training data is the data used to train the model (all of it but the part we removed)
  • 19. DEVELOPING MODEL Linear Model • Relationships are modeled using linear predictor functions whose unknown model parameters are estimated from the data • Complex way of saying the model draws a line between two categories (classification) or to estimate a value (regression) • Linear regression the most common form of linear model
  • 20. DEVELOPING MODEL Non-linear Model • A nonlinear model describes nonlinear relationships in experimental data • The parameters can take the form of an exponential, trigonometric, power, or any other nonlinear function
  • 21. DEVELOPING MODEL Overfitting vs. Bias in Machine Learning Overfitting​ Bias​ Definition​ Overfitting occurs when a model fits the data more than is warranted. It captures the noise along with the underlying pattern in the data.​ Bias is error from erroneous assumptions in the learning algorithm. High bias can cause an algorithm to miss relevant relations between features and target outputs.​ Consequence​ Overfitting leads to a smaller error on the training data set but a larger one on unseen data, reducing the model's ability to generalize.​ High bias often leads to underfitting, where the model oversimplifies the data and doesn't capture its complexity.​ Example​ Creating a very complex decision tree that classifies each training instance perfectly, but performs poorly on unseen data.​ Fitting a quadratic dataset using a linear model – the model will consistently fail to capture the true relationship and make errors.​
  • 22. DEVELOPING MODEL Overfitting vs. Bias in Machine Learning
  • 24. DEVELOPING MODEL Rule Addition • Minimize error on the “training” data • AND make sure that error in the “unseen” data is close to error in the “training” data
  • 25. DEVELOPING A MODEL • Keep it Simple • Go for simpler models over more complicated models • Generally, the fewer parameters that you have to tune the better • Cross-Validation • K-fold cross validation is a great way to estimate error on training data • Regularization • Can sometimes help penalize certain sources of overfitting. • LASSO • Forces the sum of the absolute value of coefficients to be less than a fixed value • Effectively choosing a simpler model
  • 26. DEVELOPIN G A MODEL Data Snooping or Data Dredging • It's a form of bias that arises when you make decisions based on the same data you've used to train and test your model. • “If a data set has affected any step in the learning process, its ability to access the outcome has been compromised” • Experimenting • Reuse of the same data set to determine quality of model • Once a data set has been used to test the performance of a data set, it should be considered contaminated Source: Learning From Data, pg. 1
  • 27. INTERPRETING RESULTS • Validation • Cross validation • Test set • Once the test set has been used, you must find new data!

Editor's Notes

  1. Coin: that’s a quarter, that’s a dime Unsupervised: That’s a cluster, that’s another cluster Renforcement:
  2. Unsupervised learning works well on transactional data. For example, it can identify segments of customers with similar attributes who can then be treated similarly in marketing campaigns. Or it can find the main attributes that separate customer segments from each other. Popular techniques include self-organizing maps, nearest-neighbor mapping, k-means clustering and singular value decomposition. These algorithms are also used to segment text topics, recommend items and identify data outliers.
  3. The agent will reach the goal much faster by following a good policy. So the goal in reinforcement learning is to learn the best policy.
  4. In WWII, Allied bombers were key to strategic attacks, yet these lumbering giants were constantly shot down over enemy territory. The planes needed more armor, but armor is heavy. So extra plating could only go where the planes were being shot the most. A man named Abraham Wald, a Jewish mathematician who’d been locked out of university positions and ultimately fled the persecution in his own home country of Hungary, was brought in to oversee the operation. He started with a simple diagram—the outline of a plane—and he marked bullet holes corresponding to where each returning bomber had been shot. The result was the anatomy of common plane damage. The wings, nose, and tail were blackened with bullet holes, so these were the spots that needed more armor.
  5. Draw on board
  6. Draw example on board, why ever use linear then?
  7. Draw example on board
  8. More data
  翻译: