LING 572
Fei Xia
Week 2: 1/17/2013
Outline
Demo
kNN
Demo
ML algorithms: Naïve Bayes, decision stump, boosting, bagging, SVM, etc.
Task: a binary classification problem with only two features.
http://www.cs.technion.ac.il/~rani/LocBoost/
kNN and related methods
kNN
Locally weighted regression
Case-based reasoning

kNN
Training: record labeled instances as feature vectors
Test: for a new instance d,
find k training instances that are closest to d.
perform majority voting or weighted voting.
Properties:
A lazy classifier. No learning in the training stage.
Feature selection and distance measure are crucial.
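The training and test procedure above can be sketched in a few lines of Python. This is an illustrative implementation, not code from the course; the names (`euclidean`, `knn_classify`) and the choice of Euclidean distance are assumptions.

```python
import math
from collections import Counter

def euclidean(u, v):
    # distance between two feature vectors of equal length
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def knn_classify(train, d, k):
    """Training is just storing `train`, a list of (vector, label) pairs.
    At test time: find the k instances closest to d and majority-vote."""
    neighbors = sorted(train, key=lambda xy: euclidean(xy[0], d))[:k]
    votes = Counter(label for _, label in neighbors)
    return votes.most_common(1)[0][0]

# toy example: two classes in 2D
train = [((0.0, 0.0), 'A'), ((0.1, 0.2), 'A'),
         ((1.0, 1.0), 'B'), ((0.9, 1.1), 'B')]
print(knn_classify(train, (0.2, 0.1), k=3))   # two A-neighbors outvote one B
```

Note that all the work happens at test time: there is no training stage beyond storing the instances, which is what makes kNN a lazy classifier.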
The algorithm
Determine parameter K (the number of nearest neighbors)
For a new instance d: compute the distance from d to every training instance, take the K closest, and vote on the label
Issues
What's K?
How do we weight/scale/select features?
How do we combine instances by voting?
Picking K
Split the data into
Training data (true training data and validation data)
Dev data
Test data
Pick the K with the lowest error rate on the validation set
Use N-fold cross validation if the training data is small
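The K-selection step can be sketched as a simple search over candidate values, scoring each on a held-out validation set. This is a minimal illustration under assumed names (`pick_k`, `knn_classify`); a real setup would then retrain on all training data and report final numbers on the test set.

```python
import math
from collections import Counter

def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def knn_classify(train, d, k):
    neighbors = sorted(train, key=lambda xy: euclidean(xy[0], d))[:k]
    return Counter(y for _, y in neighbors).most_common(1)[0][0]

def pick_k(train, val, candidate_ks):
    """Return the K with the lowest error rate on the validation set."""
    def error_rate(k):
        wrong = sum(1 for x, y in val if knn_classify(train, x, k) != y)
        return wrong / len(val)
    return min(candidate_ks, key=error_rate)

train = [((0.0, 0.0), 'A'), ((0.1, 0.1), 'A'),
         ((1.0, 1.0), 'B'), ((1.1, 0.9), 'B')]
val = [((0.2, 0.2), 'A'), ((0.8, 0.9), 'B')]
best_k = pick_k(train, val, [1, 3])
```

With small training data, `error_rate` would instead average over N cross-validation folds rather than a single validation split.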
Feature weighting
Reweight dimension j by a weight wj:
Increasing or decreasing wj increases or decreases the influence of feature j on the distance
Setting wj to zero eliminates dimension j altogether
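Per-dimension weighting can be folded directly into the distance function. A minimal sketch (the name `weighted_euclidean` is illustrative):

```python
import math

def weighted_euclidean(u, v, w):
    """Euclidean distance with a weight w[j] on each dimension j.
    Larger w[j] increases feature j's influence; w[j] = 0 removes it."""
    return math.sqrt(sum(wj * (a - b) ** 2 for wj, a, b in zip(w, u, v)))

# uniform weights recover plain Euclidean distance (3-4-5 triangle)
print(weighted_euclidean((0, 0), (3, 4), (1, 1)))   # 5.0
# zeroing the second dimension: only the first coordinate matters
print(weighted_euclidean((0, 0), (3, 4), (1, 0)))   # 3.0
```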
Cosine
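The cosine measure compares the angle between two feature vectors rather than their distance; higher similarity means a closer neighbor. A minimal sketch:

```python
import math

def cosine(u, v):
    """Cosine similarity: 1.0 for parallel vectors, 0.0 for orthogonal ones.
    Assumes neither vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

print(cosine((1, 2), (2, 4)))   # parallel: 1.0
print(cosine((1, 0), (0, 1)))   # orthogonal: 0.0
```

Because cosine is a similarity, kNN with this measure takes the k instances with the *highest* scores rather than the lowest distances.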
Voting
Majority voting:
c* = arg max_c g(c), where g(c) is the number of the k neighbors with label c
Weighted voting: a weight w_i on each neighbor
c* = arg max_c Σ_i w_i δ(c, f_i(x))
where δ(c, f_i(x)) is 1 if f_i(x) = c and 0 otherwise
Weighted voting allows us to use more training examples:
e.g., wi = 1/dist(x, xi)
We can use all the training examples.
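With w_i = 1/dist(x, x_i), distant instances contribute almost nothing, so all training examples can safely vote. A minimal sketch of this scoring rule (the name `weighted_vote` and the epsilon guard are assumptions, not from the slides):

```python
import math
from collections import defaultdict

def weighted_vote(train, d):
    """c* = arg max_c sum_i w_i * delta(c, f_i(x)), with w_i = 1/dist(x, x_i).
    Every training instance votes; closer instances count more."""
    scores = defaultdict(float)
    for x, y in train:
        dist = math.sqrt(sum((a - b) ** 2 for a, b in zip(x, d)))
        scores[y] += 1.0 / (dist + 1e-9)   # epsilon guards against dist == 0
    return max(scores, key=scores.get)

train = [((0.0, 0.0), 'A'), ((0.1, 0.2), 'A'),
         ((1.0, 1.0), 'B'), ((0.9, 1.1), 'B')]
print(weighted_vote(train, (0.2, 0.1)))   # the two nearby A's dominate
```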
Strengths:
Conceptual simplicity
Efficiency at training time: no training
Handles multi-class problems naturally
Stability and robustness: averaging over k neighbors
Prediction accuracy: good when the training data is large
Weaknesses:
Efficiency at testing time: must calculate distances to all training instances
Theoretical validity
Sensitivity to irrelevant features
Distance metrics unclear for non-numerical/binary values
Extension: e.g., the Rocchio algorithm