Spaces:
Paused
Paused
File size: 14,073 Bytes
7c91632 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 225 226 227 228 229 230 231 232 233 234 235 236 237 238 239 240 241 242 243 244 245 246 247 248 249 250 251 252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 464 465 466 467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 | WEBVTT - Subtitles by: DownloadYoutubeSubtitles.com
00:00:00.240 --> 00:00:03.760
we know humans learn from their past
00:00:02.320 --> 00:00:05.680
experiences
00:00:03.760 --> 00:00:07.359
and machines follow instructions given
00:00:05.680 --> 00:00:09.599
by humans
00:00:07.359 --> 00:00:11.519
but what if humans can train the
00:00:09.599 --> 00:00:14.000
machines to learn from the past data and
00:00:11.519 --> 00:00:15.839
do what humans can do and much faster
00:00:14.000 --> 00:00:17.760
well that's called machine learning but
00:00:15.839 --> 00:00:20.000
it's a lot more than just learning it's
00:00:17.760 --> 00:00:22.400
also about understanding and reasoning
00:00:20.000 --> 00:00:24.240
so today we will learn about the basics
00:00:22.400 --> 00:00:26.800
of machine learning
00:00:24.240 --> 00:00:28.800
so that's paul he loves listening to new
00:00:26.800 --> 00:00:30.880
songs
00:00:28.800 --> 00:00:33.120
he either likes them or dislikes them
00:00:30.880 --> 00:00:34.880
paul decides this on the basis of the
00:00:33.120 --> 00:00:36.000
song's tempo
00:00:34.880 --> 00:00:39.040
genre
00:00:36.000 --> 00:00:41.440
intensity and the gender of voice for
00:00:39.040 --> 00:00:44.559
simplicity let's just use tempo and
00:00:41.440 --> 00:00:47.680
intensity for now so here tempo is on
00:00:44.559 --> 00:00:50.320
the x axis ranging from relaxed to fast
00:00:47.680 --> 00:00:53.280
whereas intensity is on the y axis
00:00:50.320 --> 00:00:56.879
ranging from light to soaring we see
00:00:53.280 --> 00:00:59.840
that paul likes the song with fast tempo
00:00:56.879 --> 00:01:02.800
and soaring intensity while he dislikes
00:00:59.840 --> 00:01:05.280
the song with relaxed tempo and light
00:01:02.800 --> 00:01:07.360
intensity so now we know paul's choices
00:01:05.280 --> 00:01:10.720
let's say paul listens to a new song
00:01:07.360 --> 00:01:13.680
let's name it as song a song a has fast
00:01:10.720 --> 00:01:15.840
tempo and a soaring intensity so it lies
00:01:13.680 --> 00:01:17.759
somewhere here looking at the data can
00:01:15.840 --> 00:01:20.560
you guess whether paul will like the
00:01:17.759 --> 00:01:23.040
song or not correct so paul likes this
00:01:20.560 --> 00:01:25.119
song by looking at paul's past choices
00:01:23.040 --> 00:01:28.400
we were able to classify the unknown
00:01:25.119 --> 00:01:30.880
song very easily right let's say now
00:01:28.400 --> 00:01:33.439
paul listens to a new song let's label
00:01:30.880 --> 00:01:36.720
it as song b so song b
00:01:33.439 --> 00:01:39.439
lies somewhere here with medium tempo
00:01:36.720 --> 00:01:42.400
and medium intensity neither relaxed nor
00:01:39.439 --> 00:01:44.479
fast neither light nor soaring now can
00:01:42.400 --> 00:01:46.560
you guess whether paul likes it or not
00:01:44.479 --> 00:01:49.200
not able to guess whether paul will like
00:01:46.560 --> 00:01:52.159
it or dislike it are the choices unclear
00:01:49.200 --> 00:01:54.640
correct we could easily classify song a
00:01:52.159 --> 00:01:57.200
but when the choice became complicated
00:01:54.640 --> 00:01:59.119
as in the case of song b yes and that's
00:01:57.200 --> 00:02:01.920
where machine learning comes in let's
00:01:59.119 --> 00:02:04.240
see how in the same example for song b
00:02:01.920 --> 00:02:06.719
if we draw a circle around the song b we
00:02:04.240 --> 00:02:09.440
see that there are four votes for like
00:02:06.719 --> 00:02:11.760
whereas one would for dislike if we go
00:02:09.440 --> 00:02:13.440
for the majority votes we can say that
00:02:11.760 --> 00:02:15.120
paul will definitely like the song
00:02:13.440 --> 00:02:17.120
that's all this was a basic machine
00:02:15.120 --> 00:02:19.200
learning algorithm also it's called k
00:02:17.120 --> 00:02:21.599
nearest neighbors so this is just a
00:02:19.200 --> 00:02:24.319
small example in one of the many machine
00:02:21.599 --> 00:02:27.440
learning algorithms quite easy right
00:02:24.319 --> 00:02:29.840
believe me it is but what happens when
00:02:27.440 --> 00:02:31.760
the choices become complicated as in the
00:02:29.840 --> 00:02:33.920
case of song b that's when machine
00:02:31.760 --> 00:02:35.920
learning comes in it learns the data
00:02:33.920 --> 00:02:38.160
builds the prediction model and when the
00:02:35.920 --> 00:02:40.640
new data point comes in it can easily
00:02:38.160 --> 00:02:43.200
predict for it more the data better the
00:02:40.640 --> 00:02:45.360
model higher will be the accuracy there
00:02:43.200 --> 00:02:47.599
are many ways in which the machine
00:02:45.360 --> 00:02:49.599
learns it could be either supervised
00:02:47.599 --> 00:02:51.280
learning unsupervised learning or
00:02:49.599 --> 00:02:53.680
reinforcement learning let's first
00:02:51.280 --> 00:02:55.519
quickly understand supervised learning
00:02:53.680 --> 00:02:57.280
suppose your friend gives you one
00:02:55.519 --> 00:03:00.000
million coins of three different
00:02:57.280 --> 00:03:02.080
currencies say one rupee one euro and
00:03:00.000 --> 00:03:04.480
one dirham each coin has different
00:03:02.080 --> 00:03:07.120
weights for example a coin of one rupee
00:03:04.480 --> 00:03:09.519
weighs three grams one euro weighs seven
00:03:07.120 --> 00:03:11.440
grams and one dirham weighs four grams
00:03:09.519 --> 00:03:13.920
your model will predict the currency of
00:03:11.440 --> 00:03:16.400
the coin here your weight becomes the
00:03:13.920 --> 00:03:18.400
feature of coins while currency becomes
00:03:16.400 --> 00:03:21.040
the label when you feed this data to the
00:03:18.400 --> 00:03:23.680
machine learning model it learns which
00:03:21.040 --> 00:03:26.319
feature is associated with which label
00:03:23.680 --> 00:03:28.959
for example it will learn that if a coin
00:03:26.319 --> 00:03:30.560
is of 3 grams it will be a 1 rupee coin
00:03:28.959 --> 00:03:32.879
let's give a new coin to the machine on
00:03:30.560 --> 00:03:34.959
the basis of the weight of the new coin
00:03:32.879 --> 00:03:37.599
your model will predict the currency
00:03:34.959 --> 00:03:40.000
hence supervised learning uses labeled
00:03:37.599 --> 00:03:42.400
data to train the model here the machine
00:03:40.000 --> 00:03:44.159
knew the features of the object and also
00:03:42.400 --> 00:03:46.159
the labels associated with those
00:03:44.159 --> 00:03:47.760
features on this note let's move to
00:03:46.159 --> 00:03:49.760
unsupervised learning and see the
00:03:47.760 --> 00:03:51.440
difference suppose you have cricket data
00:03:49.760 --> 00:03:53.760
set of various players with their
00:03:51.440 --> 00:03:56.319
respective scores and wickets taken when
00:03:53.760 --> 00:03:58.640
you feed this data set to the machine
00:03:56.319 --> 00:04:00.959
the machine identifies the pattern of
00:03:58.640 --> 00:04:02.319
player performance so it plots this data
00:04:00.959 --> 00:04:04.799
with the respective wickets on the
00:04:02.319 --> 00:04:06.799
x-axis while runs on the y-axis while
00:04:04.799 --> 00:04:08.879
looking at the data you'll clearly see
00:04:06.799 --> 00:04:10.879
that there are two clusters the one
00:04:08.879 --> 00:04:13.280
cluster are the players who scored
00:04:10.879 --> 00:04:15.680
higher runs and took less wickets while
00:04:13.280 --> 00:04:18.000
the other cluster is of the players who
00:04:15.680 --> 00:04:20.560
scored less runs but took many wickets
00:04:18.000 --> 00:04:22.800
so here we interpret these two clusters
00:04:20.560 --> 00:04:24.800
as batsmen and bowlers the important
00:04:22.800 --> 00:04:27.520
point to note here is that there were no
00:04:24.800 --> 00:04:29.759
labels of batsmen and bowlers hence the
00:04:27.520 --> 00:04:31.360
learning with unlabeled data is
00:04:29.759 --> 00:04:33.199
unsupervised learning so we saw
00:04:31.360 --> 00:04:35.199
supervised learning where the data was
00:04:33.199 --> 00:04:37.520
labeled and the unsupervised learning
00:04:35.199 --> 00:04:39.360
where the data was unlabeled and then
00:04:37.520 --> 00:04:41.280
there is reinforcement learning which is
00:04:39.360 --> 00:04:42.560
a reward based learning or we can say
00:04:41.280 --> 00:04:44.639
that it works on the principle of
00:04:42.560 --> 00:04:46.960
feedback here let's say you provide the
00:04:44.639 --> 00:04:49.919
system with an image of a dog and ask it
00:04:46.960 --> 00:04:52.080
to identify it the system identifies it
00:04:49.919 --> 00:04:54.000
as a cat so you give a negative feedback
00:04:52.080 --> 00:04:55.600
to the machine saying that it's a dog's
00:04:54.000 --> 00:04:57.759
image the machine will learn from the
00:04:55.600 --> 00:04:59.919
feedback and finally if it comes across
00:04:57.759 --> 00:05:01.919
any other image of a dog it will be able
00:04:59.919 --> 00:05:03.840
to classify it correctly that is
00:05:01.919 --> 00:05:05.520
reinforcement learning to generalize
00:05:03.840 --> 00:05:07.680
machine learning model let's see a
00:05:05.520 --> 00:05:09.280
flowchart input is given to a machine
00:05:07.680 --> 00:05:10.960
learning model which then gives the
00:05:09.280 --> 00:05:13.520
output according to the algorithm
00:05:10.960 --> 00:05:16.000
applied if it's right we take the output
00:05:13.520 --> 00:05:18.080
as a final result else we provide
00:05:16.000 --> 00:05:20.639
feedback to the training model and ask
00:05:18.080 --> 00:05:22.160
it to predict until it learns i hope
00:05:20.639 --> 00:05:23.919
you've understood supervised and
00:05:22.160 --> 00:05:26.240
unsupervised learning so let's have a
00:05:23.919 --> 00:05:28.720
quick quiz you have to determine whether
00:05:26.240 --> 00:05:30.560
the given scenarios uses supervised or
00:05:28.720 --> 00:05:32.880
unsupervised learning simple right
00:05:30.560 --> 00:05:35.039
scenario one facebook recognizes your
00:05:32.880 --> 00:05:37.520
friend in a picture from an album of
00:05:35.039 --> 00:05:40.639
tagged photographs
00:05:37.520 --> 00:05:43.840
scenario 2 netflix recommends new movies
00:05:40.639 --> 00:05:46.400
based on someone's past movie choices
00:05:43.840 --> 00:05:48.800
scenario 3 analyzing bank data for
00:05:46.400 --> 00:05:51.120
suspicious transactions and flagging the
00:05:48.800 --> 00:05:53.360
fraud transactions think wisely and
00:05:51.120 --> 00:05:55.440
comment below your answers moving on
00:05:53.360 --> 00:05:57.680
don't you sometimes wonder how is
00:05:55.440 --> 00:05:59.280
machine learning possible in today's era
00:05:57.680 --> 00:06:02.000
well that's because today we have
00:05:59.280 --> 00:06:04.479
humongous data available everybody is
00:06:02.000 --> 00:06:06.240
online either making a transaction or
00:06:04.479 --> 00:06:08.560
just surfing the internet and that's
00:06:06.240 --> 00:06:10.960
generating a huge amount of data every
00:06:08.560 --> 00:06:13.440
minute and that data my friend is the
00:06:10.960 --> 00:06:15.520
key to analysis also the memory handling
00:06:13.440 --> 00:06:17.360
capabilities of computers have largely
00:06:15.520 --> 00:06:20.479
increased which helps them to process
00:06:17.360 --> 00:06:23.280
such huge amount of data at hand without
00:06:20.479 --> 00:06:25.360
any delay and yes computers now have
00:06:23.280 --> 00:06:27.280
great computational powers so there are
00:06:25.360 --> 00:06:29.520
a lot of applications of machine
00:06:27.280 --> 00:06:31.280
learning out there to name a few machine
00:06:29.520 --> 00:06:33.440
learning is used in healthcare where
00:06:31.280 --> 00:06:35.440
diagnostics are predicted for doctor's
00:06:33.440 --> 00:06:37.759
review the sentiment analysis that the
00:06:35.440 --> 00:06:39.600
tech giants are doing on social media is
00:06:37.759 --> 00:06:41.360
another interesting application of
00:06:39.600 --> 00:06:43.280
machine learning fraud detection in the
00:06:41.360 --> 00:06:45.520
finance sector and also to predict
00:06:43.280 --> 00:06:47.120
customer churn in the e-commerce sector
00:06:45.520 --> 00:06:49.759
while booking a gap you must have
00:06:47.120 --> 00:06:51.520
encountered surge pricing often where it
00:06:49.759 --> 00:06:54.240
says the fair of your trip has been
00:06:51.520 --> 00:06:56.000
updated continue booking yes please i'm
00:06:54.240 --> 00:06:58.160
getting late for office
00:06:56.000 --> 00:07:00.240
well that's an interesting machine
00:06:58.160 --> 00:07:02.639
learning model which is used by global
00:07:00.240 --> 00:07:04.639
taxi giant uber and others where they
00:07:02.639 --> 00:07:06.560
have differential pricing in real time
00:07:04.639 --> 00:07:10.000
based on demand the number of cars
00:07:06.560 --> 00:07:12.560
available bad weather rush r etc so they
00:07:10.000 --> 00:07:14.800
use the surge pricing model to ensure
00:07:12.560 --> 00:07:17.280
that those who need a cab can get one
00:07:14.800 --> 00:07:19.599
also it uses predictive modeling to
00:07:17.280 --> 00:07:21.680
predict where the demand will be high
00:07:19.599 --> 00:07:23.759
with the goal that drivers can take care
00:07:21.680 --> 00:07:26.319
of the demand and search pricing can be
00:07:23.759 --> 00:07:29.280
minimized great hey siri can you remind
00:07:26.319 --> 00:07:30.400
me to book a cab at 6 pm today ok i'll
00:07:29.280 --> 00:07:33.120
remind you
00:07:30.400 --> 00:07:35.520
thanks no problem comment below some
00:07:33.120 --> 00:07:37.360
interesting everyday examples around you
00:07:35.520 --> 00:07:39.840
where machines are learning and doing
00:07:37.360 --> 00:07:41.840
amazing jobs so that's all for machine
00:07:39.840 --> 00:07:43.680
learning basics today from my site keep
00:07:41.840 --> 00:07:48.199
watching this space for more interesting
00:07:43.680 --> 00:07:48.199
videos until then happy learning
|