Machine Learning Course 4 Selection of Model

其他 2021-01-30 03:11:00 阅读次数: 0

Course 4

Average Error on testing data

Two main errors:

Error due to bias
Error due to variance

Estimator

Only god knows about the best function
$\hat f$
and we can only get a function from training data called
$f^*$
we say:
$f^*\quad is\quad an\quad estimator \quad of\quad \hat f$
and the difference between f^* and f^hat comes from bias and variance

Bias and Variance of Estimator

Suppose the mean of a variable is μ, and the variance of x is σ²
$\quad N \quad points:\{x^1,x^2,...,x^3\}$

$m=\frac{1}{N}\sum_n x^n\neq\mu \quad s^2=\frac 1 N \sum_n (x^n-m)^2$

$E[m]=E[\frac{1}{N}\sum_n x^n]=\frac 1 N \sum_n E[x^n]=\mu\quad E[s^2]=\frac{N-1}N \sigma ^2$

m is a biased estimator of μ, s² is a biased estimator of σ²
$Var[m]=\frac{\sigma^2}{N}$
which shows how much m deviates μ and the variance depends on the amount of sample

and the relationship of these parameters is below:
在这里插入图片描述

Simple model with small variance, and complicated model with large variance since simpler model is less likely to be influenced by the sampled data.
在这里插入图片描述

Diagnosis

If your model cannot even fit the training data, then you got a large bias, it is Underfitting
If you can fit training data but got large error on testing data, then you probably got a large variance, it is Overfitting

For bias, redesign your model:

Add more features as input
A more complex model maybe needed

For large variance:

More data is needed(Very effective but not always practical)
Regularization

Model Selection

There is usually a trade-off between bias and variance

Select a model that balances two kinds of error to minimize total error

Cross Validation could a possible way to make balance:
在这里插入图片描述

And an advanced method called N-fold Cross Validation can be used

在这里插入图片描述

猜你喜欢

转载自blog.csdn.net/weixin_43366276/article/details/107821803

Machine Learning Course 4 Selection of Model

Machine Learning Notes Course 3

Machine Learning - Coursera week4 model representation

Machine Learning:Model and Cost Function

Model Representation--machine learning

Understanding Model Parameters in Machine Learning

sklearn.model_selection.learning_curve

李宏毅Machine Learning学习笔记4 Classification: Probabilistic Generative Model

3.Your First Machine Learning Model

Deploying a Machine Learning Model as a REST API

深度学习-Course 3 : Structuring Machine Learning Projects

学习笔记之Machine Learning Crash Course | Google Developers

CS229 Machine Learning Stanford Course by Andrew Ng

kaggle官网course Machine—Learning课程Exercise全部答案

【Course】Machine learning：Week 1-Lecture1&Lecture2

【Course】Machine learning：Week 2-编程作业: Linear Regression

machine learning ex4

Model selection

Get a Model! Model Hijacking Attack Against Machine Learning Models

coursera deep learning course4 week4

论文阅读 | BadNets: Identifying Vulnerabilities in the Machine Learning Model Supply Chain

[Machine Learning] 生成模型 & 判别模型（Generative & Discriminative Model）

机器学习肝炎预测模型machine learning for hepatitis prediction model

Angrew Machine Learning ex4

Python Machine Learning-Chapter4

Machine Learning 2014 by Andrew NG (part 4)

Machine Learning by Ng - 编程作业4

ZOJ 3956 Course Selection System

Course Selection System 【01背包】

Course Selection System ZOJ - 3956

今日推荐

美国拟限制 AI 大模型出口中国和俄罗斯

苹果将与 OpenAI 达成协议，将 ChatGPT 应用于 iPhone

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

周排行

NEFU 117 素数个数的位数

Closest Common Ancestors (Lca,tarjan)

ELK部署

【转载】Hive笔记整理（三）

SQL语句（一）基本表的定义

关于Java web开发中的MySQL的事务语句

MFC创建自定义窗体

如何用一句话激怒程序员？

《逆袭大学》文摘——9.4 基础和应用的平衡中找到大学的节奏

【spring源码分析】@Value注解原理

每日归档

更多

2024-05-11(38)

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)