diff options
-rw-r--r-- | ai-slides.tex | 26 |
1 files changed, 14 insertions, 12 deletions
diff --git a/ai-slides.tex b/ai-slides.tex index 32c53c3..3b79401 100644 --- a/ai-slides.tex +++ b/ai-slides.tex @@ -22,14 +22,7 @@ \foilhead{教学大纲} \begin{itemize} \item 课本。Gareth James, Daniela Witten, Trevor Hastie and Robert Tibshirani. {\bf An Introduction to Statistical Learning: with Applications in R}. [下载http://www-bcf.usc.edu/$\sim$gareth/ISL/ISLR Seventh Printing.pdf] -\item 成绩给予:\\ - -\begin{verbatim} - 课堂参与度 30分 - 作业 20分 - 期末考试 50分 -\end{verbatim} - +\item 成绩给予:课堂参与度(30分), 作业(20分), 期末考试(50分)。 \end{itemize} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% @@ -351,9 +344,12 @@ $\mathrm{income} = \beta_0 + \beta_1 \times \mathrm{education} + \beta_2 \times %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \foilhead{回归问题,分类问题} -两类响应变量,一类是定量的(如年龄,高度,收入),一类是定性的(也称categorical),如性别,品牌,癌症类型。 +两类响应变量,一类是定量的(如年龄,高度,收入),一类是定性的(也称 + categorical),如性别,品牌,癌症类型。 + +定量的响应变量一般对应回归问题,least squares。 -定量的响应变量一般对应回归问题,least squares。定性的响应变量一般对应分类问题, logistic regression。 +定性的响应变量一般对应分类问题, logistic regression。 预测变量是定量还是定性则没有什么关系。 @@ -544,19 +540,25 @@ Least squares line. $\hat{Y}=\hat{\beta}_0 + \hat{\beta}_1 X$. \end{center} 参数估计也是随机变量,在不同的训练集上有不同的参数估计。参数估计的标准 -错误(standard error) $\mathrm{SE}(\hat{\beta})$。 +错误(Standard Error) $\mathrm{SE}(\hat{\beta})$。 \begin{center} \includegraphics[width=0.99\textwidth]{ISLR_Eq3_8.png} \end{center} -$\sigma$通常未知,可以用Residual Standard Error = $\mathrm{RSE}= \sqrt{\mathrm{RSS}/(n-2)}$估计。 +上述公式的成立条件是: $\epsilon_i$ 之间不相关, $\epsilon_i$ 的Variance都是 $\sigma$。 + +$\sigma$ 通常未知,可以用Residual Standard Error = $\mathrm{RSE}= \sqrt{\mathrm{RSS}/(n-2)}$ 代替。 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \foilhead{置信区间} 95\%置信区间(confidence interval)。$[\hat{\beta_1} - 2 \cdot \mathrm{SE}(\hat{\beta_1}), \;\; \hat{\beta_1} + 2 \cdot \mathrm{SE}(\hat{\beta_1})]$。 只有5\%的概率真正的$\beta_1$会落在这个区间\underline{外面}。 +\begin{center} + \includegraphics[width=0.62\textwidth]{DeriveConfidenceInterval.png} +\end{center} + %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \foilhead{假设检验} |