Advanced Programming Techniques, Week 14 (added the result figure for Problem 1 and the revised code for Problem 2)

Other 2018-06-20 05:15:15

%matplotlib inline

import random

import numpy as np
import scipy as sp
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns

import statsmodels.api as sm
import statsmodels.formula.api as smf

sns.set_context("talk")

Anscombe's quartet

Anscombe's quartet comprises four datasets and is rather famous. Why? You'll find out in this exercise.

anascombe = pd.read_csv('data/anscombe.csv')
anascombe.head()

Output:

Part 1

For each of the four datasets...

Compute the mean and variance of both x and y
Compute the correlation coefficient between x and y
Compute the linear regression line: y=β0+β1x+ϵ (hint: use statsmodels and look at the Statsmodels notebook)

# Group once and reuse the grouping for every statistic.
# Columns are selected with a list: tuple-style ['x','y'] on a groupby
# raises an error in recent pandas versions.
groups = anascombe.groupby('dataset')
print("Mean of x and y for each dataset:")
print(groups[['x', 'y']].mean())
print("Variance of x and y for each dataset:")
print(groups[['x', 'y']].var())
print("Correlation coefficient between x and y for each dataset:")
print(groups[['x', 'y']].corr())
# Fit y ~ x by ordinary least squares for each dataset;
# params holds the intercept (β0) and the slope (β1).
for name, data in groups:
    print("Linear regression line for dataset %s:" % name)
    lin_model = smf.ols('y ~ x', data).fit()
    print(lin_model.params)

Output:
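
As a cross-check on the Part 1 statistics, the same quantities can be computed for a single dataset with NumPy alone. This is only a sketch: it hard-codes dataset I of Anscombe's quartet instead of reading data/anscombe.csv, and it uses np.polyfit as a stand-in for the statsmodels fit (both are least-squares fits of a straight line, so the coefficients should agree).

```python
import numpy as np

# Dataset I of Anscombe's quartet, hard-coded here so the sketch runs
# without data/anscombe.csv (an assumption made for illustration only).
x = np.array([10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5], dtype=float)
y = np.array([8.04, 6.95, 7.58, 8.81, 8.33, 9.96,
              7.24, 4.26, 10.84, 4.82, 5.68])

# Means of x and y
mean_x, mean_y = x.mean(), y.mean()

# Sample variances (ddof=1), the same convention pandas' var() uses by default
var_x, var_y = x.var(ddof=1), y.var(ddof=1)

# Pearson correlation coefficient: off-diagonal entry of the 2x2 matrix
r = np.corrcoef(x, y)[0, 1]

# Least-squares line y = intercept + slope * x
# (polyfit returns the highest-degree coefficient first)
slope, intercept = np.polyfit(x, y, 1)

print("mean:      x=%.2f, y=%.2f" % (mean_x, mean_y))
print("variance:  x=%.2f, y=%.2f" % (var_x, var_y))
print("corr(x,y): %.3f" % r)
print("fit:       y = %.2f + %.2f * x" % (intercept, slope))
```

For dataset I this gives a mean of 9.00 for x and about 7.50 for y, variances of 11.00 and about 4.13, a correlation of about 0.816, and a fitted line close to y = 3.00 + 0.50x. The other three datasets produce nearly identical numbers, which is why the plots in Part 2 are needed to tell them apart.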

Part 2

Using Seaborn, visualize all four datasets.

hint: use sns.FacetGrid combined with plt.scatter

sns.set(color_codes=True)
g = sns.FacetGrid(anascombe, col="dataset")
g.map(plt.scatter, "x", "y")

Output:

Reposted from blog.csdn.net/qq_36319729/article/details/80646590
