A Small Crawler: Downloading ECG Data of the MIT-BIH Arrhythmia Database from PhysioNet

Category: Other | 2018-05-31 09:39:07


import urllib.request
import os


def url_open(url):
    '''Open a URL and return the raw response body as bytes.'''
    req = urllib.request.Request(url)
    # adjacent string literals are concatenated, so the header value
    # contains no stray whitespace from a line continuation
    req.add_header('User-Agent', 'Mozilla/5.0 (Windows NT 6.1; Win64; x64) '
                   'AppleWebKit/537.36 (KHTML, like Gecko) '
                   'Chrome/65.0.3325.181 Safari/537.36')
    with urllib.request.urlopen(req) as response:
        return response.read()


def save_file(file_url):
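    '''Download file_url and save it in the current directory.'''

    # the file name is the last component of the URL
    filename = file_url.split('/')[-1]
    # download first, so a failed request does not leave an empty file behind
    data = url_open(file_url)
    with open(filename, 'wb') as f:
        f.write(data)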


def download_file(folder="files"):
    '''Download the MIT-BIH header files from PhysioNet into folder.'''

    # create the folder if it doesn't exist
    if not os.path.exists(folder):
        os.makedirs(folder)
    os.chdir(folder)
    # base URL
    url = "https://physionet.org/physiobank/database/mitdb/"

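    # record numbers lie between 100 and 234, but not every number exists
    for i in range(100, 235):
        # URL of the ECG signal header file '*.hea'
        file_url = url + str(i) + '.hea'
        # save the file; skip record numbers the server does not have
        try:
            save_file(file_url)
        except OSError:  # urllib's URLError and HTTPError are OSError subclasses
            continue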

    # discard empty files in the download folder (now the current directory)
    for root, dirs, files in os.walk('.'):
        for f in files:
            path = os.path.join(root, f)
            if os.path.getsize(path) == 0:
                os.remove(path)


if __name__ == '__main__':
    download_file()
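
The script above probes every number from 100 to 234 and fetches only the `.hea` header files. As a hedged sketch of an alternative, the snippet below builds the download URLs from an explicit list of record names and also covers the signal (`.dat`) and annotation (`.atr`) files. The helper names `parse_records`, `build_file_urls` and `fetch_record_names` are hypothetical. It also assumes that the database directory serves a plain-text `RECORDS` index with one record name per line, which should be checked against the server before relying on it.

```python
import urllib.request

# Base URL taken from the script above
BASE_URL = "https://physionet.org/physiobank/database/mitdb/"


def parse_records(text):
    """Return the non-empty, stripped lines of a RECORDS-style index."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def build_file_urls(base_url, records, extensions=(".hea", ".dat", ".atr")):
    """Return one URL per (record, extension) pair, in record order."""
    return [base_url + record + ext for record in records for ext in extensions]


def fetch_record_names(base_url=BASE_URL):
    """Download and parse the RECORDS index (assumed to exist at base_url)."""
    with urllib.request.urlopen(base_url + "RECORDS") as response:
        return parse_records(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Offline demonstration with two record names written out by hand
    for url in build_file_urls(BASE_URL, parse_records("100\n101\n")):
        print(url)
```

Each resulting URL can be passed to `save_file` from the script above. With a known list of records there is no need to try numbers that do not exist.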

Reposted from blog.csdn.net/qq_23869697/article/details/80151289
