使用python3批量下载rbsp数据 - 代码天地

使用python3批量下载rbsp数据

其他 2020-01-24 10:30:02 阅读次数: 0

1. 原始网站

https://www.rbsp-ect.lanl.gov/data_pub/rbspa/

2. 算法说明

进入需要下载的数据所在的目录，获取并解析该目录下的信息，解析出cdf文件名后，将cdf文件下载到内存中，随后保存到硬盘中。程序使用python3实现。

3. 程序代码

#!/bin/python3
# get the rbsp data
# writen by Liangjin Song on 20191219
import sys
import requests
from pathlib import Path

# the url containing the cdf files
url="https://www.rbsp-ect.lanl.gov/data_pub/rbspa/ECT/level2/2016/"
# local path to save the cdf file
path="/home/liangjin/Downloads/test/"

def main():
    re=requests.get(url)
    html=re.text
    cdfs=resolve_cdf(html)

    ncdf=len(cdfs)
    if ncdf == 0:
        return

    print(str(ncdf) + " cdf files are detected.")

    i=1
    # download 
    for f in cdfs:
        rcdf=url+f
        lcdf=path+f
        print(str(i)+ "   Downloading " + rcdf)
        download_cdf(rcdf,lcdf)
        i+=1
    return

# resolve the file name of cdf
def resolve_cdf(html):
    cdfs=list()
    head=html.find("href=")
    
    if head == -1:
        print("The cdf files not found!")
        return cdfs

    leng=len(html)

    while head != -1:
        tail=html.find(">",head,leng)
        # Extract the cdf file name
        cdf=html[head+6:tail-1]
        head=html.find("href=",tail,leng)
        if cdf.find('cdf') == -1:
            continue
        cdfs.append(cdf)
    return cdfs

def download_cdf(rcdf,lcdf):
    rfile=requests.get(rcdf)
    with open(lcdf,"wb") as f:
        f.write(rfile.content)
    f.close()
    return

if __name__ == "__main__":
    lpath=Path(path)
    if not lpath.is_dir():
        print("Path not found: " + path)
        sys.exit(0)
    sys.exit(main())

4. 使用说明

url为远程cdf文件所在路径。
path为本地保存cdf文件的路径。
url和path的末尾都有“/”（Linux下情形，若是Windows，路径分隔符为“\\”，则path末尾应为“\\”）。

5. 运行效果

在这里插入图片描述

不入流的IT宅男

发布了42 篇原创文章 · 获赞 5 · 访问量 2952

私信关注

猜你喜欢

转载自blog.csdn.net/Function_RY/article/details/103622772

使用python3批量下载rbsp数据

使用python3批量下载网站图片

Python3批量下载.dat和.hea文件

python3批量抓取电影天堂下载链接

python3批量为文件重命名

Python3批量转换文件编码

python3批量telnet脚本

Python3批量处理域名解析

20230507使用python3批量转换DOCX文档为TXT

20230508在Ubuntu22.04下使用python3批量转换DOCX文档为TXT

20230809在WIN10下使用python3批量将TXT文件转换为SRT文件

20230811在WIN11下使用python3批量将中英文的SRT格式的字幕合并

Python3批量修改文件名脚本

Python3批量合并excel 格式xlsx和xls都行

Python3批量修改文件名小工具

【python】爬虫篇：python使用psycopg2批量插入数据（三）

使用Spark3批量导入数据至MongoDB

网页视频解密下载 TS解密下载 M3U8批量下载

[云炬python3玩转机器学习笔记] 2-4批量学习、咋西安学习、参数学习和非参数学习

猫抓+M3U8批量下载合并

urllib3批量下载百度图片

mybatis3批量更新批量插入

tp3批量处理几万条数据

BDC3批量创建物料主数据

PHP使用Sqlite3批量插入调优

60-010-020-使用-Nexus3批量上传jar包

1.上传文件到服务器；2批量文件下载；3单个文件下载

7.6批量下载网易云歌曲

MP3批量压缩体积工具

mp3批量剪切

今日推荐

openKylin 社区生态委员会第六次会议圆满召开

阿里云正式发布通义千问 2.5

Python 3.13 发布首个 Beta：实验性自由线程模式和 JIT、改进交互式解释器

Stack Overflow 拿我的代码去训练 AI 大模型，还封了我的账号

Pop!_OS 的 COSMIC 桌面完成 App Store 上架工作

报告：Django 仍然是 74% 开发者的首选

《2024 年一季度互联网投融资运行情况》研究报告

15 年前上了“FFmpeg 耻辱柱”，今天他还得谢谢咱——腾讯QQPlayer一雪前耻？

TIOBE 5 月榜单：Fortran “复活”进入 Top 10

GCC 14.1 发布

面壁智能发布 Eurux-8x22B 开源大模型 —— 堪称「理科状元」

开源日报 | 谷歌扶持鸿蒙上位；开源Rabbit R1；Docker加持的安卓手机；微软的焦虑和野心；海尔电器把开放平台关了

周排行

计算机组成与设计（七）—— 除法器

Integer Approximation(分治+枚举)

大话数据库索引

windows10系统JDK的配置及下载地址

mysql实现秒值转换中原六仔平台搭建

Codeforces Round #556 (Div. 1)

百练1064 网线主管

Codeforces 995F Cowmpany Cowmpensation

子集生成之增量构造法，位向量法，二进制法

ERROR: cmd.exe failed with args /c "/APK\gradle\rungradle.bat...

每日归档

更多

2024-05-10(38)

2024-05-09(35)

2024-05-08(42)

2024-05-07(14)

2024-05-06(40)

2024-05-05(0)

2024-05-04(7)

2024-05-03(19)

2024-05-02(0)

2024-05-01(4)