数据获取_json文件读取和存储

什么是JSON文件?

JSON数据是一种轻量级得数据交换格式,因起简洁和清晰得层次结构使JSON成为理想的数据交换语言。

import pandas as pd
import numpy as np
import os

os.chdir(r'E:\python_learn\train')  # 相对路径

json文件读取 → pd.read_json

read_json读取json文件后,返回DataFrame,是一个index标签乱序的顺序,可以用sort_index对其重新排序

file_name = 'eueo2012.json'
json = pd.read_json(file_name)
json.head()
Team Goals Shots on target Shots off target Shooting Accuracy % Goals-to-shots Total shots (inc. Blocked) Hit Woodwork Penalty goals Penalties not scored ... Saves made Saves-to-shots ratio Fouls Won Fouls Conceded Offsides Yellow Cards Red Cards Subs on Subs off Players Used
0 Croatia 4 13 12 51.9% 16.0% 32 0 0 0 ... 13 81.3% 41 62 2 9 0 9 9 16
1 Czech Republic 4 13 18 41.9% 12.9% 39 0 0 0 ... 9 60.1% 53 73 8 7 0 11 11 19
10 Portugal 6 22 42 34.3% 9.3% 82 6 0 0 ... 10 71.5% 73 90 10 12 0 14 14 16
11 Republic of Ireland 1 7 12 36.8% 5.2% 28 0 0 0 ... 17 65.4% 43 51 11 6 1 10 10 17
12 Russia 5 9 31 22.5% 12.5% 59 2 0 0 ... 10 77.0% 34 43 4 6 0 7 7 16

5 rows × 35 columns

# index排序
json = json.sort_index()
json.head()
Team Goals Shots on target Shots off target Shooting Accuracy % Goals-to-shots Total shots (inc. Blocked) Hit Woodwork Penalty goals Penalties not scored ... Saves made Saves-to-shots ratio Fouls Won Fouls Conceded Offsides Yellow Cards Red Cards Subs on Subs off Players Used
0 Croatia 4 13 12 51.9% 16.0% 32 0 0 0 ... 13 81.3% 41 62 2 9 0 9 9 16
1 Czech Republic 4 13 18 41.9% 12.9% 39 0 0 0 ... 9 60.1% 53 73 8 7 0 11 11 19
2 Denmark 4 10 10 50.0% 20.0% 27 1 0 0 ... 10 66.7% 25 38 8 4 0 7 7 15
3 England 5 11 18 50.0% 17.2% 40 0 0 0 ... 22 88.1% 43 45 6 5 0 11 11 16
4 France 3 22 24 37.9% 6.5% 65 1 0 0 ... 6 54.6% 36 51 5 6 0 11 11 19

5 rows × 35 columns

存储为JSON文件 → to_json

# 创建数据
df = pd.DataFrame(np.random.rand(5,6),columns=list('ABCDEF'))
print(df.head())
          A         B         C         D         E         F
0  0.849306  0.678014  0.348948  0.948653  0.868043  0.315411
1  0.321510  0.264775  0.356553  0.343864  0.200426  0.608816
2  0.771197  0.326742  0.497303  0.138062  0.528672  0.044052
3  0.319267  0.641123  0.154185  0.257138  0.119991  0.623622
4  0.191081  0.329389  0.282885  0.875474  0.329627  0.638168
# 存储为json文件
df.to_json('json_out.json')
print('存储完成!')
存储完成!
发布了10 篇原创文章 · 获赞 0 · 访问量 18

猜你喜欢

转载自blog.csdn.net/weixin_45556639/article/details/105469111
今日推荐