python-fullstack-s13-day24-python模块部分

Json模块提供了四个功能:dumps、dump、loads、load
1
import json 2 # dumps loads一般用于网络传输数据使用,可在多语言环境中使用,但是只支持str,dict,list,bool 3 dic = {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"} 4 ret = json.dumps(dic) 5 ret2 = json.dumps(dic, ensure_ascii=False) # dumps()括号内可加参数,indent缩进量等参数 6 print(ret) # {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "\u8001\u5973\u4eba"} 7 print(ret2) # {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"} 8 dic_ret = json.loads(ret) 9 print(dic_ret) # {'alex': ['woman', 'man', 'lady_boy'], 'age': 13, 'hobby': '老女人'}
 1 import json
 2 # 利用json的dumps loads向文件中写多条数据读取多行数据
 3 dic = {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 4 dic1 = {"alex1": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 5 dic2 = {"alex2": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 6 with open("dumps", mode="w", encoding="utf-8") as f:
 7     f.write(json.dumps(dic) + "\n")
 8     f.write(json.dumps(dic1) + "\n")
 9     f.write(json.dumps(dic2) + "\n")
10 
11 with open("dumps", encoding="utf-8") as f1:
12     for line in f1:
13         print(json.loads(line))
 1 import json
 2 
 3 dic = {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 4 dic1 = {"alex1": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 5 dic2 = {"alex2": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
 6 with open("file", mode="w", encoding="utf-8") as f:
 7     json.dump(dic, f)
 8     # json.dump(dic1, f)
 9     # json.dump(dic2, f)
10 # 能够成功向文件中写入多条数据,但都是写到一行上,而外读取的时候会发生错误。只有写入时是一条数据的时候不会发生错误。
11 with open("file", encoding="utf-8") as f2:
12     print(json.load(f2))



Serialize obj to a JSON formatted str.(字符串表示的json对象)
Skipkeys:默认值是False,如果dict的keys内的数据不是python的基本类型(str,unicode,int,long,float,bool,None),设置为False时,就会报TypeError的错误。此时设置成True,则会跳过这类key
ensure_ascii:,当它为True的时候,所有非ASCII码字符显示为\uXXXX序列,只需在dump时将ensure_ascii设置为False即可,此时存入json的中文即可正常显示。)
If check_circular is false, then the circular reference check for container types will be skipped and a circular reference will result in an OverflowError (or worse).
If allow_nan is false, then it will be a ValueError to serialize out of range float values (nan, inf, -inf) in strict compliance of the JSON specification, instead of using the JavaScript equivalents (NaN, Infinity, -Infinity).
indent:应该是一个非负的整型,如果是0就是顶格分行显示,如果为空就是一行最紧凑显示,否则会换行且按照indent的数值显示前面的空白分行显示,这样打印出来的json数据也叫pretty-printed json
separators:分隔符,实际上是(item_separator, dict_separator)的一个元组,默认的就是(‘,’,’:’);这表示dictionary内keys之间用“,”隔开,而KEY和value之间用“:”隔开。
default(obj) is a function that should return a serializable version of obj or raise TypeError. The default simply raises TypeError.
sort_keys:将数据根据keys的值进行排序。
To use a custom JSONEncoder subclass (e.g. one that overrides the .default() method to serialize additional types), specify it with the cls kwarg; otherwise JSONEncoder is used.

其他参数说明

 
 

用于序列化的两个模块

  • json,用于字符串 和 python数据类型间进行转换
  • pickle,用于python特有的类型 和 python的数据类型间进行转换

pickle模块提供了四个功能:dumps、dump(序列化,存)、loads(反序列化,读)、load  (不仅可以序列化字典,列表...可以把python中任意的数据类型序列化

1 import pickle
2 # 用于网络传输的时候会将数据转化为bytes类型
3 dic = {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
4 print(pickle.dumps(dic))    # b'\x80\x03}q\x00(X\x04\x00\x00\x00alexq\x01]q\    bytes字节类型数据
5 print(pickle.loads(pickle.dumps(dic)))  # {'alex': ['woman', 'man', 'lady_boy'], 'age': 13, 'hobby': '老女人'}
import pickle
dic = {"alex": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
dic1 = {"alex1": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
dic2 = {"alex2": ["woman", "man", "lady_boy"], "age": 13, "hobby": "老女人"}
with open("files", mode="wb") as f1:    # 因为dump出来的是bytes类型的数据,所以向文件中写入到时候要选择b类型操作
    pickle.dump(dic, f1)        # dump()可多次向一个文件中写入多条数据
    pickle.dump(dic1, f1)
    pickle.dump(dic2, f1)
    pickle.dump(dic2, f1)
    pickle.dump(dic2, f1)
    pickle.dump(dic2, f1)


with open("files", mode="rb") as f2:
    while True:     # 读取的时候当文件句柄为生成器
        try:
            print(pickle.load(f2))
        except EOFError:
            break

  shelve也是python提供给我们的序列化工具,比pickle用起来更简单一些。
  shelve只提供给我们一个open方法,是用key来访问的,使用起来和字典类似。

 1 import shelve
 2 f = shelve.open('shelve_file')
 3 f['key'] = {'int':10, 'float':9.5, 'string':'Sample data'}  #直接对文件句柄操作,就可以存入数据
 4 f.close()
 5 
 6 import shelve
 7 f1 = shelve.open('shelve_file')
 8 existing = f1['key']  #取出数据的时候也只需要直接用key获取即可,但是如果key不存在会报错
 9 f1.close()
10 print(existing)

  这个模块有个限制,它不支持多个应用同一时间往同一个DB进行写操作。所以当我们知道我们的应用如果只进行读操作,我们可以让shelve通过只读方式打开DB

1 import shelve
2 f = shelve.open('shelve_file', flag='r')
3 existing = f['key']
4 f.close()
5 print(existing)

  由于shelve在默认情况下是不会记录待持久化对象的任何修改的,所以我们在shelve.open()时候需要修改默认参数,否则对象的修改不会保存。

import shelve
f1 = shelve.open('shelve_file')
print(f1['key'])
f1['key']['new_value'] = 'this was not here before'
f1.close()

f2 = shelve.open('shelve_file', writeback=True)
print(f2['key'])
f2['key']['new_value'] = 'this was not here before'
f2.close()

  writeback方式有优点也有缺点。优点是减少了我们出错的概率,并且让对象的持久化对用户更加的透明了;但这种方式并不是所有的情况下都需要,首先,使用writeback以后,shelf在open()的时候会增加额外的内存消耗,并且当DB在close()的时候会将缓存中的每一个对象都写入到DB,这也会带来额外的等待时间。因为shelve没有办法知道缓存中哪些对象修改了,哪些对象没有修改,因此所有的对象都会被写入。

猜你喜欢

转载自www.cnblogs.com/bug-ming/p/9270221.html