Python中bytes和str的区别与联系详解-偶像资源网

Bytes和Str的区别

在Python3中，字符序列有两种类型：bytes和str。bytes类型是无符号的8位值（通常以ASCII码显式），而str类型是Unicode代码点（code point）。代码点指编码字符集中，字符所对应的数字。

a = b\'hello world\'
print(isinstance(a, bytes))
print(list(a))
print(a)
\"\"\"
True
[104, 101, 108, 108, 111, 32, 119, 111, 114, 108, 100]
b\'hello world\'
\"\"\"

a = \'hello world\'
print(isinstance(a, str))
print(list(a))
print(a)
\"\"\"
True
[\'h\', \'e\', \'l\', \'l\', \'o\', \' \', \'w\', \'o\', \'r\', \'l\', \'d\']
hello world
\"\"\"

isinstance()方法可以判断对象的类型，例如这里用来判断是str还是bytes。

Python3对文本（str）和二进制数据（bytes）有着严格的区分，不能混用。

x = b\'python\'
y = b\'java\'
z = \'c++\'
w = \'c\'

print(x + y)
# b\'pythonjava\'
print(z + w)
# c++c
print(x + z)
# TypeError: can\'t concat str to bytes

print(\'python\' == b\'python\')
# False

上述示例中str类型和bytes类型间使用=来比较是否相等不会报错，但是会返回False。

Bytes与Str间的转换

str类型和bytes类型间可以相互转换。

str到bytes的转换需要调用encode()方法。

bytes到str间的转换需要调用decode()方法。

x = b\'python\'
y = x.decode(encoding=\'utf-8\')
z = y.encode(encoding=\'utf-8\')
print(y)
print(z)
\"\"\"
python
b\'python\'
\"\"\"

可以观察到encode()和decode()方法都有一个encoding参数用来指定具体的编码规则。

读写文件的注意事项

当要将bytes类型写入到文件中时，必须指定mode=wb。读取二进制文件时可以指定mode=rb或者指定编码方式，使用后者时读出来的就不是bytes类型的字符序列了。

x = b\'python\'

# 错误示例
with open(\'data.bin\', mode=\'w\') as fp:
    fp.write(x)
# TypeError: write() argument must be str, not bytes

# 正确示例
with open(\'data.bin\', mode=\'wb\') as fp:
    fp.write(x)

# 读取二进制文件方式1
with open(\'data.bin\', mode=\'rb\') as fp:
    content = fp.read()
    print(content)
# python

# 读取二进制文件方式2
with open(\'data.bin\', mode=\'r\', encoding=\'utf-8\') as fp:
    content = fp.read()
    print(content, type(content))
# python <class \'str\'>

当读写Unicode数据时，只需要注意下编码方式即可，最好是显式的传递encoding参数。

x = \'世界你好\'

with open(\'data.txt\', mode=\'w\', encoding=\'utf-8\') as fp:
    fp.write(x)

with open(\'data.txt\', mode=\'r\', encoding=\'utf-8\') as fp:
    content = fp.read()
    print(content)
# 世界你好

# 错误示例，编码方式不对
with open(\'data.txt\', mode=\'r\', encoding=\'gbk\') as fp:
    content = fp.read()
    print(content)
# 涓栫晫浣犲ソ

总结

版权声明 1 本网站名称：偶像资源网
2 本站永久网址：https://www.ox520.com
3 本网站的文章部分内容可能来源于网络，仅供大家学习与参考，如有侵权，请联系站长 QQ593098775进行删除处理。
4 本站一切资源不代表本站立场，并不代表本站赞同其观点和对其真实性负责。
5 本站一律禁止以任何方式发布或转载任何违法的相关信息，访客发现请向站长举报
6 本站资源大多存储在云盘，如发现链接失效，请联系我们我们会第一时间更新。

THE END

Python中bytes和str的区别与联系详解

目录

Bytes和Str的区别

Bytes与Str间的转换

读写文件的注意事项

总结

请登录后发表评论