The original description was slightly misleading by saying that utf-8 will encode and find the length, whereas in reality the length was produced by len.
285 B
285 B
byte_size
Returns the length of a string in bytes.
utf-8 encodes a given string, then len finds the length of the encoded string.
def byte_size(string):
return(len(string.encode('utf-8')))
byte_size('😀') # 4
byte_size('Hello World') # 11