[ Source: chardet ]
Package: python3-chardet (5.2.0+dfsg-1)
Universal Character Encoding Detector (Python3)
Chardet is a continuation of Mark Pilgrim's excellent original chardet port from C, and Ian Cordasco's charade Python 3-compatible fork. Chardet takes a sequence of bytes in an unknown character encoding, and attempts to determine the encoding.
Supported encodings:
* Big5, GB2312/GB18030, EUC-TW, HZ-GB-2312, and ISO-2022-CN (Traditional and Simplified Chinese)
* EUC-JP, SHIFT_JIS, and ISO-2022-JP (Japanese)
* EUC-KR and ISO-2022-KR (Korean)
* KOI8-R, MacCyrillic, IBM855, IBM866, ISO-8859-5, and windows-1251 (Russian)
* ISO-8859-2 and windows-1250 (Hungarian)
* ISO-8859-5 and windows-1251 (Bulgarian)
* ISO-8859-1 and windows-1252 (Western European languages)
* ISO-8859-7 and windows-1253 (Greek)
* ISO-8859-8 and windows-1255 (Visual and Logical Hebrew)
* TIS-620 (Thai)
* UTF-32 BE, LE, 3412-ordered, or 2143-ordered (with a BOM)
* UTF-16 BE or LE (with a BOM)
* UTF-8 (with or without a BOM)
* ASCII
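Several of the Unicode entries above are recognized by their byte order mark (BOM). The following is a minimal sketch of that idea using only the Python standard library. It is not chardet's own code, and the function name `sniff_bom` and the returned labels are hypothetical, chosen for illustration.

```python
import codecs

# Longer BOMs are listed first, because the UTF-32 LE BOM (FF FE 00 00)
# begins with the UTF-16 LE BOM (FF FE), and the 3412-ordered UTF-32 BOM
# (FE FF 00 00) begins with the UTF-16 BE BOM (FE FF).
BOM_TABLE = [
    (codecs.BOM_UTF32_BE, "UTF-32 BE"),            # 00 00 FE FF
    (codecs.BOM_UTF32_LE, "UTF-32 LE"),            # FF FE 00 00
    (b"\xfe\xff\x00\x00", "UTF-32 3412-ordered"),  # unusual octet order
    (b"\x00\x00\xff\xfe", "UTF-32 2143-ordered"),  # unusual octet order
    (codecs.BOM_UTF8, "UTF-8 with BOM"),           # EF BB BF
    (codecs.BOM_UTF16_BE, "UTF-16 BE"),            # FE FF
    (codecs.BOM_UTF16_LE, "UTF-16 LE"),            # FF FE
]


def sniff_bom(data: bytes):
    """Return an illustrative label for the BOM that starts `data`, or None."""
    for bom, label in BOM_TABLE:
        if data.startswith(bom):
            return label
    return None


print(sniff_bom(codecs.BOM_UTF8 + "héllo".encode("utf-8")))
print(sniff_bom(b"no BOM here"))
```

Input without a BOM returns `None` here. A real detector has to fall back to statistical analysis of the byte content for such input.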
This library is a port of the auto-detection code in Mozilla.
This package contains the Python 3 version of the library.
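As a rough illustration of the detection described above, the sketch below passes a byte string to `chardet.detect()`, which returns a dictionary that includes `encoding` and `confidence` keys. The helper name `guess_encoding` and the sample text are assumptions made for this example. The import guard only lets the snippet run on systems where the package is not installed.

```python
try:
    import chardet  # provided by the python3-chardet package
except ImportError:
    chardet = None  # keep the sketch runnable without the package


def guess_encoding(data: bytes):
    """Return (encoding, confidence) as reported by chardet, or None if unavailable."""
    if chardet is None:
        return None
    result = chardet.detect(data)
    return result["encoding"], result["confidence"]


# Hypothetical sample: Russian text encoded as KOI8-R.
sample = "Привет, мир! Это пример текста на русском языке.".encode("koi8-r")
print(guess_encoding(sample))
```

The result is a guess with a confidence score between 0 and 1, not a guarantee. Short inputs in particular may be reported with low confidence or as a different encoding.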
Other Packages Related to python3-chardet
- dep: python3
  interactive high-level object-oriented language (default python3 version)
- dep: python3-pkg-resources
  Package Discovery and Resource Access using pkg_resources