koi-utf 文件解释
[root@release nginx]# more koi-utf # This map is not a full koi8-r <> utf8 map: it does not contain
# box-drawing and some other characters. Besides this map contains
# several koi8-u and Byelorussian letters which are not in koi8-r.
# If you need a full and standard map, use contrib/unicode2nginx/koi-utf
# map instead.charset_map koi8-r utf-8 {80 E282AC ; # euro95 E280A2 ; # bullet9A C2A0 ; # 9E C2B7 ; # ·A3 D191 ; # small yoA4 D194 ; # small Ukrainian yeA6 D196 ; # small Ukrainian iA7 D197 ; # small Ukrainian yiAD D291 ; # small Ukrainian soft gAE D19E ; # small Byelorussian short uB0 C2B0 ; # °B3 D081 ; # capital YOB4 D084 ; # capital Ukrainian YEB6 D086 ; # capital Ukrainian IB7 D087 ; # capital Ukrainian YIB9 E28496 ; # numero signBD D290 ; # capital Ukrainian soft GBE D18E ; # capital Byelorussian short UBF C2A9 ; # (C)C0 D18E ; # small yuC1 D0B0 ; # small aC2 D0B1 ; # small bC3 D186 ; # small tsC4 D0B4 ; # small dC5 D0B5 ; # small yeC6 D184 ; # small fC7 D0B3 ; # small gC8 D185 ; # small khC9 D0B8 ; # small iCA D0B9 ; # small jCB D0BA ; # small kCC D0BB ; # small lCD D0BC ; # small mCE D0BD ; # small nCF D0BE ; # small oD0 D0BF ; # small pD1 D18F ; # small yaD2 D180 ; # small rD3 D181 ; # small sD4 D182 ; # small tD5 D183 ; # small uD6 D0B6 ; # small zhD7 D0B2 ; # small vD8 D18C ; # small soft signD9 D18B ; # small yDA D0B7 ; # small zDB D188 ; # small shDC D18D ; # small eDD D189 ; # small shchDE D187 ; # small chDF D18A ; # small hard signE0 D0AE ; # capital YUE1 D090 ; # capital AE2 D091 ; # capital BE3 D0A6 ; # capital TSE4 D094 ; # capital DE5 D095 ; # capital YEE6 D0A4 ; # capital FE7 D093 ; # capital GE8 D0A5 ; # capital KHE9 D098 ; # capital IEA D099 ; # capital JEB D09A ; # capital KEC D09B ; # capital LED D09C ; # capital MEE D09D ; # capital NEF D09E ; # capital OF0 D09F ; # capital PF1 D0AF ; # capital YAF2 D0A0 ; # capital RF3 D0A1 ; # capital SF4 D0A2 ; # capital TF5 D0A3 ; # capital UF6 D096 ; # capital ZHF7 D092 ; # capital VF8 D0AC ; # capital soft signF9 D0AB ; # capital YFA D097 ; # capital ZFB D0A8 ; # capital SHFC D0AD ; # capital EFD D0A9 ; # capital SHCHFE D0A7 ; # capital CHFF D0AA ; # capital hard sign
}
koi-utf 文件是一个字符集映射文件,用于将 KOI8-R 字符集转换为 UTF-8 编码。这些映射定义了如何将 KOI8-R 编码的字符转换为对应的 UTF-8 字符。
koi-utf
文件是一个字符集映射文件,用于将 KOI8-R 字符集转换为 UTF-8 编码。这些映射定义了如何将 KOI8-R 编码的字符转换为对应的 UTF-8 字符。
文件内容解释
# This map is not a full koi8-r <> utf8 map: it does not contain
# box-drawing and some other characters. Besides this map contains
# several koi8-u and Byelorussian letters which are not in koi8-r.
# If you need a full and standard map, use contrib/unicode2nginx/koi-utf
# map instead.charset_map koi8-r utf-8 {80 E282AC ; # euro95 E280A2 ; # bullet9A C2A0 ; # 9E C2B7 ; # ·A3 D191 ; # small yoA4 D194 ; # small Ukrainian yeA6 D196 ; # small Ukrainian iA7 D197 ; # small Ukrainian yiAD D291 ; # small Ukrainian soft gAE D19E ; # small Byelorussian short uB0 C2B0 ; # °B3 D081 ; # capital YOB4 D084 ; # capital Ukrainian YEB6 D086 ; # capital Ukrainian IB7 D087 ; # capital Ukrainian YIB9 E28496 ; # numero signBD D290 ; # capital Ukrainian soft GBE D18E ; # capital Byelorussian short UBF C2A9 ; # (C)C0 D18E ; # small yuC1 D0B0 ; # small aC2 D0B1 ; # small bC3 D186 ; # small tsC4 D0B4 ; # small dC5 D0B5 ; # small yeC6 D184 ; # small fC7 D0B3 ; # small gC8 D185 ; # small khC9 D0B8 ; # small iCA D0B9 ; # small jCB D0BA ; # small kCC D0BB ; # small lCD D0BC ; # small mCE D0BD ; # small nCF D0BE ; # small oD0 D0BF ; # small pD1 D18F ; # small yaD2 D180 ; # small rD3 D181 ; # small sD4 D182 ; # small tD5 D183 ; # small uD6 D0B6 ; # small zhD7 D0B2 ; # small vD8 D18C ; # small soft signD9 D18B ; # small yDA D0B7 ; # small zDB D188 ; # small shDC D18D ; # small eDD D189 ; # small shchDE D187 ; # small chDF D18A ; # small hard signE0 D0AE ; # capital YUE1 D090 ; # capital AE2 D091 ; # capital BE3 D0A6 ; # capital TSE4 D094 ; # capital DE5 D095 ; # capital YEE6 D0A4 ; # capital FE7 D093 ; # capital GE8 D0A5 ; # capital KHE9 D098 ; # capital IEA D099 ; # capital JEB D09A ; # capital KEC D09B ; # capital LED D09C ; # capital MEE D09D ; # capital NEF D09E ; # capital OF0 D09F ; # capital PF1 D0AF ; # capital YAF2 D0A0 ; # capital RF3 D0A1 ; # capital SF4 D0A2 ; # capital TF5 D0A3 ; # capital UF6 D096 ; # capital ZHF7 D092 ; # capital VF8 D0AC ; # capital soft signF9 D0AB ; # capital YFA D097 ; # capital ZFB D0A8 ; # capital SHFC D0AD ; # capital EFD D0A9 ; # capital SHCHFE D0A7 ; # capital CHFF D0AA ; # capital hard sign
}
解释
-
文件开头的注释:
# This map is not a full koi8-r <> utf8 map: it does not contain
:- 这不是一个完整的 KOI8-R 到 UTF-8 的映射。
# box-drawing and some other characters. Besides this map contains
:- 它不包含一些绘图字符和其他字符。
# several koi8-u and Byelorussian letters which are not in koi8-r.
:- 此映射包含一些 KOI8-U 和白俄罗斯字母,这些字母不在 KOI8-R 中。
# If you need a full and standard map, use contrib/unicode2nginx/koi-utf
:- 如果你需要一个完整的标准映射,请使用
contrib/unicode2nginx/koi-utf
。
- 如果你需要一个完整的标准映射,请使用
# map instead.
:- 代替。
-
charset_map koi8-r utf-8 {
:- 定义了一个字符集映射,使用
koi8-r
到utf-8
的转换。
- 定义了一个字符集映射,使用
-
具体映射:
- 每一行定义了一个 KOI8-R 字符和对应的 UTF-8 编码。例如:
80 E282AC ; # euro
:- KOI8-R 编码
80
对应 UTF-8 编码E282AC
,代表欧元符号(€)。
- KOI8-R 编码
C1 D0B0 ; # small a
:- KOI8-R 编码
C1
对应 UTF-8 编码D0B0
,代表小写字母a
。
- KOI8-R 编码
- 每一行定义了一个 KOI8-R 字符和对应的 UTF-8 编码。例如:
总结
koi-utf
文件定义了从 KOI8-R 字符集到 UTF-8 字符集的部分映射。这些映射用于将特定的 KOI8-R 字符转换为对应的 UTF-8 字符,以便在处理包含这些字符的文本时能正确显示。这对于支持多语言和特殊字符的应用程序非常重要。