-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathlatin12utf8map.sed
106 lines (105 loc) · 1.7 KB
/
latin12utf8map.sed
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
# Latin1 incompatibility
# While ISO-8859-1 (the widespread Latin1 that is HTML's historical default
# charset) is a code subset of Unicode, ISO-8859-1's 8bit encoding scheme is no
# code subset of UTF-8. Latin1 letters look quite different when transformed into
# UTF-8. Non-ASCII characters in UTF-8 output look illegible on Latin-1 terminals.
# The many existing and unlabeled Latin1 texts are no legal UTF-8 input. Contrary
# to UTF-8, SCSU, JAVA and HTML allow Latin1 text to pass through transparently
# without being limited to Latin1.
# Source: http://www.czyborra.com/utf/
s/Â//g
s/¡/¡/g
s/¢/¢/g
s/£/£/g
s/¤/¤/g
s/Â¥/¥/g
s/¦/¦/g
s/§/§/g
s/¨/¨/g
s/©/©/g
s/ª/ª/g
s/«/«/g
s/¬/¬/g
s/Â//g
s/®/®/g
s/¯/¯/g
s/°/°/g
s/±/±/g
s/²/²/g
s/³/³/g
s/´/´/g
s/µ/µ/g
s/¶/¶/g
s/·/·/g
s/¸/¸/g
s/¹/¹/g
s/º/º/g
s/»/»/g
s/¼/¼/g
s/½/½/g
s/¾/¾/g
s/¿/¿/g
s/À/À/g
s/Ã /Á/g
s/Â/Â/g
s/Ã/Ã/g
s/Ä/Ä/g
s/Ã…/Å/g
s/Æ/Æ/g
s/Ç/Ç/g
s/È/È/g
s/É/É/g
s/Ê/Ê/g
s/Ë/Ë/g
s/ÃŒ/Ì/g
s/Ã /Í/g
s/ÃŽ/Î/g
s/Ã /Ï/g
s/Ã /Ð/g
s/Ñ/Ñ/g
s/Ã’/Ò/g
s/Ó/Ó/g
s/Ô/Ô/g
s/Õ/Õ/g
s/Ö/Ö/g
s/×/×/g
s/Ø/Ø/g
s/Ù/Ù/g
s/Ú/Ú/g
s/Û/Û/g
s/Ãœ/Ü/g
s/Ã /Ý/g
s/Þ/Þ/g
s/ß/ß/g
s/Ã /à/g
s/á/á/g
s/â/â/g
s/ã/ã/g
s/ä/ä/g
s/Ã¥/å/g
s/æ/æ/g
s/ç/ç/g
s/è/è/g
s/é/é/g
s/ê/ê/g
s/ë/ë/g
s/ì/ì/g
s/Ã/í/g
s/î/î/g
s/ï/ï/g
s/ð/ð/g
s/ñ/ñ/g
s/ò/ò/g
s/ó/ó/g
s/ô/ô/g
s/õ/õ/g
s/ö/ö/g
s/÷/÷/g
s/ø/ø/g
s/ù/ù/g
s/ú/ú/g
s/û/û/g
s/ü/ü/g
s/ý/ý/g
s/þ/þ/g
s/ÿ/ÿ/g