From: Peter Dyballa <Peter_Dyballa@Web.DE>
To: Tech Stuff <techstuff1971@yahoo.com>
Cc: "help-gnu-emacs@gnu.org" <help-gnu-emacs@gnu.org>
Subject: Re: File Encoding Issue on Windows
Date: Thu, 14 Mar 2013 00:58:19 +0100 [thread overview]
Message-ID: <C947B3A5-B97E-4CAB-A219-0E4BCD6B5B04@Web.DE> (raw)
In-Reply-To: <1363218487.89955.YahooMailNeo@web165002.mail.bf1.yahoo.com>
[-- Attachment #1: Type: text/plain, Size: 629 bytes --]
Am 14.03.2013 um 00:48 schrieb Tech Stuff:
> I'm willing to do anything short of actually gonig in and changing every occurrence individually as there are hundreds of them.
Here is a file with a comparison of CP1252 to UTF-8. If you want you can add a new column that does not show the HEX values but the actual characters that would used to represent the UTF-8 bytes.
All the missing code points are US-ASCII in both encodings (CP1252 and UTF-8). The bytes and their meanings are equal and the same in both.
--
Greetings
Pete
Behold the warranty … the bold print giveth and the fine print taketh away.
[-- Attachment #2: CP1252.txt --]
[-- Type: text/plain, Size: 8952 bytes --]
;;; -*- mode: Text; coding: windows-1252-unix; -*-
;
; Time-stamp: <2006-10-28 22:16:09 pete>
;
; ANSI Microsoft Windows Codepage
;
; oct dec hex UCS2 UTF-8
;=====================================
= 200 = 128 = 80 = U+20AC = E2 82 AC : EURO SIGN
= 201 = 129 = 81 = (UNDEFINED)
= 202 = 130 = 82 = U+201A = E2 80 9A : SINGLE LOW-9 QUOTATION MARK
= 203 = 131 = 83 = U+0192 = C6 92 : LATIN SMALL LETTER F WITH HOOK
= 204 = 132 = 84 = U+201E = E2 80 9E : DOUBLE LOW-9 QUOTATION MARK
= 205 = 133 = 85 = U+2026 = E2 80 A6 : HORIZONTAL ELLIPSIS
= 206 = 134 = 86 = U+2020 = E2 80 A0 : DAGGER
= 207 = 135 = 87 = U+2021 = E2 80 A1 : DOUBLE DAGGER
= 210 = 136 = 88 = U+005E = 5E : CIRCUMFLEX ACCENT
= 211 = 137 = 89 = U+2030 = E2 80 B0 : PER MILLE SIGN
= 212 = 138 = 8A = U+0160 = C5 A0 : LATIN CAPITAL LETTER S WITH CARON
= 213 = 139 = 8B = U+2039 = E2 80 B9 : SINGLE LEFT-POINTING ANGLE QUOTATION MARK
= 214 = 140 = 8C = U+0152 = C5 92 : LATIN CAPITAL LIGATURE OE
= 215 = 141 = 8D = (UNDEFINED)
= 216 = 142 = 8E = U+017D = C5 BD : LATIN CAPITAL LETTER Z WITH CARON
= 217 = 143 = 8F = (UNDEFINED)
= 220 = 144 = 90 = (UNDEFINED)
= 221 = 145 = 91 = U+2018 = E2 80 98 : LEFT SINGLE QUOTATION MARK
= 222 = 146 = 92 = U+2019 = E2 80 99 : RIGHT SINGLE QUOTATION MARK
= 223 = 147 = 93 = U+201C = E2 80 9C : LEFT DOUBLE QUOTATION MARK
= 224 = 148 = 94 = U+201D = E2 80 9D : RIGHT DOUBLE QUOTATION MARK
= 225 = 149 = 95 = U+2022 = E2 80 A2 : BULLET
= 226 = 150 = 96 = U+2013 = E2 80 92 : EN DASH
= 227 = 151 = 97 = U+2014 = E2 80 93 : EM DASH
= 230 = 152 = 98 = U+007E = 7E : TILDE
= 231 = 153 = 99 = U+2122 = E2 84 A2 : TRADEMARK SIGN
= 232 = 154 = 9A = U+0161 = C5 A1 : LATIN SMALL LETTER S WITH CARON
= 233 = 155 = 9B = U+203A = E2 80 BA : SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
= 234 = 156 = 9C = U+0153 = C5 93 : LATIN SMALL LIGATURE OE
= 235 = 157 = 9D = (UNDEFINED)
= 236 = 158 = 9E = U+017E = C5 BE : LATIN SMALL LETTER Z WITH CARON
= 237 = 159 = 9F = U+0178 = C5 B8 : LATIN CAPITAL LETTER Y WITH DIAERESIS
= 240 = 160 = A0 = U+00A0 = C2 A0 : NO-BREAK SPACE
¡ = 241 = 161 = A1 = U+00A1 = C2 A1 : INVERTED EXCLAMATION MARK
¢ = 242 = 162 = A2 = U+00A2 = C2 A2 : CENT SIGN
£ = 243 = 163 = A3 = U+00A3 = C2 A3 : POUND SIGN
¤ = 244 = 164 = A4 = U+00A4 = C2 A4 : CURRENCY SIGN
¥ = 245 = 165 = A5 = U+00A5 = C2 A5 : YEN SIGN
¦ = 246 = 166 = A6 = U+00A6 = C2 A6 : BROKEN BAR
§ = 247 = 167 = A7 = U+00A7 = C2 A7 : SECTION SIGN
¨ = 250 = 168 = A8 = U+00A8 = C2 A8 : DIAERESIS
© = 251 = 169 = A9 = U+00A9 = C2 A9 : COPYRIGHT SIGN
ª = 252 = 170 = AA = U+00AA = C2 AA : FEMININE ORDINAL INDICATOR
« = 253 = 171 = AB = U+00AB = C2 AB : LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
¬ = 254 = 172 = AC = U+00AC = C2 AC : NOT SIGN
= 255 = 173 = AD = U+00AD = C2 AD : HYPHEN-MINUS
® = 256 = 174 = AE = U+00AE = C2 AE : REGISTERED SIGN
¯ = 257 = 175 = AF = U+00AF = C2 AF : MACRON
° = 260 = 176 = B0 = U+00B0 = C2 B0 : DEGREE SIGN
± = 261 = 177 = B1 = U+00B1 = C2 B1 : PLUS-MINUS SIGN
² = 262 = 178 = B2 = U+00B2 = C2 B2 : SUPERSCRIPT TWO
³ = 263 = 179 = B3 = U+00B3 = C2 B3 : SUPERSCRIPT THREE
´ = 264 = 180 = B4 = U+00B4 = C2 B4 : ACUTE ACCENT
µ = 265 = 181 = B5 = U+00B5 = C2 B5 : MICRO SIGN
¶ = 266 = 182 = B6 = U+00B6 = C2 B6 : PILCROW SIGN
· = 267 = 183 = B7 = U+00B7 = C2 B7 : MIDDLE DOT
¸ = 270 = 184 = B8 = U+00B8 = C2 B8 : CEDILLA
¹ = 271 = 185 = B9 = U+00B9 = C2 B9 : SUPERSCRIPT ONE
º = 272 = 186 = BA = U+00BA = C2 BA : MASCULINE ORDINAL INDICATOR
» = 273 = 187 = BB = U+00BB = C2 BB : RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
¼ = 274 = 188 = BC = U+00BC = C2 BC : VULGAR FRACTION ONE QUARTER
½ = 275 = 189 = BD = U+00BD = C2 BD : VULGAR FRACTION ONE HALF
¾ = 276 = 190 = BE = U+00BE = C2 BE : VULGAR FRACTION THREE QUARTERS
¿ = 277 = 191 = BF = U+00BF = C2 BF : INVERTED QUESTION MARK
À = 300 = 192 = C0 = U+00C0 = C3 80 : LATIN CAPITAL LETTER A WITH GRAVE
Á = 301 = 193 = C1 = U+00C1 = C3 81 : LATIN CAPITAL LETTER A WITH ACUTE
 = 302 = 194 = C2 = U+00C2 = C3 82 : LATIN CAPITAL LETTER A WITH CIRCUMFLEX
à = 303 = 195 = C3 = U+00C3 = C3 83 : LATIN CAPITAL LETTER A WITH TILDE
Ä = 304 = 196 = C4 = U+00C4 = C3 84 : LATIN CAPITAL LETTER A WITH DIAERESIS
Å = 305 = 197 = C5 = U+00C5 = C3 85 : LATIN CAPITAL LETTER A WITH RING ABOVE
Æ = 306 = 198 = C6 = U+00C6 = C3 86 : LATIN CAPITAL LETTER AE
Ç = 307 = 199 = C7 = U+00C7 = C3 87 : LATIN CAPITAL LETTER C WITH CEDILLA
È = 310 = 200 = C8 = U+00C8 = C3 88 : LATIN CAPITAL LETTER E WITH GRAVE
É = 311 = 201 = C9 = U+00C9 = C3 89 : LATIN CAPITAL LETTER E WITH ACUTE
Ê = 312 = 202 = CA = U+00CA = C3 8A : LATIN CAPITAL LETTER E WITH CIRCUMFLEX
Ë = 313 = 203 = CB = U+00CB = C3 8B : LATIN CAPITAL LETTER E WITH DIAERESIS
Ì = 314 = 204 = CC = U+00CC = C3 8C : LATIN CAPITAL LETTER I WITH GRAVE
Í = 315 = 205 = CD = U+00CD = C3 8D : LATIN CAPITAL LETTER I WITH ACUTE
Î = 316 = 206 = CE = U+00CE = C3 8E : LATIN CAPITAL LETTER I WITH CIRCUMFLEX
Ï = 317 = 207 = CF = U+00CF = C3 8F : LATIN CAPITAL LETTER I WITH DIAERESIS
Ð = 320 = 208 = D0 = U+00D0 = C3 90 : LATIN CAPITAL LETTER ETH
Ñ = 321 = 209 = D1 = U+00D1 = C3 91 : LATIN CAPITAL LETTER N WITH TILDE
Ò = 322 = 210 = D2 = U+00D2 = C3 92 : LATIN CAPITAL LETTER O WITH GRAVE
Ó = 323 = 211 = D3 = U+00D3 = C3 93 : LATIN CAPITAL LETTER O WITH ACUTE
Ô = 324 = 212 = D4 = U+00D4 = C3 94 : LATIN CAPITAL LETTER O WITH CIRCUMFLEX
Õ = 325 = 213 = D5 = U+00D5 = C3 95 : LATIN CAPITAL LETTER O WITH TILDE
Ö = 326 = 214 = D6 = U+00D6 = C3 96 : LATIN CAPITAL LETTER O WITH DIAERESIS
× = 327 = 215 = D7 = U+00D7 = C3 97 : MULTIPLICATION SIGN
Ø = 330 = 216 = D8 = U+00D8 = C3 98 : LATIN CAPITAL LETTER O WITH STROKE
Ù = 331 = 217 = D9 = U+00D9 = C3 99 : LATIN CAPITAL LETTER U WITH GRAVE
Ú = 332 = 218 = DA = U+00DA = C3 9A : LATIN CAPITAL LETTER U WITH ACUTE
Û = 333 = 219 = DB = U+00DB = C3 9B : LATIN CAPITAL LETTER U WITH CIRCUMFLEX
Ü = 334 = 220 = DC = U+00DC = C3 9C : LATIN CAPITAL LETTER U WITH DIAERESIS
Ý = 335 = 221 = DD = U+00DD = C3 9D : LATIN CAPITAL LETTER Y WITH ACUTE
Þ = 336 = 222 = DE = U+00DE = C3 9E : LATIN CAPITAL LETTER THORN
ß = 337 = 223 = DF = U+00DF = C3 9F : LATIN SMALL LETTER SHARP S
à = 340 = 224 = E0 = U+00E0 = C3 A0 : LATIN SMALL LETTER A WITH GRAVE
á = 341 = 225 = E1 = U+00E1 = C3 A1 : LATIN SMALL LETTER A WITH ACUTE
â = 342 = 226 = E2 = U+00E2 = C3 A2 : LATIN SMALL LETTER A WITH CIRCUMFLEX
ã = 343 = 227 = E3 = U+00E3 = C3 A3 : LATIN SMALL LETTER A WITH TILDE
ä = 344 = 228 = E4 = U+00E4 = C3 A4 : LATIN SMALL LETTER A WITH DIAERESIS
å = 345 = 229 = E5 = U+00E5 = C3 A5 : LATIN SMALL LETTER A WITH RING ABOVE
æ = 346 = 230 = E6 = U+00E6 = C3 A6 : LATIN SMALL LETTER AE
ç = 347 = 231 = E7 = U+00E7 = C3 A7 : LATIN SMALL LETTER C WITH CEDILLA
è = 350 = 232 = E8 = U+00E8 = C3 A8 : LATIN SMALL LETTER E WITH GRAVE
é = 351 = 233 = E9 = U+00E9 = C3 A9 : LATIN SMALL LETTER E WITH ACUTE
ê = 352 = 234 = EA = U+00EA = C3 AA : LATIN SMALL LETTER E WITH CIRCUMFLEX
ë = 353 = 235 = EB = U+00EB = C3 AB : LATIN SMALL LETTER E WITH DIAERESIS
ì = 354 = 236 = EC = U+00EC = C3 AC : LATIN SMALL LETTER I WITH GRAVE
í = 355 = 237 = ED = U+00ED = C3 AD : LATIN SMALL LETTER I WITH ACUTE
î = 356 = 238 = EE = U+00EE = C3 AE : LATIN SMALL LETTER I WITH CIRCUMFLEX
ï = 357 = 239 = EF = U+00EF = C3 AF : LATIN SMALL LETTER I WITH DIAERESIS
ð = 360 = 240 = F0 = U+00F0 = C3 B0 : LATIN SMALL LETTER ETH
ñ = 361 = 241 = F1 = U+00F1 = C3 B1 : LATIN SMALL LETTER N WITH TILDE
ò = 362 = 242 = F2 = U+00F2 = C3 B2 : LATIN SMALL LETTER O WITH GRAVE
ó = 363 = 243 = F3 = U+00F3 = C3 B3 : LATIN SMALL LETTER O WITH ACUTE
ô = 364 = 244 = F4 = U+00F4 = C3 B4 : LATIN SMALL LETTER O WITH CIRCUMFLEX
õ = 365 = 245 = F5 = U+00F5 = C3 B5 : LATIN SMALL LETTER O WITH TILDE
ö = 366 = 246 = F6 = U+00F6 = C3 B6 : LATIN SMALL LETTER O WITH DIAERESIS
÷ = 367 = 247 = F7 = U+00F7 = C3 B7 : DIVISION SIGN
ø = 370 = 248 = F8 = U+00F8 = C3 B8 : LATIN SMALL LETTER O WITH STROKE
ù = 371 = 249 = F9 = U+00F9 = C3 B9 : LATIN SMALL LETTER U WITH GRAVE
ú = 372 = 250 = FA = U+00FA = C3 BA : LATIN SMALL LETTER U WITH ACUTE
û = 373 = 251 = FB = U+00FB = C3 BB : LATIN SMALL LETTER U WITH CIRCUMFLEX
ü = 374 = 252 = FC = U+00FC = C3 BC : LATIN SMALL LETTER U WITH DIAERESIS
ý = 375 = 253 = FD = U+00FD = C3 BD : LATIN SMALL LETTER Y WITH ACUTE
þ = 376 = 254 = FE = U+00FE = C3 BE : LATIN SMALL LETTER THORN
ÿ = 377 = 255 = FF = U+00FF = C3 BF : LATIN SMALL LETTER Y WITH DIAERESIS
next prev parent reply other threads:[~2013-03-13 23:58 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-03-12 3:08 File Encoding Issue on Windows Tech Stuff
2013-03-12 10:50 ` Peter Dyballa
2013-03-12 14:57 ` Tech Stuff
2013-03-12 16:32 ` W. Greenhouse
2013-03-13 17:44 ` Tech Stuff
2013-03-13 20:37 ` Peter Dyballa
2013-03-13 21:11 ` Tech Stuff
2013-03-13 22:16 ` Peter Dyballa
2013-03-13 23:26 ` Tech Stuff
2013-03-13 23:41 ` Peter Dyballa
2013-03-13 23:48 ` Tech Stuff
2013-03-13 23:58 ` Peter Dyballa [this message]
2013-03-14 0:38 ` Axel E. Retif
2013-03-14 2:24 ` Tech Stuff
2013-03-14 2:35 ` Tech Stuff
2013-03-14 2:59 ` Axel E. Retif
2013-03-14 4:23 ` Tech Stuff
2013-03-14 6:07 ` Axel E. Retif
2013-03-12 17:23 ` Peter Dyballa
[not found] <mailman.21917.1363080184.855.help-gnu-emacs@gnu.org>
2013-03-13 12:33 ` Phoenix Gris
2013-03-13 14:48 ` Peter Dyballa
2013-03-13 15:29 ` Filipp Gunbin
2013-03-13 17:16 ` Eli Zaretskii
2013-03-13 20:33 ` Stefan Monnier
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=C947B3A5-B97E-4CAB-A219-0E4BCD6B5B04@Web.DE \
--to=peter_dyballa@web.de \
--cc=help-gnu-emacs@gnu.org \
--cc=techstuff1971@yahoo.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
Code repositories for project(s) associated with this external index
https://git.savannah.gnu.org/cgit/emacs.git
https://git.savannah.gnu.org/cgit/emacs/org-mode.git
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.