From: Peter Dyballa <Peter_Dyballa@Web.DE>
To: Tech Stuff <techstuff1971@yahoo.com>
Cc: "help-gnu-emacs@gnu.org" <help-gnu-emacs@gnu.org>
Subject: Re: File Encoding Issue on Windows
Date: Thu, 14 Mar 2013 00:58:19 +0100
Message-ID: <C947B3A5-B97E-4CAB-A219-0E4BCD6B5B04@Web.DE>
In-Reply-To: <1363218487.89955.YahooMailNeo@web165002.mail.bf1.yahoo.com>

[-- Attachment #1: Type: text/plain, Size: 629 bytes --]


On 14.03.2013 at 00:48, Tech Stuff wrote:

> I'm willing to do anything short of actually going in and changing every occurrence individually as there are hundreds of them.

Here is a file comparing CP1252 to UTF-8. If you want, you can add a new column that shows, instead of the hex values, the actual characters that would be used to represent the UTF-8 bytes.

Every code point not listed in the file (bytes 00 through 7F) is US-ASCII in both CP1252 and UTF-8: the same byte carries the same meaning in either encoding.
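
A minimal sketch of how that extra column could be computed, assuming it is meant to show what a UTF-8 byte sequence looks like when it is misread as CP1252. The function name utf8_as_cp1252 is made up for this illustration, and Python is used only because it ships both codecs:

```python
def utf8_as_cp1252(ch: str) -> str:
    """Return the characters shown when the UTF-8 bytes of `ch` are read as CP1252."""
    # errors="replace" substitutes U+FFFD for the five bytes that CP1252
    # leaves undefined (81, 8D, 8F, 90, 9D) instead of raising an error.
    return ch.encode("utf-8").decode("cp1252", errors="replace")


# Print character, UTF-8 bytes in hex, and the CP1252 misreading.
for ch in "€…é":
    print(ch, "=", ch.encode("utf-8").hex(" ").upper(), "=", utf8_as_cp1252(ch))

# Bytes 00..7F decode to the same character in both encodings.
assert all(bytes([b]).decode("cp1252") == bytes([b]).decode("utf-8")
           for b in range(0x80))
```

Running the conversion in the opposite direction (encode the garbled text as CP1252, decode the result as UTF-8) is one way to repair such text, provided none of the undefined bytes were involved.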

--
Greetings

  Pete

Behold the warranty … the bold print giveth and the fine print taketh away.


[-- Attachment #2: CP1252.txt --]
[-- Type: text/plain, Size: 8952 bytes --]

;;; -*- mode: Text; coding: windows-1252-unix; -*-
;
;	Time-stamp: <2006-10-28 22:16:09 pete>
;
;   ANSI Microsoft Windows Codepage
;
;   oct   dec   hex    UCS2    UTF-8
;=====================================
€ = 200 = 128 = 80 = U+20AC = E2 82 AC : EURO SIGN
  = 201 = 129 = 81 = 	      	       	 (UNDEFINED)
‚ = 202 = 130 = 82 = U+201A = E2 80 9A : SINGLE LOW-9 QUOTATION MARK
ƒ = 203 = 131 = 83 = U+0192 =    C6 92 : LATIN SMALL LETTER F WITH HOOK
„ = 204 = 132 = 84 = U+201E = E2 80 9E : DOUBLE LOW-9 QUOTATION MARK
… = 205 = 133 = 85 = U+2026 = E2 80 A6 : HORIZONTAL ELLIPSIS
† = 206 = 134 = 86 = U+2020 = E2 80 A0 : DAGGER
‡ = 207 = 135 = 87 = U+2021 = E2 80 A1 : DOUBLE DAGGER
ˆ = 210 = 136 = 88 = U+02C6 =    CB 86 : MODIFIER LETTER CIRCUMFLEX ACCENT
‰ = 211 = 137 = 89 = U+2030 = E2 80 B0 : PER MILLE SIGN
Š = 212 = 138 = 8A = U+0160 =    C5 A0 : LATIN CAPITAL LETTER S WITH CARON
‹ = 213 = 139 = 8B = U+2039 = E2 80 B9 : SINGLE LEFT-POINTING ANGLE QUOTATION MARK
Œ = 214 = 140 = 8C = U+0152 =    C5 92 : LATIN CAPITAL LIGATURE OE
  = 215 = 141 = 8D = 	    	       	 (UNDEFINED)
Ž = 216 = 142 = 8E = U+017D =    C5 BD : LATIN CAPITAL LETTER Z WITH CARON
  = 217 = 143 = 8F = 	    	       	 (UNDEFINED)
  = 220 = 144 = 90 = 	    	       	 (UNDEFINED)
‘ = 221 = 145 = 91 = U+2018 = E2 80 98 : LEFT SINGLE QUOTATION MARK
’ = 222 = 146 = 92 = U+2019 = E2 80 99 : RIGHT SINGLE QUOTATION MARK
“ = 223 = 147 = 93 = U+201C = E2 80 9C : LEFT DOUBLE QUOTATION MARK
” = 224 = 148 = 94 = U+201D = E2 80 9D : RIGHT DOUBLE QUOTATION MARK
• = 225 = 149 = 95 = U+2022 = E2 80 A2 : BULLET
– = 226 = 150 = 96 = U+2013 = E2 80 93 : EN DASH
— = 227 = 151 = 97 = U+2014 = E2 80 94 : EM DASH
˜ = 230 = 152 = 98 = U+02DC =    CB 9C : SMALL TILDE
™ = 231 = 153 = 99 = U+2122 = E2 84 A2 : TRADE MARK SIGN
š = 232 = 154 = 9A = U+0161 =    C5 A1 : LATIN SMALL LETTER S WITH CARON
› = 233 = 155 = 9B = U+203A = E2 80 BA : SINGLE RIGHT-POINTING ANGLE QUOTATION MARK
œ = 234 = 156 = 9C = U+0153 =    C5 93 : LATIN SMALL LIGATURE OE
  = 235 = 157 = 9D = 	    	       	 (UNDEFINED)
ž = 236 = 158 = 9E = U+017E =    C5 BE : LATIN SMALL LETTER Z WITH CARON
Ÿ = 237 = 159 = 9F = U+0178 =    C5 B8 : LATIN CAPITAL LETTER Y WITH DIAERESIS
  = 240 = 160 = A0 = U+00A0 =    C2 A0 : NO-BREAK SPACE
¡ = 241 = 161 = A1 = U+00A1 =    C2 A1 : INVERTED EXCLAMATION MARK
¢ = 242 = 162 = A2 = U+00A2 =    C2 A2 : CENT SIGN
£ = 243 = 163 = A3 = U+00A3 =    C2 A3 : POUND SIGN
¤ = 244 = 164 = A4 = U+00A4 =    C2 A4 : CURRENCY SIGN
¥ = 245 = 165 = A5 = U+00A5 =    C2 A5 : YEN SIGN
¦ = 246 = 166 = A6 = U+00A6 =    C2 A6 : BROKEN BAR
§ = 247 = 167 = A7 = U+00A7 =    C2 A7 : SECTION SIGN
¨ = 250 = 168 = A8 = U+00A8 =    C2 A8 : DIAERESIS
© = 251 = 169 = A9 = U+00A9 =    C2 A9 : COPYRIGHT SIGN
ª = 252 = 170 = AA = U+00AA =    C2 AA : FEMININE ORDINAL INDICATOR
« = 253 = 171 = AB = U+00AB =    C2 AB : LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
¬ = 254 = 172 = AC = U+00AC =    C2 AC : NOT SIGN
­ = 255 = 173 = AD = U+00AD =    C2 AD : SOFT HYPHEN
® = 256 = 174 = AE = U+00AE =    C2 AE : REGISTERED SIGN
¯ = 257 = 175 = AF = U+00AF =    C2 AF : MACRON
° = 260 = 176 = B0 = U+00B0 =    C2 B0 : DEGREE SIGN
± = 261 = 177 = B1 = U+00B1 =    C2 B1 : PLUS-MINUS SIGN
² = 262 = 178 = B2 = U+00B2 =    C2 B2 : SUPERSCRIPT TWO
³ = 263 = 179 = B3 = U+00B3 =    C2 B3 : SUPERSCRIPT THREE
´ = 264 = 180 = B4 = U+00B4 =    C2 B4 : ACUTE ACCENT
µ = 265 = 181 = B5 = U+00B5 =    C2 B5 : MICRO SIGN
¶ = 266 = 182 = B6 = U+00B6 =    C2 B6 : PILCROW SIGN
· = 267 = 183 = B7 = U+00B7 =    C2 B7 : MIDDLE DOT
¸ = 270 = 184 = B8 = U+00B8 =    C2 B8 : CEDILLA
¹ = 271 = 185 = B9 = U+00B9 =    C2 B9 : SUPERSCRIPT ONE
º = 272 = 186 = BA = U+00BA =    C2 BA : MASCULINE ORDINAL INDICATOR
» = 273 = 187 = BB = U+00BB =    C2 BB : RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
¼ = 274 = 188 = BC = U+00BC =    C2 BC : VULGAR FRACTION ONE QUARTER
½ = 275 = 189 = BD = U+00BD =    C2 BD : VULGAR FRACTION ONE HALF
¾ = 276 = 190 = BE = U+00BE =    C2 BE : VULGAR FRACTION THREE QUARTERS
¿ = 277 = 191 = BF = U+00BF =    C2 BF : INVERTED QUESTION MARK
À = 300 = 192 = C0 = U+00C0 =    C3 80 : LATIN CAPITAL LETTER A WITH GRAVE
Á = 301 = 193 = C1 = U+00C1 =    C3 81 : LATIN CAPITAL LETTER A WITH ACUTE
Â = 302 = 194 = C2 = U+00C2 =    C3 82 : LATIN CAPITAL LETTER A WITH CIRCUMFLEX
à = 303 = 195 = C3 = U+00C3 =    C3 83 : LATIN CAPITAL LETTER A WITH TILDE
Ä = 304 = 196 = C4 = U+00C4 =    C3 84 : LATIN CAPITAL LETTER A WITH DIAERESIS
Å = 305 = 197 = C5 = U+00C5 =    C3 85 : LATIN CAPITAL LETTER A WITH RING ABOVE
Æ = 306 = 198 = C6 = U+00C6 =    C3 86 : LATIN CAPITAL LETTER AE
Ç = 307 = 199 = C7 = U+00C7 =    C3 87 : LATIN CAPITAL LETTER C WITH CEDILLA
È = 310 = 200 = C8 = U+00C8 =    C3 88 : LATIN CAPITAL LETTER E WITH GRAVE
É = 311 = 201 = C9 = U+00C9 =    C3 89 : LATIN CAPITAL LETTER E WITH ACUTE
Ê = 312 = 202 = CA = U+00CA =    C3 8A : LATIN CAPITAL LETTER E WITH CIRCUMFLEX
Ë = 313 = 203 = CB = U+00CB =    C3 8B : LATIN CAPITAL LETTER E WITH DIAERESIS
Ì = 314 = 204 = CC = U+00CC =    C3 8C : LATIN CAPITAL LETTER I WITH GRAVE
Í = 315 = 205 = CD = U+00CD =    C3 8D : LATIN CAPITAL LETTER I WITH ACUTE
Î = 316 = 206 = CE = U+00CE =    C3 8E : LATIN CAPITAL LETTER I WITH CIRCUMFLEX
Ï = 317 = 207 = CF = U+00CF =    C3 8F : LATIN CAPITAL LETTER I WITH DIAERESIS
Ð = 320 = 208 = D0 = U+00D0 =    C3 90 : LATIN CAPITAL LETTER ETH
Ñ = 321 = 209 = D1 = U+00D1 =    C3 91 : LATIN CAPITAL LETTER N WITH TILDE
Ò = 322 = 210 = D2 = U+00D2 =    C3 92 : LATIN CAPITAL LETTER O WITH GRAVE
Ó = 323 = 211 = D3 = U+00D3 =    C3 93 : LATIN CAPITAL LETTER O WITH ACUTE
Ô = 324 = 212 = D4 = U+00D4 =    C3 94 : LATIN CAPITAL LETTER O WITH CIRCUMFLEX
Õ = 325 = 213 = D5 = U+00D5 =    C3 95 : LATIN CAPITAL LETTER O WITH TILDE
Ö = 326 = 214 = D6 = U+00D6 =    C3 96 : LATIN CAPITAL LETTER O WITH DIAERESIS
× = 327 = 215 = D7 = U+00D7 =    C3 97 : MULTIPLICATION SIGN
Ø = 330 = 216 = D8 = U+00D8 =    C3 98 : LATIN CAPITAL LETTER O WITH STROKE
Ù = 331 = 217 = D9 = U+00D9 =    C3 99 : LATIN CAPITAL LETTER U WITH GRAVE
Ú = 332 = 218 = DA = U+00DA =    C3 9A : LATIN CAPITAL LETTER U WITH ACUTE
Û = 333 = 219 = DB = U+00DB =    C3 9B : LATIN CAPITAL LETTER U WITH CIRCUMFLEX
Ü = 334 = 220 = DC = U+00DC =    C3 9C : LATIN CAPITAL LETTER U WITH DIAERESIS
Ý = 335 = 221 = DD = U+00DD =    C3 9D : LATIN CAPITAL LETTER Y WITH ACUTE
Þ = 336 = 222 = DE = U+00DE =    C3 9E : LATIN CAPITAL LETTER THORN
ß = 337 = 223 = DF = U+00DF =    C3 9F : LATIN SMALL LETTER SHARP S
à = 340 = 224 = E0 = U+00E0 =    C3 A0 : LATIN SMALL LETTER A WITH GRAVE
á = 341 = 225 = E1 = U+00E1 =    C3 A1 : LATIN SMALL LETTER A WITH ACUTE
â = 342 = 226 = E2 = U+00E2 =    C3 A2 : LATIN SMALL LETTER A WITH CIRCUMFLEX
ã = 343 = 227 = E3 = U+00E3 =    C3 A3 : LATIN SMALL LETTER A WITH TILDE
ä = 344 = 228 = E4 = U+00E4 =    C3 A4 : LATIN SMALL LETTER A WITH DIAERESIS
å = 345 = 229 = E5 = U+00E5 =    C3 A5 : LATIN SMALL LETTER A WITH RING ABOVE
æ = 346 = 230 = E6 = U+00E6 =    C3 A6 : LATIN SMALL LETTER AE
ç = 347 = 231 = E7 = U+00E7 =    C3 A7 : LATIN SMALL LETTER C WITH CEDILLA
è = 350 = 232 = E8 = U+00E8 =    C3 A8 : LATIN SMALL LETTER E WITH GRAVE
é = 351 = 233 = E9 = U+00E9 =    C3 A9 : LATIN SMALL LETTER E WITH ACUTE
ê = 352 = 234 = EA = U+00EA =    C3 AA : LATIN SMALL LETTER E WITH CIRCUMFLEX
ë = 353 = 235 = EB = U+00EB =    C3 AB : LATIN SMALL LETTER E WITH DIAERESIS
ì = 354 = 236 = EC = U+00EC =    C3 AC : LATIN SMALL LETTER I WITH GRAVE
í = 355 = 237 = ED = U+00ED =    C3 AD : LATIN SMALL LETTER I WITH ACUTE
î = 356 = 238 = EE = U+00EE =    C3 AE : LATIN SMALL LETTER I WITH CIRCUMFLEX
ï = 357 = 239 = EF = U+00EF =    C3 AF : LATIN SMALL LETTER I WITH DIAERESIS
ð = 360 = 240 = F0 = U+00F0 =    C3 B0 : LATIN SMALL LETTER ETH
ñ = 361 = 241 = F1 = U+00F1 =    C3 B1 : LATIN SMALL LETTER N WITH TILDE
ò = 362 = 242 = F2 = U+00F2 =    C3 B2 : LATIN SMALL LETTER O WITH GRAVE
ó = 363 = 243 = F3 = U+00F3 =    C3 B3 : LATIN SMALL LETTER O WITH ACUTE
ô = 364 = 244 = F4 = U+00F4 =    C3 B4 : LATIN SMALL LETTER O WITH CIRCUMFLEX
õ = 365 = 245 = F5 = U+00F5 =    C3 B5 : LATIN SMALL LETTER O WITH TILDE
ö = 366 = 246 = F6 = U+00F6 =    C3 B6 : LATIN SMALL LETTER O WITH DIAERESIS
÷ = 367 = 247 = F7 = U+00F7 =    C3 B7 : DIVISION SIGN
ø = 370 = 248 = F8 = U+00F8 =    C3 B8 : LATIN SMALL LETTER O WITH STROKE
ù = 371 = 249 = F9 = U+00F9 =    C3 B9 : LATIN SMALL LETTER U WITH GRAVE
ú = 372 = 250 = FA = U+00FA =    C3 BA : LATIN SMALL LETTER U WITH ACUTE
û = 373 = 251 = FB = U+00FB =    C3 BB : LATIN SMALL LETTER U WITH CIRCUMFLEX
ü = 374 = 252 = FC = U+00FC =    C3 BC : LATIN SMALL LETTER U WITH DIAERESIS
ý = 375 = 253 = FD = U+00FD =    C3 BD : LATIN SMALL LETTER Y WITH ACUTE
þ = 376 = 254 = FE = U+00FE =    C3 BE : LATIN SMALL LETTER THORN
ÿ = 377 = 255 = FF = U+00FF =    C3 BF : LATIN SMALL LETTER Y WITH DIAERESIS
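
The rows of this table can be cross-checked mechanically. The following is a sketch only, not how the file was originally produced: it rebuilds each row from Python's cp1252 codec and the unicodedata module, and the helper name table_row is hypothetical.

```python
import unicodedata


def table_row(byte: int) -> str:
    """Format one CP1252 byte as: char = oct = dec = hex = UCS2 = UTF-8 : NAME."""
    try:
        ch = bytes([byte]).decode("cp1252")
    except UnicodeDecodeError:
        # Python's cp1252 codec rejects the bytes the code page leaves undefined.
        return f"  = {byte:03o} = {byte:3d} = {byte:02X} = (UNDEFINED)"
    utf8 = ch.encode("utf-8").hex(" ").upper()
    name = unicodedata.name(ch, "(UNNAMED)")
    # Right-align the UTF-8 bytes to the width of a three-byte sequence.
    return (f"{ch} = {byte:03o} = {byte:3d} = {byte:02X} = "
            f"U+{ord(ch):04X} = {utf8:>8} : {name}")


for byte in range(0x80, 0x100):
    print(table_row(byte))
```

Comparing this output against the table with a diff tool points at any row whose code point, UTF-8 bytes, or character name disagrees with the codec.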
