Hello Joe,
You know, I've been 'somewhat' following this thread and am curious
about the 'why' 8 bit (vs 7 bit) is such an issue.
Because it can give problems, in mailreaders, and if the NodeList would
carry characters with the 8 bit set, soms compiling software may go wrong.
I get the one byte vs two bytes (characters) for unicode,
just not the 8 bit vs 7 bit of a single byte (ie: high ascii).
I'll show you, read on.
I also get what high ascii offers but since code pages are all about presentation and the nodelist is simply data, it would seem to me that
the issue would be with the readers and or editors not the data. Dunno.
Many mailers need a compiled NodeList, so that compile process could go wrong when 8 bit characters are presented, hence it is still only 7 bit.
Now just so you know, I've written my own mailer, nodelist compiler/diff processor, echomail tosser etc, so I'm all too familiar with the
nodelist and FTSC specs in general, but I frankly never gave the whole
8/7 bit issue a second thought.
So if someone would enlighten me, I'd appreciate it. Please
understand, this is simply a request for info with the main goal of ensuring my stuff continues to work like it's supposed to... Not to try
and start something!
For the completeness here a partial list of characters above 128:
Special signs in Acorn Archimedes/AxThousend/RiscPC in RISC OS 3.xx
and higher,
or the IyonixPC, A9home, BeagleBoard xM, Panda Board, Raspberry Pi computers all with ISO 8859/1 Latin 1 character set/system font in RISC OS,
in conjunction to M$-DOS/Windows with CP 437.
To get by pressing Alt together with a digit
at the numerical keypad:
Acorn H Binairy | MSDos/Windows H Binairy
ISO 8859/1 e bbbbbbbb | IBM CodePage e bbbbbbbb
description Latin 1 x 76543210 | 437 USA/UK x 76543210
|
a\ a accent grave = Alt 224 E0 11100000 | = Alt 133 85 10000101 a/ a accent aigu = Alt 225 E1 11100001 | = Alt 160 A0 10100000 a^ a accent circonflexe = Alt 226 E2 11100010 | = Alt 131 83 10000011 a~ a tilde = Alt 227 E3 11100011 | =
a" a trema/umlaut = Alt 228 E4 11100100 | = Alt 132 84 10000100 ao a with o = Alt 229 E5 11100101 | = Alt 134 86 10000110 ae a with e = Alt 230 E6 11100110 | = Alt 145 91 10010001 c, c with cedille = Alt 231 E7 11100111 | = Alt 135 87 10000111 e\ e accent grave = Alt 232 E8 11101000 | = Alt 138 8A 10001010 e/ e accent aigu = Alt 233 E9 11101001 | = Alt 130 82 10000010 e^ e accent circonflexe = Alt 234 EA 11101010 | = Alt 136 88 10001000 e" e trema/umlaut = Alt 235 EB 11101011 | = Alt 137 89 10001001 i\ i accent grave = Alt 236 EC 11101100 | = Alt 141 8D 10001101 i/ i accent aigu = Alt 237 ED 11101101 | = Alt 161 A1 10100001 i^ i accent circonflexe = Alt 238 EE 11101110 | = Alt 140 8C 10001100 i" i trema/umlaut = Alt 239 EF 11101111 | = Alt 139 8B 10001011 n~ n tilde = Alt 241 F1 11110001 | = Alt 164 A4 10100100 o\ o accent grave = Alt 242 F2 11110010 | = Alt 149 95 10010101 o/ o accent aigu = Alt 243 F3 11110011 | = Alt 162 A2 10100010 o^ o accent circonflexe = Alt 244 F4 11110100 | = Alt 147 93 10010011 o~ o tilde = Alt 245 F5 11110101 | =
o" o trema/umlaut = Alt 246 F6 11110110 | = Alt 148 94 10010100 o/ o with slash = Alt 248 F8 11111000 | = Alt 237 ED 11101101 u\ u accent grave = Alt 249 F9 11111001 | = Alt 151 97 10010111 u/ u accent aigu = Alt 250 FA 11111010 | = Alt 163 A3 10100011 u^ u accent circonflexe = Alt 251 FB 11111011 | = Alt 150 96 10010110 u" u trema/umlaut = Alt 252 FC 11111100 | = Alt 129 81 10000001
This table is still incomplete, and only for personal reference.
See: Resources:$.Apps.!Chars from the RISC OS IconBar.
I have not the energy to also make such a table for CodePage 850 (International), or others.
Bjorn Felten and Michiel van der Vlist know everthing about CP850.
Greetings from: Henri Derksen, SysOp UniCornBBS.demon.nl.
---
* Origin: Computing Apart Together (2:280/1208)