newby
  
 
 
  
 
 
Group: Members 
Posts: 171 
Joined: June 2006 | 
  | 
Posted: Mar. 11 2008,14:04 | 
   | 
 
 
  
Damn Small Characters for Interlingual Interchange (DaSCII)
  -- a proposal --
  THE PROBLEM
  The majority languages spoken by over 50% of people worldwide are(in order): Chinese (962 million), English (322m), Spanish (266m), Russian (170m), Portuguese (170m), Japanese (125m), German (98m), Bengali (189m) & Hindi 182m).  Neither 7-bit nor 8-bit systems provide enough characters to directly cover these languages.
  THE SOLUTION
  By examining the transliteration systems for each of the languages, the number of unique characters can be drastically reduced.  The dravidian languages (Bengali, Hindi, et cetera) present the greatest difficulty due to a great number of diacritical and other marks.  Therefore:
  1. Start with the IBM 8-bit character set.
  2. Insert characters 177 - 250 from the Indian Script Code for Information Interchange.
  3. Add the following characters:
     158 --- (the Euro symbol)    166 - z (with a tail underneath, used in Arabic transliteration)    167 - t (with a tail underneath, used in Arabic transliteration)    169 - e rising tone (the other rising-tone characters are covered)    170 - a falling-rising tone    235 - e falling-rising tone    236 - i falling-rising tone    237 - o falling-rising tone    238 - u falling-falling tone    251 - s (with a tail underneath, used in Arabic transliteration)    252 - d (with a tail underneath, used in Arabic transliteration)    253 - o (with a shallow u-shaped mark above, used in Korean transliteration)    254 - u (with a shallow u-shaped mark above, used in Korean transliteration)
  USAGE
  1. Where a character is found in DaSCII, use it. 2. Where a character is not found in DaSCII, use transliterations, for example:
     German - Use dipthongs suggested in the BGN/PCGN 2000 Agreement    Russian - Use dipthongs suggested in the BGN/PCGN 1947 Agreement    Romanji - Use the umlauted character for the high tone.
  == This will allow transliterations of over 50% of human languages with _one_ font == 
 |