17.8. What about Chinese characters?

Chinese characters (“han ⁴ zi ⁴” in Chinese, kanji in Japanese) represent an entirely different approach to writing from alphabets or syllabaries. (A syllabary, such as Japanese hiragana or Amharic writing, has one lerfu for each syllable of the spoken language.) Very roughly, Chinese characters represent single elements of meaning; also very roughly, they represent single syllables of spoken Chinese. There is in principle no limit to the number of Chinese characters that can exist, and many thousands are in regular use.

It is hopeless for Lojban, with its limited lerfu and shift words, to create an alphabet which will match this diversity. However, there are various possible ways around the problem.

First, both Chinese and Japanese have standard Latin-alphabet representations, known as “pinyin” for Chinese and “romaji” for Japanese, and these can be used. Thus, the word “han⁴zi⁴” is conventionally written with two characters, but it may be spelled out as:

Example 17.19.

.y'y.bu	.abu	ny.	vo	zy.	.ibu	vo
h	a	n	4	z	i	4

The cmavo vo is the Lojban digit “4”. It is grammatical to intersperse digits (of selma'o PA) into a string of lerfu words; as long as the first cmavo is a lerfu word, the whole will be interpreted as a string of lerfu words. In Chinese, the digits can be used to represent tones. Pinyin is more usually written using accent marks, the mechanism for which was explained in Section 17.6.

The Japanese company named “Mitsubishi” in English is spelled the same way in romaji, and could be spelled out in Lojban thus:

Example 17.20.

my.	.ibu	ty.	sy.	.ubu	by.	.ibu	sy.	.y'y.bu	.ibu
m	i	t	s	u	b	i	s	h	i

Alternatively, a really ambitious Lojbanist could assign lerfu words to the individual strokes used to write Chinese characters (there are about seven or eight of them if you are a flexible human being, or about 40 if you are a rigid computer program), and then represent each character with a tei, the stroke lerfu words in the order of writing (which is standardized for each character), and a foi. No one has as yet attempted this project.