Writing systems: Difference between revisions

From OLPC
Jump to navigation Jump to search
m (Minor edit)
(del dead scripts)
 
(18 intermediate revisions by 4 users not shown)
Line 1: Line 1:
We need to support all of the writing systems (alphabets, syllabaries, logographs) used for major languages of countries that join the OLPC project. Support includes
We need to support all of the writing systems (alphabets, syllabaries, logographs) used for major languages of countries that join the OLPC project. Full support includes


* Fonts
* [[Fonts]]
* Locales
* [[Locales]]
* [[Keyboard layouts]] and other [[input methods]]
* Keyboards and Input Methods
* Renderinng for screen and printer
* [[Rendering engines|Rendering]] for screen and printer
* Text-to Speech
* Text-to-Speech
* OCR
* OCR

Several countries have expressed interest in OLPC. Here are their basic requirements for writing systems and [[languages]]:

* India -- 10 official writing systems, 20 official languages
* China -- [[Simplified Chinese]], [[Mongolian]], [[Tibetan]], and others
* Egypt -- [[Arabic]]
* Nigeria -- [[Hausa]] (Arabic), [[Igbo]] (Extended Latin), [[Yoruba]] (Extended Latin)
* Brazil -- [[Portuguese]]
* Argentina -- [[Spanish]]
* Thailand -- [[Thai]]

In every one of these countries there are dozens to hundreds of minority languages.


The following links give information on the topics listed above for a particular writing system, to the extent that contributors to the project have found them. If you know of other issues or resources, please add them on the appropriate pages.
The following links give information on the topics listed above for a particular writing system, to the extent that contributors to the project have found them. If you know of other issues or resources, please add them on the appropriate pages.
Line 24: Line 12:
[[Extended Latin]] -- Western European languages and more than a thousand others
[[Extended Latin]] -- Western European languages and more than a thousand others


[[Arabic]] -- Arabic, [[Farsi]] (Iranian), Dari, Pashto, Urdu, Sindhi, Hausa and others. A number of languages formerly written in Arabic are now written in the Latin alphabet. This includes Turkish, Swahili, Bahasa Melayu/Bahasa Indonesya, and others.
[[Arabic]] -- [[Arabic]], [[Farsi]] (Iranian), Dari, Pashto, [[Urdu]], [[Sindhi]], [[Hausa]] and others. A number of languages formerly written in [[Arabic alphabet]] are now written in the [[Latin alphabet]]. This includes Turkish, [[Swahili]], Bahasa Melayu/Bahasa Indonesya, and others.


[[Hebrew]] -- Yiddish, Ladino, etc.
[[Hebrew]] -- [[Yiddish]], Ladino, etc.


[[Cyrillic]] (Кирилица) -- Russian, Bulgarian, Serbian, Macedonian, Ukrainian, Kazakh, and other Turkic languages, etc. More than 200 other languages were written in Cyrillic under Soviet rule.
[[Cyrillic]] (Кирилица) -- [[Russian]], Bulgarian, Serbian, Macedonian, Ukrainian, Belarusian, etc. More than 200 other languages have been written with [[Cyrillic alphabet]]. Even when a country offcially switches to the [[Latin alphabet]], there is still a large body of literature which is only available in Cyrillic. In particular, many Turkic languages use a form of Cyrillic with additional letters which may not be well supported by off-the-shelf systems.


[[Greek]] -- and [[Coptic]]
[[Greek]] -- and [[Coptic]]
Line 38: Line 26:
[[Syriac]] -- Liturgical language
[[Syriac]] -- Liturgical language


[[Ethiopic]] -- For Languages of Ethiopia using Ethiopic alphabet. Amharic, Tigrigna, Oromo, Gurage, etc
[[Ethiopic]] -- For Languages of Ethiopia using Ethiopic alphabet. [[Amharic]], Tigrigna, Oromo, Gurage, etc.


[[Thaana]] -- for the Dhivehi language of the Republic of Maldives
[[Thaana]] -- for the Dhivehi language of the Republic of Maldives


[[Devanagari]] -- Hindi, Marathi, Nepali etc.
[[Devanagari]] -- [[Hindi]], [[Marathi]], [[Nepali]] etc.


[[Bengali]]
[[Bengali]]
Line 70: Line 58:
Lao [[ພາສາລາວ]]
Lao [[ພາສາລາວ]]


[[Khmer]] (Cambodia)
[[Khmer]] ([[Cambodia]])


[[Tibetan]] -- Tibetan, Dzongkha (Bhutan)
[[Tibetan]] -- Tibetan, Dzongkha (Bhutan)
Line 88: Line 76:
[[Canadian Aboriginal Syllabics]] -- Native American languages of Canada and the Northern USA, including Inuktitut
[[Canadian Aboriginal Syllabics]] -- Native American languages of Canada and the Northern USA, including Inuktitut


[[Yi]] -- Minority language in China
[http://en.wikipedia.org/wiki/Yi Yi] -- Minority language in China


[[Braille]] -- for the blind, in many languages
[[Braille]] -- for the blind, in many languages


[http://en.wikipedia.org/wiki/Limbu Limbu]
[[Limbu]]


[http://en.wikipedia.org/wiki/Tai_le Tai Le] Language spoken by the Dai people in China
[[Tai Le]]


[[Philippine Scripts]] -- obsolete
Philippine Scripts -- Buhid, Hanunoo, Tagbanya. Obsolete, replaced by Latin.


[http://en.wikipedia.org/wiki/Osmanya Osmanya] -- [[Somali]]
[[Osmanya]]


[[Math]]
[[Math]]
Line 104: Line 92:
[[APL]] -- A Programming Language
[[APL]] -- A Programming Language


Linux systems now routinely come with support for 25 or more of these writing systems, and there are free tools for making [[keyboard layouts]] for any [[languages]] and writing systems. There are also large [[Unicode]] [[fonts]] such as [http://home.att.net/~jameskass/code2000_page.htm Code2000] with the characters for even more writing systems.
[[Dead and Artificial Scripts]]

Linux systems now routinely come with support for 20 or more of these writing systems, and there are free tools for making keyboard layouts for any language and writing system. There are also large [[Unicode]] [[fonts]] such as [http://home.att.net/~jameskass/code2000_page.htm Code2000] with the characters for even more writing systems.


Windows and Macintosh also support many writing systems with fonts and keyboards.
Windows and Macintosh also support many writing systems with fonts and keyboards.


==External links==
==External links==

[http://www.unicode.org/charts/ Unicode code charts] for all of these [[writing systems]] are available online in PDF format, so you can see the characters even if you don't have a matching font installed on your computer.
* [http://www.unicode.org/charts/ Unicode code charts] for all of these writing systems are available online in PDF format, so you can see the characters even if you don't have a matching font installed on your computer.

[[Category:Keyboard]]
[[Category:Language support]]
[[Category:Languages (international)]]
[[Category:Fonts]]
[[Category:Alphabets]]

Latest revision as of 04:21, 31 March 2008

We need to support all of the writing systems (alphabets, syllabaries, logographs) used for major languages of countries that join the OLPC project. Full support includes

The following links give information on the topics listed above for a particular writing system, to the extent that contributors to the project have found them. If you know of other issues or resources, please add them on the appropriate pages.

Extended Latin -- Western European languages and more than a thousand others

Arabic -- Arabic, Farsi (Iranian), Dari, Pashto, Urdu, Sindhi, Hausa and others. A number of languages formerly written in Arabic alphabet are now written in the Latin alphabet. This includes Turkish, Swahili, Bahasa Melayu/Bahasa Indonesya, and others.

Hebrew -- Yiddish, Ladino, etc.

Cyrillic (Кирилица) -- Russian, Bulgarian, Serbian, Macedonian, Ukrainian, Belarusian, etc. More than 200 other languages have been written with Cyrillic alphabet. Even when a country offcially switches to the Latin alphabet, there is still a large body of literature which is only available in Cyrillic. In particular, many Turkic languages use a form of Cyrillic with additional letters which may not be well supported by off-the-shelf systems.

Greek -- and Coptic

Armenian

Georgian

Syriac -- Liturgical language

Ethiopic -- For Languages of Ethiopia using Ethiopic alphabet. Amharic, Tigrigna, Oromo, Gurage, etc.

Thaana -- for the Dhivehi language of the Republic of Maldives

Devanagari -- Hindi, Marathi, Nepali etc.

Bengali

Gujarati

Gurmukhi -- for Punjabi

Kannada

Malayalam

Oriya

Tamil

Telugu

Urdu

Sinhala -- (Sri Lanka)

Myanmar

Thai

Lao ພາສາລາວ

Khmer (Cambodia)

Tibetan -- Tibetan, Dzongkha (Bhutan)

Mongolian

Traditional Chinese

Simplified Chinese

Japanese

Korean

Cherokee

Canadian Aboriginal Syllabics -- Native American languages of Canada and the Northern USA, including Inuktitut

Yi -- Minority language in China

Braille -- for the blind, in many languages

Limbu

Tai Le Language spoken by the Dai people in China

Philippine Scripts -- Buhid, Hanunoo, Tagbanya. Obsolete, replaced by Latin.

Osmanya -- Somali

Math

APL -- A Programming Language

Linux systems now routinely come with support for 25 or more of these writing systems, and there are free tools for making keyboard layouts for any languages and writing systems. There are also large Unicode fonts such as Code2000 with the characters for even more writing systems.

Windows and Macintosh also support many writing systems with fonts and keyboards.

External links

  • Unicode code charts for all of these writing systems are available online in PDF format, so you can see the characters even if you don't have a matching font installed on your computer.