Formatting rules for Tibetan text
|This section contains Tibetan script. Without proper Tibetan rendering support configured, you may see other symbols instead of Tibetan script.|
- 1 Introduction
- 2 Basic formatting rules for Tibetan text
- 3 Advanced formatting with LibreOffice
- 4 LibreOffice tools
- 5 Printing
- 6 Internal Links
- 7 External References
Current office programs like Microsoft office 2003 - 2013, LibreOffice 4.x and OpenOffice 3.x handle Tibetan script quite well, and once they are set up correctly, line breaks are handled correctly in most cases. If your text processor breaks Tibetan syllables in the middle, you either need to update to a newer version, or check the setup for Microsoft Office or LibreOffice.
The following describes the formatting process using the example of the following short Tibetan text:
- Download example: Tibetan LibreOffice sample (unformatted)
This example (see image ) shows quite a number of formatting short-comings:
- There are shads ། at a beginning of a line, which is forbidden,
- There is no difference in font size for headline, commentary at the end and main text,
- There is no yig mgo ༄༅ or sbrul shad ༈ marking the start of the text.
- There is no justification.
The following chapter shows how to enhance the formatting of our example.
Basic formatting rules for Tibetan text
- Line breaks must not occur in the middle of a syllable. (Your word processor should take care of that already).
- Line breaks can appear after a syllable separator dot tsheg ་ (preferably not in the middle of a Tibetan word). Exception: the sequence nga <tsheg> <shad> ང་། Here the tsheg is a so-called non-breaking tsheg.
- Additionally, line breaks are possible after shad །, terma-sign gter ma ༔ and a visarga ཿ.
Inter-syllable marker tsheg ་
- There is never a tsheg after a visarga. Example: oṃ āḥ huṃ, Wylie: oM AHhU~M, ཨོཾ་ཨཱཿཧཱུྃ་ — there is no tsheg after āḥ.
- Tibetan uses only non-breaking spaces which do not vary in size on left-right justification.
- Note: To insert a non-breaking space, press <Ctrl>-<Shift>-<Space> (this works in LibreOffice or Microsoft Office). Most Tibetan keyboards also support '_' (underscore) as a short-cut for non-breaking-space.
Usage of punctuation character shad །
- A line must not start with a shad །.
- A shad ། is used as a Tibetan inter-punctuation, similar but not identical to a comma.
- Verses, headlines or ends of longer paragraphs are ended by the sequence <shad> <space> <shad> ། །.
- Exception: if the last letter of a line is either a ka ཀ or a ga ག, one shad ། is ommitted. This is also the case if ka ཀ or ga ག have vowel-signs. A shad ། is not omitted if they have a sub- or superscript. Examples:
- Incorrect: གི།, ཀུ། །,
- Correct: གི, ཀུ །, སྐུ།, གྲུ། །.
Rules for replacing shad ། by rin chen spungs shad ༑
- In Tibetan, especially in pechas, it is considered a special case, if the last syllable of an expression that is terminated by a shad ། breaks to a new line. In that case the shad ། or double shad shad ། ། is replaced by rin chen spungs shad ༑ or ༑ ༑. This serves as an optitical indication that there is a left-over syllable at the beginning of the line that actually belongs to the preceding line.
- a special case would be for example le'u: in a line starting with ལེའུ། །, no rin chen spungs shad would be used, since le'u is pronounced as two syllables.
- Variants: some books-prints do not use rin chen spungs shad replacements, however the majority of books seems to apply the same rules as are used with pechas.
- Sometimes in the sequence ། ། only the first shad is replaced: ༑ །, but this style is considered less beautiful.
- Correctly using rin chen spungs shad ༑ in long texts can be very time consuming, because even very small formatting or content changes might move rin chen spungs shad ༑ into positions where they are no longer correct. Same is true for shad ། that suddenly need to become rin chen spungs shad ༑. The Tibetan Formatting LibreOffice Extension can do the application of the rin chen spungs shad ༑ automatically within LibreOffice.
Numbers and special signs
- Numbers: Usually the Space character in Tibetan text is quite wide and occurs only after a shad ། or ༑, gter ma ༔, or visarga ཿ. Exception are numbers and embedded Western text. Tibetan numbers are separated from left and right Tibetan letters by smaller spaces:
- Terma signs: In case a section of text that is actually a gter ma, a single terma symbol ༔ replaces both shad ། and double shad ། །. Wood-block pechas sometimes simplify the gter ma ༔ so that it looks like a visarga ཿ, but digital texts should use the proper terma sign ༔.
- Honorific marks: An honorific emphasis can be expressed by a special prefix ༧, by colour, or by circles under the syllable as in the following example: .
- Root text identifier: ༷A simple circle under a Tibetan letter is used in a similar way as underlining is used with Western script, it's usage is similar to the honorific mark. The simple subjoined circle is often used to identify root-text citations within a commentary: སྐུ༷་གསུ༷ང་ཐུག༷ས།.
- Sanskrit Halanta ྄, Tib. sog mé, སྲོག་མེད་, when placed under a Tibetan letter within a transliterated word, it signifies that his letter has no vowel: ཛཔ྄་ is therefore pronounced jap and not japa.
- Repetition sign ༴, dü tag, བསྡུས་རྟགས་, is used to indicate within a repeating structure that the text should continue similarly as above, comparable to etc..
Head letters, yig mgo ༄༅ and sbrul shad ༈
- The head letter yig mgo ༄༅། ། in pechas always marks the upper left corner of a front-page of a pecha. It serves for quick optical discrimination between front- and back-side of a pecha. This usage does not occur in books. Additionally, in both pechas and books, the yig mgo ༄༅། ། is used at the start of headlines, and at the start of the first paragraph of a longer text. (See image )
- The sbrul shad ༈ is used to indicate:
- the start of a smaller text or prayer,
- a chapter boundary
- in text or prayer collections, the start of a new text or prayer
- in pechas, to mark insertions into a text: a sbrul shad ༈ would mark both beginning and end of the insert
Small print yig chung
- Commentaries and annotations in Tibetan books and pechas are printed using an about 25-30% smaller font-size. In pechas, additionally headlines are printed in yig chung size. See image  for an example. If normal-size Tibetan and yig chung are mixed, it is important to align the letter-heads of different size characters to the same height. The character super- and sub-script function of the word processor can be used to correctly align the letters. Example:
Image  shows the same text as image  with all formatting rules applied.
- You can download the formatted example: Tibetan LibreOffice sample (simple formatting)
Advanced formatting with LibreOffice
- Left/right justification: Western text is left/right justified by slightly expanding white spaces between words. This method does not lead to acceptable results with Tibetan since often there are only very few or even no white spaces in a line. To correctly justify Tibetan, the spacing between all characters should be adapted equally. The width of the white-space character should not be changed significantly; therefore Tibetan texts use the non-breaking space (Unicode U+A0) as white space, which doesn't change width on justification. * Linux-based operating systems: LibreOffice on Linux directly supports left/right justification. The option "Paragraph / Expand single words" (see image ) needs to be activated.
- Windows-based operating systems: Unfortunately this does not work with Windows versions of LibreOffice. Windows users need to download the Tibetan Formatting LibreOffice Extension. This extension installs a Tibetan menu in LibreOffice. The option "Tibetan / Insert justification characters" will insert in all paragraphs, which are formatted as left/right justified (using LibreOffice's Format / Paragraph / Alignment / Justified), invisible zero-space characters after each tsheg: this makes LibreOffice correctly justify Tibetan in Windows. See image  for an example.
- Tip: Left/right justification, especially when using the Windows extension, and after using the automatic rin chen spungs chad ༑ insertion, might lead to ugly spaces after tshegs. In many cases this effect can be reduced by slightly compressing all characters by 0,1 to 0,2 pt using the LibreOffice command "Format / Character / Position / Condensed" option. See image .
- The sample file with justification (Windows method) can be found here: Download example: Tibetan LibreOffice sample (justified Tibetan).
Note: See Configuring Microsoft Office 2003, 2007 or 2010 for Tibetan script for justification with Microsoft Office 2010.
The Tibetan Formatting LibreOffice Extension supports two tasks for Tibetan processing in LibreOffice:
- Left/Right justification for Windows computers
- Automatic application of rin chen spungs chad ༑ insertion. Especially for longer texts this can save a lot of time when formatting texts.
See Tibetan Formatting LibreOffice Extension for more details.
- For best results, first create a PDF of a document you intent to print. Some printer drivers do not correctly handle Tibetan stacks and generate inferior results, on direct print from Office.
- Additionally PDFs make sure that the print-output is always the same regardless of printer and computer used for printing.
- When printing from Adobe reader, make sure to select "Page Scaling: None" in the print dialog, to prevent slight compression of the pages. This is especially important for pecha printing.
- How to edit Tibetan texts - This article explains the basic setup to be able to work with Tibetan texts in Office (LibreOffice or Microsoft)
- Configuring Microsoft Office 2003, 2007 or 2010 for Tibetan script
- LibreOffice / OpenOffice configuration for Tibetan script support.
- A spellchecker for Tibetan and Libre- or OpenOffice. A spellchecking extension for Libre- and OpenOffice provided by Elie Roux.
- Tibetan Formatting LibreOffice Extension - A tool that allows to right/left justify Tibetan and perform automatic formatting (rin chen spungs shad handling).