Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Introduction:
This document describes minimal requirements specifications for Indian
languages text layout required for content format in E-publishing.
This documents covers major issues of E-content in Indian languages in order to
create standardize format of text layout like storage, rendering problems, vertical
writing, margins areas, page numbers, repeated head, line breaking etc and CSS
requirements for Indian languages.
The main purpose of this document is to gather the information from E-publishers
about the page text layout they are using for E-publishing.
Storage Requirements:
1. Support of Unicode 6.2 and IVS
For the global language support EPUB should support Unicode and also
should support SVG Fonts and IVS (Ideographic Variation Sequence).
UNICODE is the Universal character encoding standard, used for
representing text for information processing. Unicode encodes all of the
individual characters used for all the written languages of the world. The
standards provide information about the character and their use. Unicode
uses a 16 bit encoding that provides code point for more than 65000
characters (65536). It assigns each character a unique hexadecimal numeric
value and name.
Reference URL : http://www.unicode.org/versions/Unicode6.2.0/
Copyright@W3C India
Page 1
Page 2
2. Fonts
Open Type fonts convert the Unicode code numbers to their glyphs on the
display interface. They are directly based on Unicode. Open Type provides a
series of enhancements to the TrueType format, the most significant of
which allows PostScript font data to nest inside a TrueType software
wrapper.
Open Type allows type designers and font foundries to create larger
character sets within fonts. Within the parameters of the TrueType and
Type 1 formats, fonts are limited to 256 characters. If a typeface designer
wanted to create an extended ligature set, small caps, swash and alternate
characters, or characters to support multiple languages, these had to be
put into another font. The large character set capabilities of Open Type
allows type designers much more latitude in typeface design, resulting in
better graphic communication.
SVG Fonts: The purpose of SVG fonts is to allow for delivery of glyph
outlines in display-only environments. SVG fonts that accompany Web
pages must be supported only in browsing and viewing situations. Graphics
editing applications or file translation tools must not attempt to convert
SVG fonts into system fonts.
Reference URL: http://www.w3.org/TR/SVG/fonts.html
Page 3
Positioning of all running heads and page numbers in the same book should
be consistent. The following ways might be used for positioning running
heads and page numbers in horizontal writing system:
Copyright@W3C India
Page 4
When the principal text direction is Vertical, every text including table
entries is in Vertical writing mode.
Copyright@W3C India
Page 5
I
n
d
i
a
Copyright@W3C India
Page 6
Styling Requirements:
The following CSS issues should help in the implementation of text layout
for Indian languages:
The first-letter pseudo-element represents the first letter of the first line of
a block, if it is not preceded by any other content (such as images or inline
tables) on its line. It allows that first letter to be styled individually, without
markup. It may be used for "initial caps" and "drop caps", which are
common typographical effects in text in Latin script.
Copyright@W3C India
Page 7
Line breaking
Unicode Line Breaking Algorithm UAX #14-(Word wrapping)
Characters not starting a line): A line should not begin with the
characters shown below:
closing brackets (cl-02),
hyphens (cl-03),
dividing punctuation marks (cl-04),
middle dots (cl-05),
full stops (cl-06),
commas (cl-07),
iteration marks (cl-09),
Reference URL: http://unicode.org/reports/tr14/
Reference URL: http://www.w3.org/TR/2007/WD-css3-text-20070306/#linebreaking
Indentation
Sometimes some of the character of a word is indented as in figure-3
the is indented
Example in Bangla:
Copyright@W3C India
Page 8
What should be the solution or rule for such type of styling issue in case
of Indian language Some time people said that styling is done on the
basis of the syllable, but what is the definition of syllable. The definition
of syllable depends on the pronunciation of the word. In the example
the syllables are , , , , but styling is done as
which is not as per the syllable. So we should define the rule instead
of defining it by syllable basis.
Letter spacing
Copyright@W3C India
Page 9
Underlining
Copyright@W3C India
Page 10
First, add the font to your book files in the normal way, by adding an
@font-face statement at the beginning of your CSS, something like this:
@font-face {
font-family: Prophecy Script;
font-style: normal;
font-weight: normal;
src:url("Fonts/Prophecy_Script.ttf");
}
That makes the font available. To apply it to your text, you have to add it
to one of your styles, also in the CSS:
p.letter {
font-family: "Prophecy Script";
font-weight: normal;
font-style: normal;
font-size: 1em;
margin: 1em 0 0 0;
-webkit-hyphens:none;
}
Copyright@W3C India
Page 11
Copyright@W3C India
Page 12
Copyright@W3C India
Page 13
First Letter
In Cursive Text like Arabic and Urdu the styling is applied to whole word
Copyright@W3C India
Page 14
Vertical-align
This property affects the vertical positioning inside a line box of the
boxes generated by an inline-level element.
Text-decoration
This property describes decorations that are added to the text of an
element using the element's color. When specified on or propagated to
an inline element, it affects all the boxes generated by that element, and
is further propagated to any in-flow block-level boxes that split the
inline.
Value: none | [underline || overline || line-through || blink] |
Letter-spacing
This property specifies spacing behavior between text characters.
Text-indent
This property specifies the indentation of the first line of text in a block
container. More precisely, it specifies the indentation of the first box
that flows into the block's first line box. The box is indented with respect
to the left (or right, for right-to-left layout) edge of the line box. User
agents must render this indentation as blank space.
Copyright@W3C India
Page 15
voice-volume
The voice-volume property allows authors to control the amplitude of
the audio waveform generated by the speech synthesizer, and is also
used to adjust the relative volume level of audio cues within the audio
"box" model.
voice-balance
The voice-balance property controls the spatial distribution of audio
output across a lateral sound stage: one extremity is on the left, the
other extremity is on the right hand side, relative to the listener's
position.
speak
The speak property determines whether or not to render text aurally.
speak-as
The speak-as property determines in what manner text gets rendered
aurally, based upon a basic predefined list of possible values.
Copyright@W3C India
Page 16
Pause properties
The pause-before and pause-after properties specify a prosodic
boundary (silence with a specific duration) that occurs before (or after)
the speech synthesis rendition of the selected element, or if any cuebefore (or cue-after) is specified, before (or after) the cue within the
audio "box" model.
Rest properties
The rest-before and rest-after properties specify a prosodic boundary
(silence with a specific duration) that occurs before (or after) the speech
synthesis rendition of an element within the audio "box" model.
Cue properties
The cue-before and cue-after properties specify auditory icons (i.e.
pre-recorded / pre-generated sound clips) to be played before (or after)
the selected element within the audio "box" model.
Copyright@W3C India
Page 17
E-publishing survey
Types of survey:
1. Online survey
2. Offline survey
Copyright@W3C India
Page 18
2 Offline survey
1.
2.
3.
4.
Outcome:
I.
II.
III.
Final report should be prepared to clearly bring out an objective and concrete
outcomes so as to use the same for future actions.
The final outcome should also help in the implementation of Indian languages text layout in the
following areas:
1. E-Publishing in Indian languages
- Page Formats for Indian languages Documents.
- Positioning of Running Heads and Page Numbers.
- Positioning of Closing Brackets, Purnaviram at Line End
- Vertical Writing Mode and Horizontal Writing Mode.
- Paragraph Adjustment Rules.
- Mixed Text Composition in Horizontal Writing Mode.
- Mixed Text Composition in Vertical Writing Mode.
2. CSS
Copyright@W3C India
Page 19
Annexure I
E-Publishing related questions
A. General Questions:
1) Does your organization work for India languages publishing?
a. Yes
b. No
2) If no, then do you have any plans to localize your content in Indian Languages?
2. Are you using e publishing? If so, how e- publishing supplements your publishing?
a. Increase your circulation
b. No
b. No
Copyright@W3C India
b. ISFOC
c. Proprietary Font
d. Others
Page 20
b. No
10. What are the proactive measures that your organization had taken up to avoid the bugs
and problems regarding the picture clarity and the simplified use of script as well as
language?
11. How much space or memory you people uses on web server, so that your data can be
easily retrieved over the network
12. What format of the images does your organization uses for the images that you being
publish in your paper?
a. JPEG
b. TIFF
c. BMP
d. Others
Copyright@W3C India
Page 21
a. Satisfied
b. Need improvement
c. Unsatisfied
d. Please
specify...............................
17. What are the trends of publishing?
18. What type of problems you phase regarding publication?
19. Frequency and Circulation of Publication?
a. Daily
b. Monthly
c. Quarterly
d. Others
b. State wise
c. Others
Copyright@W3C India
Page 22