Sony PEG-NZ90 Picsel Plain Text File Format Support - Page 4

Plain Text, Character Sets

Picsel Text File Format Support, Page 4
ePAGE supports the most popular global file formats. The formats interpreted by ePAGE are richly expressive, containing not just text but sophisticated layout and rendering features, rich fonts, colour, images, tables, graphics and many other document features.

Picsel is continuously developing its file format support, with the aim of eventually covering every feature of the native file. With such a wealth of features across many document types, this is inevitably an ongoing process, with milestone releases of new functionality planned at periodic intervals. The approach involves researching the feature set most commonly found in real documents, building support early for the most frequently used elements, and ensuring these features are reproduced with total faithfulness to the original. The emphasis of ePAGE is on displaying rich content rather than on reproducing the document creation facilities of the original application.

This document describes the features supported in ePAGE. This level of support already covers the vast majority of characteristics that occur in day-to-day documents of this type, and the specific features are described with notes where appropriate. Features planned for future implementation are also described, for completeness.

Plain Text
While many document formats encode extra information such as page size and pictures into a single file, the most basic commonly used file format for text is the plain text file. It has no special file structure: from the first byte to the last, it is a sequence of characters. Some of these characters are not displayed visually but instead start a new line or indent to the next tab stop, as defined in the ASCII standard (ANSI X3.4, initially approved in 1969).
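
To illustrate the point that a plain text file is only a sequence of characters, some of which control layout rather than appear on screen, here is a minimal sketch in Python. The function name describe_bytes and the label strings are hypothetical, chosen for illustration; this is not ePAGE code and makes no claim about how ePAGE parses files.

```python
from typing import List


def describe_bytes(data: bytes) -> List[str]:
    """Label each byte of 7-bit ASCII text as visible or as a control code.

    Hypothetical helper for illustration only.
    """
    labels = []
    for b in data:
        if b == 0x0A:
            labels.append("LF (new line)")
        elif b == 0x0D:
            labels.append("CR (carriage return)")
        elif b == 0x09:
            labels.append("HT (horizontal tab)")
        elif 0x20 <= b <= 0x7E:
            # Printable ASCII range: the byte maps directly to a visible character.
            labels.append(chr(b))
        else:
            # Any other byte is shown as a hex value rather than interpreted.
            labels.append(f"0x{b:02X}")
    return labels


if __name__ == "__main__":
    for label in describe_bytes(b"A\tB\n"):
        print(label)
```

Only the tab and line feed bytes change the layout here; the letters are the only bytes that would be drawn as visible characters.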

Feature                  Support  Notes
Plain 7-bit ASCII text   Yes
Latin1 text              Yes
Newlines                 Yes
Tabs                     Yes      Converted to sequences of spaces
Unlimited page size      Yes

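
The note on tabs says they are converted to sequences of spaces. One common way to do this is to pad each tab out to the next tab stop, sketched below in Python. The function name expand_tabs_line is hypothetical, and the tab stop interval of 8 columns is an assumption: the source does not state the interval ePAGE uses or whether it pads to tab stops at all.

```python
def expand_tabs_line(line: str, tab_width: int = 8) -> str:
    """Replace each tab in a single line with spaces up to the next tab stop.

    tab_width is an assumed value; the source does not give ePAGE's setting.
    """
    out = []
    column = 0
    for ch in line:
        if ch == "\t":
            # Pad to the next multiple of tab_width (always at least one space).
            pad = tab_width - (column % tab_width)
            out.append(" " * pad)
            column += pad
        else:
            out.append(ch)
            column += 1
    return "".join(out)


if __name__ == "__main__":
    print(repr(expand_tabs_line("a\tb")))
```

For single lines this behaves like Python's built-in str.expandtabs, which could be used instead; the explicit loop is shown only to make the column arithmetic visible.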
Character Sets
ePAGE can work with all characters in the Unicode specification. Text documents may be encoded in other encodings, and ePAGE handles these by converting Latin1 to UTF-16 Unicode. (In addition to the encoding