-
Notifications
You must be signed in to change notification settings - Fork 8
Changelog (pagecontent)
Christian Clausner edited this page Jul 16, 2019
·
9 revisions
- Added TextStyle to Page element (default text style for whole page)
- Added "orientation" float attribute to Page element (skew for the whole page)
- Added underlineStyle attribute to TextStyleType (valid values: singleLine, doubleLine, other)
- Added MapRegion and CustomRegion
- New confidence attribute ("conf") for Page, Coords (polygons), BaseLine, ReadingOrder, and AlternativeImage (value 0..1, confidence of algorithm that produced the object)
- New "header" boolean attribute for TableCellRole (is the cell a column or row header?)
- New "index" int attribute for TextLine (Position (order number) of the text line within its parent text region)
- New "imageXResolution", "imageYResolution", "imageResolutionUnit" attributes for Page
- New element "MetadataItem" for Metadata (for extended metadata)
-
- "type" string attribute (optional, one of: author, imageProperties, processingStep, other)
-
- "name" string attribute (optional)
-
- "value" string attribute (required)
-
- "date" dateTime (e.g. 2002-05-30T09:00:00) (optional)
- New "Grid" element for TableRegion (visible or virtual table grid / separators)
-
- "GridPoints" sub element (one row in the points matrix)
- Changed RelationType: Instead of 2 RegionRef sub elements there is now SourceRegionRef and TargetRegionRef. Added "id" attribute (required)
- Added feature for semantic labels / tags for: MetadataItem, Page, regions, textlines, words, glyphs, reading order groups, Relation
-
- Labels element - groups labels from the same model
-
- Label element - the actual label
- Added "AlternativeImage" as optional child element to Region, TextLine, Word, and Glyph. External image for the respective object.
- Added Graphemes (New level below glyphs for compound characters)
- Added User-defined attributes for Page, Metadata, regions, text lines, words, glyphs, reading order groups
- Added concept of region role (roles that nested regions take on in context of a parent region; e.g. TableCellRole)
- Extended text type constraint to include "list-label"
- New attributes for reading order groups (type, custom, comments, regionRef)
- New boolean attribute "continuation" for regions and reading order groups, indicating that an object is the continuation of another object in the previous column or page, for example
- New string attributes "textColourRgb" and "bgColourRgb" for TextStyle elements. Alternative to named colours. Format is RGB encoded (red value) + (256 x green value) + (65536 x blue value).