Skip to content

Changelog (pagecontent)

Christian Clausner edited this page Jul 16, 2019 · 9 revisions

Version 2019-07-15

  • Added TextStyle to Page element (default text style for whole page)
  • Added "orientation" float attribute to Page element (skew for the whole page)
  • Added underlineStyle attribute to TextStyleType (valid values: singleLine, doubleLine, other)

Version 2018-07-15

  • Added MapRegion and CustomRegion
  • New confidence attribute ("conf") for Page, Coords (polygons), BaseLine, ReadingOrder, and AlternativeImage (value 0..1, confidence of algorithm that produced the object)
  • New "header" boolean attribute for TableCellRole (is the cell a column or row header?)
  • New "index" int attribute for TextLine (Position (order number) of the text line within its parent text region)
  • New "imageXResolution", "imageYResolution", "imageResolutionUnit" attributes for Page
  • New element "MetadataItem" for Metadata (for extended metadata)
    • "type" string attribute (optional, one of: author, imageProperties, processingStep, other)
    • "name" string attribute (optional)
    • "value" string attribute (required)
    • "date" dateTime (e.g. 2002-05-30T09:00:00) (optional)
  • New "Grid" element for TableRegion (visible or virtual table grid / separators)
    • "GridPoints" sub element (one row in the points matrix)
  • Changed RelationType: Instead of 2 RegionRef sub elements there is now SourceRegionRef and TargetRegionRef. Added "id" attribute (required)
  • Added feature for semantic labels / tags for: MetadataItem, Page, regions, textlines, words, glyphs, reading order groups, Relation
    • Labels element - groups labels from the same model
    • Label element - the actual label
  • Added "AlternativeImage" as optional child element to Region, TextLine, Word, and Glyph. External image for the respective object.

Version 2017-07-15

  • Added Graphemes (New level below glyphs for compound characters)
  • Added User-defined attributes for Page, Metadata, regions, text lines, words, glyphs, reading order groups
  • Added concept of region role (roles that nested regions take on in context of a parent region; e.g. TableCellRole)
  • Extended text type constraint to include "list-label"
  • New attributes for reading order groups (type, custom, comments, regionRef)
  • New boolean attribute "continuation" for regions and reading order groups, indicating that an object is the continuation of another object in the previous column or page, for example
  • New string attributes "textColourRgb" and "bgColourRgb" for TextStyle elements. Alternative to named colours. Format is RGB encoded (red value) + (256 x green value) + (65536 x blue value).

Older Changes

See Changelog on primaresearch.org

Clone this wiki locally