- Parser. A specialised software application for navigating through structured data.
- Markup. The process of identifying and encoding data entities within arbitrary data.
- Prologue. A segment of initialising data or code, typically required as an element of processing.
- Tag. In XML context it is an identifiably named paired structure (start tag and end tag) used to enclose the data.
- Attribute. Contextual information pertaining to an element
- Element. In XML context and element represents the tag and all other related entities such as the data constituent and elements.
- Schema. A form of data definition for an XML document.
- Encoding. Has several contextual meanings but typically refers to the character set encoding of a particular document.
XML (eXtensible Markup Language) is a digital standard for describing structured and partially structured data. The XML data model and other features are extensively described in the W3C published standards documents:
- XML Standards (Fifth Edition)
- XPath Specification
- XSL Transformations (XSLT)
- XQuery and XPath Data Models
- XML Schema Definition
XML was approved and published as a standard by the W3C (World Wide Web Consortium) in 1998. The standard itself is a derivative of "Standard Generalised Markup Language" (SGML). Thus it is quite similar in many characteristics to another SGML derivative, HTML.
SGML itself is a derivation of GML - "Generalized Markup Language" and BNF - "Backus-Naur Form".
There are numerous reasons for using XML.
- Can be used to store arbitary data.
- Typically intended to be in a form that is human and machine readable.
- Simple structured open format for long-term sustainability.
- Mature, established and stable. BNF/GML was in use in the late 1950's.
- It can express data as a tree, mitigating the need for the complex interactions required for platforms such as relational databases.
- It's a free, open standard maintained by the W3C (World Wide Web Consortium).