This is an extension that adds Pickle testing to Lua modules, that is specs among other types of testing. Lua is part of the Scribunto extension, which can be used on a wiki running the Mediawiki software.
The project is ongoing, and as such will be extremely unstable — it might even trigger a meltdown of your computer. :D
The head of the extension is targeted at the head of Mediawiki and the head of Extension:Scribunto. There are made no attempts to make the extension compatible with older versions. The versions can be downloaded from Wikimedias git repository, or from GitHub.
A help portal for users are available at Mediawiki.
- Mediawiki (issues | code)
- Gerrit
git clone https://gerrit.wikimedia.org/r/mediawiki/core.git
- GitHub
git clone https://github.com/wikimedia/mediawiki.git
- Gerrit
- Scribunto (issues | code)
- Gerrit
git clone https://gerrit.wikimedia.org/r/mediawiki/extensions/Scribunto
- GitHub
git clone https://github.com/wikimedia/mediawiki-extensions-Scribunto.git
- Gerrit
- Pickle (issues | code)
- GitHub
git clone https://github.com/jeblad/Pickle.git
- GitHub
The purpose of this extension is to make it possible to do efficient continuous testing by detecting whether Lua modules (pages) are holding a “Pickle”, whereby the Lua code on those pages can run both in an automatic and a manual mode to verify correct operation of some other related module. This makes it possible to do continuous testing in an environment where new code are not only added as part of continuous integration, but even developed in the production environment itself.
The document is targeted to those that need a quick overview of the extension at a system level. That is a developer or system maintainer with technical experience. It is not meant as documentation for the casual user. An end user documentation is available at Help:Pickle
For a full install procedure check out Extension:Pickle
Core documents
Related documents
- Wikitech-l: New policy: Wikimedia Engineering Architecture Principles
- Mediawiki: Wikimedia Technical Committee
- Mediawiki: Wikimedia Engineering Architecture Principles
- Phabricator: Establish Architecture Principles as a policy
- RFC2119: Key words for use in RFCs to Indicate Requirement Levels
Additional documents
- Wikipedia: Continuous testing
- Wikipedia: Continuous integration
- Wikipedia: Continuous delivery
- Wikipedia: Unit testing
- The RSpec Book
- RSpec: API documentation
- busted: Elegant Lua unit testing
One of the core design choices was to make the impact on the servers as small as possible, which means not running anything without it being highly likely that the results can be used in further processing. It starts with short circuit on content model, which is available in the title structure, and only then request additional data, thus the number of modules that should need further inspection should be fairly low, and then the overall load from the extension should be small.
If a relation between a tester (the code testing a module) and a testee (the module being tested) is found, then page properties are used to maintain a page specific state. If that state changes then extension data is set up and later consumed by additional handlers. That makes it possible to create a pluggable system where components can be added and detached as necessary.
Pickle code is written at the site similarly as other Lua code, the only requirement is that pickle code should be a subpage with a specific name.
The pickle code is assumed to connect with the provided Lua modules, and follows the usual “describe”, “context”, and “it” ladder. Each of those manipulates subject and expect to set the expected behavior from some subunit within the testee. This is fairly similar to how RSpec and Busted works.
Core areas of Reliability, Availability, Serviceability/Maintainability, Usability, and Safety (RAS(U), RAMS) is assumed to be sufficiently covered. An alternate formulation could be to use availability as the sole composite measure.
- Availability is usually a percentage given by reliability, maintainability, serviceability, performance and security, with a calculation based on agreed service time and downtime. There has been no attempts to calculate such a number for the extension itself.
- Reliability is a measure of how long system can perform its function without interruption, usually given as MTBF or MTBSI. There has been no attempts to measure this for the extension itself.
- Maintainability is a measure of how fast a system can be restored to normal operation, usually given as MTRS. There has been no attempts to measure this for the extension itself, but as the extension has extensive testing it is assumed that it should be fairly good.
- Serviceability (also supportability) is the ability of technical support personnel to install, configure, and monitor computer products, identify exceptions or faults, debug or isolate faults. In this context the ability to support this extension. There has been no attempts to measure this for the extension itself, but as the extension has extensive testing it is assumed that it should be fairly good.
- Usability is how easily a system can be used. The extension is based upon time-proven techniques from the current interface, and no new types of interactions are defined.
- Safety — other than use of a system user for logging purposes, it is not known that any part of the code needs special safety engineering
At a high level the extension consists of core units to run the tests, to extract and track the state from those tests, and inject the tracked state into the common user interface. In addition, there are some support libraries.
To run the tests
- LuaLibrary (PHP, Lua)
- engine (Lua)
- extractor (Lua)
- render (Lua)
- report (Lua)
To extract and track the state
- ExtractStatus (PHP)
- InvokeSubpage (PHP)
Support libraries
- Strategies (PHP)
- Sublinks (PHP)
- TAP (PHP)
- Observer (PHP)
- TNamedStrategies.php
- TNamedStrategy.php
- util.lua
- Stack.lua
The extension interact with the core system through hooks registered in Hooks.php especially Pickle\Hooks::onContentAlterParserOutput()
. This function will track state changes, and set page properties and extension data accordingly. Other hooks will then pick up the extension data and act on them, thereby setting page indicators, tracking categories, and do public logging.
Calling Pickle\Hooks::onContentAlterParserOutput()
will also run a call to identify a strategy to invoke the proper subpage (InvokeSubpage), process the result from that page (TAP), and then identify the status from the remaining data (ExtractStatus). The call is set up so it can handle both calls made for the tester (aka the pickle-page) and the testee (aka the ordinary module).
The test itself is run as Lua, and is set up in the Pickle.setupInterface()
call (Pickle.lua) by a call from Pickle\LuaLibrary::register()
(LuaLibrary.php).
Tests follow an adaptation of spec-style, with “describe”, “context”, and “it”. Those will then encapsulate tests, which use subject and expect to verify correct behavior of the module under test. The code for those modules can be found in engine. The engine will produce reports which are pure data about the tests. Those reports are then rendered. As of this writing there are three different types of renders. Two of them are plain text renders, one giving a compact style layout and one that giving a full style. The third render creates a web style layout.
Test data can be provided to “describe”, “context”, and “it” as nearly free-form text. Data from the text is extracted and cast into the correct type. The present set of extractors are far from comprehensive, but assumed to be sufficient to make the system functional.
The extension does not provide additional subsystems that can be used by the system (site) as such, but will itself use features from the system. In particular Category, Indicator, and LogEntry, will access facilities from Mediawiki. How this is done is described on Help:Pickle/Tracking categories, Help:Pickle/Page status indicators, and Help:Pickle/Test status log.
In addition to facilities used by the extension it is possible to make new site specific testing frameworks as long as it exports TAP-formatted reports. By following the TAP standard (it is a personal standard, and not a rigid one) the extension can use the new framework as its own. If the call must be changed, possibly also the name of the subpage holding the tests, then the configuration must be adjusted. The configuration section in extension.json would be InvokeSubpage.
The following is in a somewhat logical order.
The unit finds a strategy for accessing a subpage, where the test has a positive outcome if a given page exist (or should exist) in a correct location. That is a relation between a testee (the module being tested) and a tester (the code testing a module) can be established. If the relation exist, or should exist, then the strategy provide methods for the name of the subpage, and also the question that will be parsed and will then provide the report. When a strategy finds a usable relation it will not itself track the relation but leave it to the caller.
The strategies use options from the section InvokeSubpage
in the extensions own extension.json. It is possible to adapt this section to additional entries, typically new ways to invoke the subpage.
The strategy classes are divided into a common interface and a singleton, where the singleton is the access point for higher level code. All PHP classes that implements the strategy are implementations of the interface.
The unit finds a strategy for interpreting the report from a subpage, where the test has a positive outcome if a given parser evaluates to true. That is the tests reports are interpreted as TAP-formatted, and according to the given report it can be simplified into a final state. If a state is recognized, then the strategy provide a method for the name of the strategy which is the same as the state. The current (final) state will then be compared to previous state by the caller.
The strategies use options from the section ExtractStatus
in the extensions own extension.json. It is possible to adapt this section to additional entries, typically new states.
The strategy classes are divided into a common interface and a singleton, where the singleton is the access point for higher level code. All PHP classes that implements the strategy are implementations of the interface.
This is the main access point for passing data from Mediawiki to Scribunto, with Pickle\LuaLibrary::register()
and pickle.setupInterface( opts )
as the main access points. Data is set in Pickle\LuaLibrary::register()
and passed as the opts
structure to pickle.setupInterface( opts )
. By setting the members in the mw
structure values can be made global within Scribunto.
All values passed through opts
should come from the configuration in extension.json, and not be user provided data.
This subsystem is central to proper operation of the extension.
This is a set of classes for setting up and running tests. It contains the main routines to make the frames for “describe”, “context”, and “it”, and the same for “subject” and “expect”. The first three forms the nouns (the frames) in pickle definitions, while the two last forms the access points for further verification.
Routines for this subsystem has a path from source to sink. This should not create any security issue server side, as data is not passed back without proper filtering. Proper escaping should still be done to avoid messing up the output.
This subsystem is central to proper operation of the extension.
This is a set of classes for extracting and casting strings to correct type. It contains a limited set of cast operations, only scalar data types, and a Json type that will be cast into a table. It is used for extracting data from descriptions provided to the frames, that is “describe”, “context”, and “it”, and the strings will then be cast and provided to the functions used by the frames.
This subsystem is a possible security risk because it casts strings into data, as it does not create functions the risk should be limited. If an extractor is made that casts strings into functions the risk could be higher, but note that this only happens in Lua.
The cast methods does not check for valid boundaries before the cast operation is performed, and it will fail if they are wrong.
This subsystem is central to proper operation of the extension.
This is a set of classes for rendering reports in the proper format. It contains sets of renders for a specific style and for each type of report. Each style is in a different named folder, with libraries for rendering a specific report (a type) being called the same in all folders. There are three types that covers most of the use cases; compact, full, and vivid. Compact style is for interactive use in the test console. Full style is for text views where it is necessary to include all details, typically for integration with other systems. Vivid style is for use in the parser function.
Routines for this subsystem has paths from source to sink, and should escape text if necessary before it is returned. Still note that this subsystem will only be able to display its output after sanitation, so there isn't any real security risk only a need to avoid messing up the output.
This subsystem is central to proper operation of the extension.
This is a set of classes for collecting and holding reports. Reports are generated by the engine and holds pure data that will be rendered later on by renders. They should be viewed as data containers only.
This subsystem has paths from source to sink, but should not pose any risk in itself. Proper escaping should be done in the render subsystem.
This subsystem is central to proper operation of the extension.
This is a utility class that holds a number of strategies. Main purpose is to replace the pesky singletons with something that can be easily tested. Most of the strategies use this as a base class.
This subsystem is central to proper operation of the extension.
If pickle-status-current
is set, then a link will be created as a subtitle that points to Special:Log and the proper log entries created for track/good
, track/fail
, track/skip
, track/todo
, track/missing
, track/unknown
, and any additional entries that might exist. In the current development version the link is set up to filter on the page only and not anything further. It might be wise to filter on the module track log.
No values should be passed directly from source to sink in this subsystem.
This subsystem is decoupled from the rest of the system, and only respond on messages passed as extension data. The subsystem can be completely removed if necessary.
A parser to compact a messy and large TAP report into something that is easier to parse repeatedly. This compacted report will maintain the fallback chain, but in its present form it is hard coded. It could be wise to generalize the form.
No values should be passed directly from source to sink in this subsystem.
This subsystem is central to proper operation of the extension, but it can be removed if necessary.
The visual design of the test-console follows the same design idea as used by the debug-console, where a panel is positioned below the usual editor and then extended with Javascript in the browser. It is only the div
that is added directly from the PHP code, but additional code includes ext.pickle.console.js which creates more of the user interface.
On each click of the button a new report will be rendered, in a compact and orderly fashion.
The user interface is created whenever a proper subpage is detected.
If the pickle-status-current
extension data is set to a valid state, then it will be passed on to Mediawiki as a tracking category, and then used in the provided tools for such categories. The names of the tracking categories are formed from message keys created from the name.
The categories are defined trough entries in Category
from extension.json. Only named entries will be acted upon, less the default strategy which has an overridden name. The names the strategies respond to emerge from ExtractStatus
upstream in the pipeline.
There is no direct path from source to sink in the code except for the title object, but this object should be properly handled by the existing system.
This subsystem is decoupled from the rest of the system, and only respond on messages passed as extension data. The subsystem can be completely removed if necessary.
If the pickle-status-current
extension data is set to a valid state, then it will be passed on to Mediawiki as a page indicator, and then used in the provided tools for such indicators. The names of the page indicators are formed from message keys created from the name.
The indicators are defined trough entries in Indicator
from extension.json. Only named entries will be acted upon, less the default strategy which has an overridden name. The names the strategies respond to emerge from ExtractStatus
upstream in the pipeline.
There is no direct path from source to sink in the code except for some fully parsed messages, but that is part of the unprotected and sanitized output and should be properly handled by the existing system.
This subsystem is decoupled from the rest of the system, and only respond on messages passed as extension data. The subsystem can be completely removed if necessary.
If the pickle-status-current
extension data is valid and set to a different value from pickle-status-previous
, then the state change will be passed on to Mediawiki as a log event, and then used in the provided tools for such logs. The names of the log entries are formed from message keys created from the name.
The indicators are defined trough entries in LogEntry
from extension.json. Only named entries will be acted upon, less the default strategy which has an overridden name. The names the strategies respond to emerge from ExtractStatus
upstream in the pipeline.
There is no direct path from source to sink in the code except for the title object, but this object should be properly handled by the existing system.
This subsystem is decoupled from the rest of the system, and only respond on messages passed as extension data.
This is a system user that will be used for reporting state changes that can't be attributed to a specific user. The scenario is typically that some change has taken place that impact a Lua module, and that module is under test. When a user does something that triggers a rendering of the page, the state change is detected, and then the normal behavior would be to log the change with the active user. This would be rather unpopular, as it would not reflect the real cause of the state change. Instead of using that user an alternate account is used.
The user is defined through its numeric id, the entry ObserverID
from extension.json, which work quite well for a single site. On a multisite setup it is necessary to block name changes.
The user class is a singleton as there should be no more than one instance.
Intentionally left blank.
Intentionally left blank.