Home

Information Architecture

The key idea is to split parsing into two stages, analogous to the lexer and parser in a compiler. Dividing the work this way lets each stage be simpler.

The first stage (this repo) crawls the original sources and converts them to JSON. The JSON schema mirrors the original content as closely as possible, so each type of source produces very different-looking JSON. But because it is all JSON (instead of PDF, HTML, etc.), the next stage can read every source easily. The second stage can then focus on converting the source schema to a particular app's needs.
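As a rough illustration of the second stage, here is a minimal sketch in Python. Everything in it is assumed for the example: the stage-one JSON shape (`chapters`, `rules`, `number`, `name`, `text`), the sample values, the function name `to_app_records`, and the app's record fields are all invented and are not this repo's actual schema or code.

```python
import json

# Hypothetical stage-one output: JSON that mirrors the source's own
# structure (chapters containing rules). All values are made up.
STAGE_ONE_OUTPUT = json.dumps({
    "chapters": [
        {
            "number": "123",
            "name": "Example Chapter",
            "rules": [
                {"number": "123-001-0001", "name": "Definitions",
                 "text": "As used in this chapter..."},
                {"number": "123-001-0002", "name": "Scope",
                 "text": "These rules apply to..."},
            ],
        }
    ]
})


def to_app_records(source_json: str) -> list[dict]:
    """Stage two: flatten the source-shaped JSON into the flat records
    that one hypothetical app wants."""
    source = json.loads(source_json)
    records = []
    for chapter in source["chapters"]:
        for rule in chapter["rules"]:
            records.append({
                "id": rule["number"],
                "title": rule["name"],
                "chapter": chapter["name"],
                "body": rule["text"],
            })
    return records


records = to_app_records(STAGE_ONE_OUTPUT)
```

The second stage here only ever touches JSON. It needs no knowledge of how the source was crawled or what format it was originally published in.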

Public Law Data Flow (Horizontal)

Example: U.S.A. / Oregon Administrative Rules

Public Law Data Flow Example - OAR

Example: Canada / Dept. of Justice Legal Glossaries

Public Law Data Flow Example - DoJ Glossaries

Current project: International Law in support of Ukraine
