Skip to content

Commit

Permalink
update new files
Browse files Browse the repository at this point in the history
  • Loading branch information
lingulist committed Mar 16, 2022
1 parent 12f569c commit ce10e92
Show file tree
Hide file tree
Showing 7 changed files with 4,790 additions and 4,780 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
# CLDF dataset derived from Bodt's "Lexical Cognates in Western Kho-Bwa" from 2019

[![CLDF validation](https://github.com/lexibank/bodtkhobwa/workflows/CLDF-validation/badge.svg)](https://github.com/lexibank/bodtkhobwa/actions?query=workflow%3ACLDF-validation)
[![CLDF validation](https://github.com/lingpy/bodtkhobwa//workflows/CLDF-validation/badge.svg)](https://github.com/lingpy/bodtkhobwa//actions?query=workflow%3ACLDF-validation)

## How to cite

Expand Down Expand Up @@ -50,7 +50,7 @@ The collection of the data was funded by:
## Statistics


[![CLDF validation](https://github.com/lexibank/bodtkhobwa/workflows/CLDF-validation/badge.svg)](https://github.com/lexibank/bodtkhobwa/actions?query=workflow%3ACLDF-validation)
[![CLDF validation](https://github.com/lingpy/bodtkhobwa//workflows/CLDF-validation/badge.svg)](https://github.com/lingpy/bodtkhobwa//actions?query=workflow%3ACLDF-validation)
![Glottolog: 100%](https://img.shields.io/badge/Glottolog-100%25-brightgreen.svg "Glottolog: 100%")
![Concepticon: 90%](https://img.shields.io/badge/Concepticon-90%25-green.svg "Concepticon: 90%")
![Source: 100%](https://img.shields.io/badge/Source-100%25-brightgreen.svg "Source: 100%")
Expand Down
16 changes: 4 additions & 12 deletions cldf/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -12,24 +12,15 @@ property | value
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF Wordlist](http://cldf.clld.org/v1.0/terms.rdf#Wordlist)
[dc:format](http://purl.org/dc/terms/format) | <ol><li>http://concepticon.clld.org/contributions/Bodt-2019-664</li></ol>
[dc:license](http://purl.org/dc/terms/license) | https://creativecommons.org/licenses/by/4.0/
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/lexibank/bodtkhobwa
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/lexibank/bodtkhobwa/tree/74c1a87">lexibank/bodtkhobwa v2.0-24-g74c1a87</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v4.4">Glottolog v4.4</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v2.5.0">Concepticon v2.5.0</a></li><li><a href="https://github.com/cldf-clts/clts/tree/v2.1.0">CLTS v2.1.0</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.8.10</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[dcat:accessURL](http://www.w3.org/ns/dcat#accessURL) | https://github.com/lingpy/bodtkhobwa/
[prov:wasDerivedFrom](http://www.w3.org/ns/prov#wasDerivedFrom) | <ol><li><a href="https://github.com/lingpy/bodtkhobwa//tree/v3.0.1">lingpy/bodtkhobwa/ v3.0.1</a></li><li><a href="https://github.com/glottolog/glottolog/tree/v4.5">Glottolog v4.5</a></li><li><a href="https://github.com/concepticon/concepticon-data/tree/v2.5.0">Concepticon v2.5.0</a></li><li><a href="https://github.com/cldf-clts/clts//tree/b12a7df">CLTS v2.1.0-26-gb12a7df</a></li></ol>
[prov:wasGeneratedBy](http://www.w3.org/ns/prov#wasGeneratedBy) | <ol><li><strong>lingpy-rcParams</strong>: <a href="./lingpy-rcParams.json">lingpy-rcParams.json</a></li><li><strong>python</strong>: 3.9.9</li><li><strong>python-packages</strong>: <a href="./requirements.txt">requirements.txt</a></li></ol>
[rdf:ID](http://www.w3.org/1999/02/22-rdf-syntax-ns#ID) | bodtkhobwa
[rdf:type](http://www.w3.org/1999/02/22-rdf-syntax-ns#type) | http://www.w3.org/ns/dcat#Distribution


## <a name="table-formscsv"></a>Table [forms.csv](./forms.csv)


Raw lexical data item as it can be pulled out of the original datasets.

This is the basis for creating rows in CLDF representations of the data by
- splitting the lexical item into forms
- cleaning the forms
- potentially tokenizing the form


property | value
--- | ---
[dc:conformsTo](http://purl.org/dc/terms/conformsTo) | [CLDF FormTable](http://cldf.clld.org/v1.0/terms.rdf#FormTable)
Expand All @@ -53,6 +44,7 @@ Name/Property | Datatype | Description
`Loan` | `boolean` |
`Graphemes` | `string` |
`Profile` | `string` |
`Partial_Cognacy` | `string` |

## <a name="table-languagescsv"></a>Table [languages.csv](./languages.csv)

Expand Down
19 changes: 11 additions & 8 deletions cldf/cldf-metadata.json
Original file line number Diff line number Diff line change
Expand Up @@ -11,18 +11,18 @@
"dc:related": null,
"dc:source": "sources.bib",
"dc:title": "CLDF dataset derived from Bodt's \"Lexical Cognates in Western Kho-Bwa\" from 2019",
"dcat:accessURL": "https://github.com/lexibank/bodtkhobwa",
"dcat:accessURL": "https://github.com/lingpy/bodtkhobwa/",
"prov:wasDerivedFrom": [
{
"rdf:about": "https://github.com/lexibank/bodtkhobwa",
"rdf:about": "https://github.com/lingpy/bodtkhobwa/",
"rdf:type": "prov:Entity",
"dc:created": "v2.0-24-g74c1a87",
"dc:created": "v3.0.1",
"dc:title": "Repository"
},
{
"rdf:about": "https://github.com/glottolog/glottolog",
"rdf:type": "prov:Entity",
"dc:created": "v4.4",
"dc:created": "v4.5",
"dc:title": "Glottolog"
},
{
Expand All @@ -32,9 +32,9 @@
"dc:title": "Concepticon"
},
{
"rdf:about": "https://github.com/cldf-clts/clts",
"rdf:about": "https://github.com/cldf-clts/clts/",
"rdf:type": "prov:Entity",
"dc:created": "v2.1.0",
"dc:created": "v2.1.0-26-gb12a7df",
"dc:title": "CLTS"
}
],
Expand All @@ -45,7 +45,7 @@
},
{
"dc:title": "python",
"dc:description": "3.8.10"
"dc:description": "3.9.9"
},
{
"dc:title": "python-packages",
Expand All @@ -60,7 +60,6 @@
"tables": [
{
"dc:conformsTo": "http://cldf.clld.org/v1.0/terms.rdf#FormTable",
"dc:description": "\nRaw lexical data item as it can be pulled out of the original datasets.\n\nThis is the basis for creating rows in CLDF representations of the data by\n- splitting the lexical item into forms\n- cleaning the forms\n- potentially tokenizing the form\n",
"dc:extent": 4720,
"tableSchema": {
"columns": [
Expand Down Expand Up @@ -130,6 +129,10 @@
{
"datatype": "string",
"name": "Profile"
},
{
"datatype": "string",
"name": "Partial_Cognacy"
}
],
"foreignKeys": [
Expand Down
Loading

0 comments on commit ce10e92

Please sign in to comment.