Skip to content

Latest commit

 

History

History
61 lines (40 loc) · 2.59 KB

README.md

File metadata and controls

61 lines (40 loc) · 2.59 KB

CLDF dataset derived from Peiros' "Genetic classification of Austro-Asiatic languages" from 2004

CLDF validation

How to cite

If you use these data please cite

  • the original source

    Peiros, I. I. (2004): Genetičeskaja klassifikacija avstroaziatskix jazykov [Genetic classification of Austro-Asiatic languages]. Russian State University for the Humanities, Russian State University for the Humanities, Moscow.

  • the derived dataset using the DOI of the particular released version you were using

Description

Lexicostatistic classification of a larger range of Austro-Asiatic languages.

This dataset is licensed under a CC-BY-4.0 license

Available online at https://rusneb.ru/catalog/000199_000009_002728473/

Conceptlists in Concepticon:

Statistics

CLDF validation Glottolog: 87% Concepticon: 100% Source: 100% BIPA: 100% CLTS SoundClass: 100%

  • Varieties: 109 (linked to 85 different Glottocodes)
  • Concepts: 100 (linked to 100 different Concepticon concept sets)
  • Lexemes: 10,706
  • Sources: 1
  • Synonymy: 1.02
  • Cognacy: 10,706 cognates in 1,936 cognate sets (808 singletons)
  • Cognate Diversity: 0.17
  • Invalid lexemes: 0
  • Tokens: 45,870
  • Segments: 277 (0 BIPA errors, 0 CLTS sound class errors, 278 CLTS modified)
  • Inventory size (avg): 41.90

Contributors

Name GitHub user Description Role
Johann-Mattis List @LinguList patron, code Editor
Mei-Shin Wu @MacyL orthography profile Other
Ilia Peiros original data compilation Author

CLDF Datasets

The following CLDF datasets are available in cldf: