
CLDF dataset derived from Liú et al.'s "Collection of Basic Words in Chinese Dialects" from 2007

How to cite

If you use these data, please cite:

  • the original source

    Liú, L.; Wáng, H.; Bǎi, Y. (2007): Xiàndài Hànyǔ fāngyán héxīncí, tèzhēng cíjí 现代汉语方言核心词·特征词集 [Collection of basic vocabulary words and characteristic dialect words in modern Chinese dialects]. Nánjīng: Fènghuáng.

  • the derived dataset, using the DOI of the specific released version you used

Description

This dataset is licensed under a CC-BY-4.0 license.

Conceptlists in Concepticon:

Statistics

Glottolog: 100% · Concepticon: 100% · Source: 100% · BIPA: 100% · CLTS SoundClass: 100%

  • Varieties: 19 (linked to 19 different Glottocodes)
  • Concepts: 203 (linked to 202 different Concepticon concept sets)
  • Lexemes: 4,302
  • Sources: 1
  • Synonymy: 1.12
  • Cognacy: 5,909 cognates in 832 cognate sets (382 singletons)
  • Cognate Diversity: 0.15
  • Invalid lexemes: 0
  • Tokens: 21,895
  • Segments: 145 (0 BIPA errors, 0 CLTS sound class errors, 145 CLTS modified)
  • Inventory size (avg): 50.32
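
The synonymy figure above can be read as the average number of lexemes per concept, computed per variety and then averaged across varieties. The sketch below illustrates that reading with plain Python. It assumes the forms are available as dictionaries keyed by the default CLDF column names `Language_ID` and `Parameter_ID`. The function name `synonymy_index` and the toy rows are hypothetical, and the exact formula used to produce the statistic in this README is not stated here, so treat this as one plausible definition rather than the reference computation.

```python
from collections import Counter, defaultdict


def synonymy_index(forms):
    """Mean number of forms per attested concept, averaged over varieties.

    `forms` is an iterable of dicts with the keys "Language_ID" and
    "Parameter_ID" (assumed CLDF FormTable column names).
    """
    # For each variety, count how many forms express each concept.
    per_language = defaultdict(Counter)
    for form in forms:
        per_language[form["Language_ID"]][form["Parameter_ID"]] += 1

    if not per_language:
        return 0.0

    # Forms per attested concept for each variety, then the mean over varieties.
    ratios = [sum(c.values()) / len(c) for c in per_language.values()]
    return sum(ratios) / len(ratios)


# Hypothetical toy data: variety A has two words for "hand".
toy_forms = [
    {"Language_ID": "A", "Parameter_ID": "hand"},
    {"Language_ID": "A", "Parameter_ID": "hand"},
    {"Language_ID": "A", "Parameter_ID": "foot"},
    {"Language_ID": "B", "Parameter_ID": "hand"},
    {"Language_ID": "B", "Parameter_ID": "foot"},
]

print(synonymy_index(toy_forms))
```

As a rough plausibility check under the same assumption, 4,302 lexemes spread over 19 varieties and 203 concepts gives 4,302 / (19 × 203) ≈ 1.12, which agrees with the reported value if every variety attests every concept.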

Contributors

| Name | GitHub user | Description | Role |
| --- | --- | --- | --- |
| Liú Lìlǐ | | data collector | DataCollector, Editor, Author |
| Wáng Hóngzhōng | | data collector | DataCollector, Editor, Author |
| Bǎi Yíng | | data collector | DataCollector, Editor, Author |
| Johann-Mattis List | @LinguList | maintainer | Editor |

CLDF Datasets

The following CLDF datasets are available in cldf: