Skip to content

IWN-En contains linked Wordnet data which was linked with the help of manual efforts by lexicographers in CFILT

License

Notifications You must be signed in to change notification settings

cfiltnlp/IWN-En

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Computation for Indian Language Technology Logo

IndoWordnet - English Wordnet Mapping

GitHub issues GitHub forks GitHub stars GitHub license Twitter Follow

About

This repository contains the linked IndoWordnet data with English Wordnet published at the Langauge Resources and Evaluation conference (LREC) in 2018. The paper is available here and here.

IndoWordnet can be accessed online via this URL

We acknowledge the lexicographers from CFILT lab who created this data by manually linking English and Hindi Wordnet synsets alogn with the engineers/researchers who enabled the data curation.

Recent Updates

  • Version 1.0: IWN-EN release with Assamese, Bodo, Kashmiri, Konkani, Manipuri, Marathi, Nepali, Oriya, and Sanskrit synsets. All IWN synset linkages are now present here (to English Wordnet).
  • Version 0.5: IWN-EN release with Hindi, Bengali, Gujarati, Kannada, Malayalam, Punjabi, Tamil, Telugu and Urdu Wordnet synsets.
  • Version 0.0.1: IWN-EN initial release with Hindi Wordnet and English Wordnet mapping.

Usage

The raw format dataset files can also be found on this Git repository under the data folder.

Maintainer(s)

Diptesh Kanojia
Shivam Mhaskar

Citation

Diptesh Kanojia, Kevin Patel, and Pushpak Bhattacharyya. 2018. Indian Language Wordnets and their Linkages with Princeton WordNet. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, Japan. European Language Resources Association (ELRA).

BiBTeX Citation

@inproceedings{kanojia-etal-2018-indian,
    title = "{I}ndian {L}anguage {W}ordnets and their {L}inkages with {P}rinceton {W}ord{N}et",
    author = "Kanojia, Diptesh  and
      Patel, Kevin  and
      Bhattacharyya, Pushpak",
    booktitle = "Proceedings of the Eleventh International Conference on Language Resources and Evaluation ({LREC} 2018)",
    month = may,
    year = "2018",
    address = "Miyazaki, Japan",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://aclanthology.org/L18-1728",
}

About

IWN-En contains linked Wordnet data which was linked with the help of manual efforts by lexicographers in CFILT

Topics

Resources

License

Stars

Watchers

Forks