Some words in Adlam.txt look improperly parsed #64

Open
NeilSureshPatel opened this issue May 25, 2023 · 0 comments

There are a number of words in Adlam.txt that look to be improperly parsed. These all have a capital "Na" in the middle of the word, followed by a "nyondal" (apostrophe) and a "ba".

Sample:
𞤤𞤫𞤴𞤯𞤫𞥅𞤼𞤫𞤐'𞤦𞤫𞤤𞤫
𞤥𞤢𞤳𞥆𞤮𞤐'𞤦𞤫𞤤𞤫

It looks like these are two words that are merged. Perhaps these started out as . . .

𞤤𞤫𞤴𞤯𞤫𞥅𞤼𞤫 𞤐'𞤦𞤫𞤤𞤫
𞤥𞤢𞤳𞥆𞤮 𞤐'𞤦𞤫𞤤𞤫
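
To find and split the affected entries, something like the following could work. This is only a rough sketch, not the project's actual tooling: the function names `find_suspect_words` and `split_merged` are hypothetical, and it assumes the nyondal is stored as an ASCII apostrophe, a right single quotation mark, or U+1E94B. It relies on Adlam capital letters occupying U+1E900 to U+1E921.

```python
import re

# Adlam capital letters occupy U+1E900..U+1E921 in Unicode.
ADLAM_CAPITAL = "[\U0001E900-\U0001E921]"
# Assumption: the nyondal may appear as ', U+2019, or U+1E94B in the word list.
NYONDAL = "['\u2019\U0001E94B]"

# A capital letter plus nyondal that is NOT at the start of the word.
MID_WORD_CAPITAL = re.compile(rf"(?<=\S)({ADLAM_CAPITAL}{NYONDAL})")


def find_suspect_words(words):
    """Return the words that contain a mid-word capital followed by a nyondal."""
    return [w for w in words if MID_WORD_CAPITAL.search(w)]


def split_merged(word):
    """Split a suspect word in two just before the mid-word capital."""
    return MID_WORD_CAPITAL.sub(r" \1", word, count=1).split(" ")
```

Running `find_suspect_words` over the lines of Adlam.txt would give a list of candidates to review by hand before changing anything.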

I was going to clean these up, but I think it may be related to an upstream issue in how the words are extracted from the source material. Even if the words are separated as shown above, the text would not be quite right: word-initial pre-nasalized consonants in Adlam do not use the nyondal.

In both cases the second word would properly be written as 𞤐𞤦𞤫𞤤𞤫. So there may be two issues here: one in the parsing and one in the source material.
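
If the source-material issue were handled as a cleanup step, a minimal sketch might look like this. The helper name `strip_initial_nyondal` is hypothetical, and it assumes the only case to fix is a word-initial capital or small Nun (U+1E910 or U+1E932) immediately followed by an apostrophe-like mark; anything mid-word is left alone.

```python
import re

# Assumption: word-initial Nun (capital U+1E910 or small U+1E932) followed by
# an apostrophe-like nyondal (', U+2019, or U+1E94B) is the pattern to fix.
INITIAL_NUN_NYONDAL = re.compile("^([\U0001E910\U0001E932])['\u2019\U0001E94B]")


def strip_initial_nyondal(word):
    """Drop the nyondal after a word-initial Nun; leave other words unchanged."""
    return INITIAL_NUN_NYONDAL.sub(r"\1", word)
```

Whether this rule is right for every entry would need checking by someone who reads Adlam, since it changes the spelling found in the source text.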
