Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Alternate IPA symbols? #1

Open
llfkj opened this issue Jul 21, 2017 · 1 comment
Open

Support Alternate IPA symbols? #1

llfkj opened this issue Jul 21, 2017 · 1 comment

Comments

@llfkj
Copy link

llfkj commented Jul 21, 2017

I'm currently scraping and converting IPA characters in the wild into X-SAMPA and back.

According to https://en.wikipedia.org/wiki/X-SAMPA (citing wikipedia. I'm sorry :), there are a few other symbols in common use for two of the IPA characters: ᵿ (= ʊ̈ or ʉ̞) and ᵻ (= ɪ̈ or ɨ̞) . I added them to a piece of my code (which I had based on the tables in conv.js) to make the parsing more robust.

In case you want to do the same, here's something pastable (python format, sorry)

  [
    "ɪ̈",
    "I\\"
  ],
  [
    "ɨ̞",
    "I\\"
  ],
  [
    "ᵻ",
    "I\\"
  ],
  [
    "ʊ̈",
    "U\\"
  ],
  [
    "ʉ̞",
    "U\\"
  ],
  [
    "ᵿ",
    "U\\"
  ]
@dohliam
Copy link
Owner

dohliam commented Jul 21, 2017

Thanks for this, I hadn't seen these before! Since it seems like they're a part of X-SAMPA I'll go ahead and add them to conv.js to make it more complete. Let me know if you come across any other missing symbols!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants