-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mappings for representing English in Cirth (as in LOTR) and Futhorc/Runes (as in the Hobbit) #34
base: mode/english
Are you sure you want to change the base?
Mappings for representing English in Cirth (as in LOTR) and Futhorc/Runes (as in the Hobbit) #34
Conversation
… English This closely follows the table at https://ring-lord.tripod.com/cirth/angerthaserebor.htm
Any thoughts or comments on the usefulness of this welcome ... I found it helpful in transcribing things where many of the other tools on the web can't do it accurately but Glǽmscribe can! |
Comparing the Cirth LOTR mapping to Table 2 (page 4) in The Letters of Middle-Earth, the differences are:
Comparing the Futhorc Hobbit mapping to Table 1 (page 3) in The Letters of Middle-Earth, it maps fairly closely apart from some of the dipthongs EA ST and EO at the end. |
Hi David, thanks a lot for your contribution ! Having Cirth modes for English would be a wonderful thing. Just for info, I've been working for a long while on a way to make English modes possible in glaemscribe, and we (my friend Bertrand Bellet and I, Bertrand helping me a lot on all the theoretical linguistic aspects) have chosen a difficult way, this is why the repository has not evolved during the past few years - I haven't yet committed the work in progress, so things were looking a bit stalled (which was not totally false but not true either). The engine is now complete, and it relies on our custom version of espeakNG, to produce a non-ambiguous IPA representation of english. The logic behind is phonemic, but the engine can also produce multiple phonetic variations (we have currently 4 variants : traditional RP, modern RP, US, and 'as close to JRRT as possible'). To make yourself an idea of what it could be like, you can have a look at the synthesis chart here (where things are grouped by lexical sets and ambiguous cases). The column mapping being the following : en-tengwar=J.R.R.T.'s pronunciation, en-tengwar-gb=modern RP, en-tengwar-rp=traditional RP, en-tengwar-us=US pronunciation: You could probably build an awesome mode on the top of the engine, and it's just a pity that it's not yet available. The current problem is documentation, as we've made a lot of options available that need to be explained, see : The integration in a mode is really easy, as you just need to enable the engine with a flag in the mode's source code. The engine will then be automatically applied to the input text just before the mode processing, so that instead of working on the english input directly, you would work on an already transformed version such as, for example :
We're currently working on the documentation after a very long hiatus, and we're trying to pack everything up for the end of August. So my first reaction is : what do you feel about the potential use of the english engine for your mode ? Do you see any use for it ? Having a Cirth mode and futhorc mode in addition of the Tengwar mode for the release of the english engine would be totally awesome and extremely timely. Closing this big parenthesis, another immediate reaction on your remarks and questions would be : do not hesitate to use the option system if you have interrogations on what to use to render an ambiguous case. This is generally my ultimate solution to solve the problem of Tolkien's hesitations : you leave the choice to the end user so everyone's satisfied. You can have very atomic options, or if you're afraid that they'd clash together, you can have more general options that affect multiple behaviors. |
Wow thanks for the detailed info! I had seen you were working on something with espeak which I think is brilliant... I'd love to give the espeak engine a try; is this working on master? |
Actually, the same problem lies in the "so-called" XD orthographic modes for tengwar. Tolkien introduces a lot of phonetic tricks inside of them, which make them totally hybrid, and totally unregular. That's a nightmare for a transcriptor like glaemscribe, and is probably more difficult to implement than a phonemic mode. We had a discussion with Bertrand on the matter and decided not to implement any of the orthographic modes described on Amanye Tenceli due to all those exceptions. If we were to implement one, we would probably chose something very regular (which is not really interesting) and maybe, on this basis, add some spices, but in a non-ambiguous way exactly as you did with IPA variants. The english engine is probably not working on the master (honestly, can't remember) :) So I've just committed my |
OK that's really useful, thank you, will check it out! |
Having a look at the Book of Mazarbul's transcription, I could check that an automatic translation from the latin orthography "à la Tolkien" is almost impossible, indeed...
Tolkien makes the distinction between e representing /e/ (they) and the schwa /ə/ (have, taken, bridge). However, he does not make any distinction between a representing /æ/ (have) and /e/ (taken). From the IPA we thus cannot deduce the original orthography for /e(ɪ)/. It makes the mode fall into the category of unregular "orthographic" modes where we need to propose a custom orthography to allow to reflect Tolkien's choices and customisations, as you've done in your proposal. |
One remark playing with the Erebor mode : because glaemscribe is an automatic tool/stuff and addressed to a large audience (amateurs up to scholars) we try to make modes "usable" in a generic context, and that means "filling the gaps" when there are some. This is really touchy, because it means adding a bit of invention to Tolkien's work, or at least merging ideas that come from different chronological sources (sometimes clashing with each other) so we try to make this clear in the docs (when they exist 😞 😞), but also, we try to stay as close as possible to Tolkien's choices or to canonical/natural choices (for tengwar e.g., there's often a phonological logic behind a system than you can use to reconstruct what is missing). One nice tool is the option system of glaemscribe, that allows to propose multiple solutions to augment the satisfaction of picky scholars - and leave the responsibility to the end user. Playing with your Erebor mode, I think that to be closer to what we've done yet, you should probably fill a few gaps, since some common english letters are not represented at all : For https://en.wikipedia.org/wiki/English_orthography So we could handle separately, Also, there are some cirth that exist in Dan Smith's fonts that I've not included in the ds_cirth charset but could (maybe) : Maybe they could be used, I.D.K, but the documentation is very sparse, and probably some time should be taken to evaluate each cirth's origin before use. |
Just in case, I've just added support for the diacritics of the Cirth Erebor fonts in the |
One more note : changing the So I've added an |
Great, thanks for the updates! Will have a look through them... |
One remark : since you've merge the |
Use the closest options to the Anglo-Saxon names for these runes
Explain that we're not doing much phonetic here
Realized this was actually the wrong character and replaced with the correct one
afbd0e3
to
ea1efbb
Compare
So far I've implemented options for variant characters for both the Erebor and the Futhorc scripts. Any comments on this welcome... I have along the way come across a bug that if I produce a sequence |
…or always using k for c, and j for soft g
I've added an approach to the handling of the
I initially included a I've implemented this all in the preprocessor, is that OK? Any comments welcome |
Also abandon using ck - it now would interfere with a natural double consonant if enabled nassty tricksy hobbitses
I've implemented the numeric sub-dots, the underbar for double vowels and the circumflex for double consonants. I couldn't get virtual characters to render in the Glaemscribe editor; this seems to be true in the current live online editor as well: As a result, I've left all the diacritics non-virtual for now as it makes it easier for me to test (I can't run the full website locally) but this should obviously be changed |
Notes:
|
For the virtual characters to be handled you need not to forget to add a call to the |
Edit: wrong guess, see below. |
Also put just one underdot in the senter of 6's six strokes; unsure how Tolkien would have done this
On the standard site's IDE if I just update the main example with the following, it happens...
Also if I say |
Do you have a place for putting sample texts? Or the docs that appear in your main site when a mode is selected? |
Found the problem ! The fix is in the I realized the popups in the debugger were not working anymore. Fixed too. |
Fantastic! Will wait for you to push tomorrow :) |
Seems to me completely ok since the automatic behavior is not enabled by default : after thought, those orthographic modes need quite some attention and niggling and using orthographic variants cannot be avoided anyway. So, having an automatic behavior might be "too much" by default. This being said, as long as there's a way to overload the automatic behavior (exemple : write So the base principle could be : disable all automatic behaviors by default. Regarding the idea of
I do agree !
Probably the easiest way to do those kind of disambiguation, indeed.
That's a good question. If multiple choices could be made by Tolkien within a same text, then we should probably stay simple and let everything be done manually by the user, all the more that marking the schwas is a deliberate and advanced move, so ...
Yes, there are multiple ways to do this, one is to use macros when a lot of options may be mixed together (undocumented), another is to make use of sheaves, e.g. one way of refactoring the handling of consonants that can be doubled with a few lines :
For tengwar modes I usually split those into témar (series) to make the mode file feel like the classical charts. It makes lines shorter and more readable. I usually like to make some variable definitions at the start of the rule group when options leading to base characters are involved (like the variables {CIRTH_G} and {CIRTH_N} once for all).
Seems to me that "alt" is perfect since we cannot do better. It's really painful to work with Cirth since we cannot name them. |
Yes but it's in a private repository since the source code of the website itself is not public, so one way to do this would be for you to prepare some explanations in a doc file, that I would layout and style (when there are charts and quotes, a bit of html/css can't be really avoided). That's how we worked with Da Def for the Lang Belta mode. If you feel like it ? |
Ahh really sorry. I thought I had already pushed the fix but seeing my console history I jumped to something else without reaching the end of the flow ! Now you can merge the |
Thanks for all the detailed feedback and help, will make some changes! |
JFYI : since we've talked about it at the beginning of the conversation and your intervention gave me the mood to work on glaemscribe 😄 , I've just implemented an additional Angerthas Daeron mode for english using the new text-to-speach engine based on espeakNG, in the For the numeral system (to make things fully usable, which is not possible with the few runes given by Tolkien) I had to devise my own system based on the Cirth and took inspiration from the pentimal system.
|
Nice chart ! I appreciate the effort to layout it symmetrically and the style reminds me of the Mendeliev table. Covered with runes, it feels like some kind of alchemic cryptic parchment :) Take your time for making the PR evolve, glaemscribe is a long running project ! ;) |
This contains two new mappings following how Tolkien represented English in his major works: One for the Cirth as used in key parts of Lord of the Rings (
english-cirth-erebor.glaem
) and one for his adaptation of Anglo-Saxon runes / Futhorc as used in the Hobbit (english-futhorc.glaem
).Cirth / LOTR
This mapping closely follows the table at https://ring-lord.tripod.com/cirth/angerthaserebor.htm
It can be used to represent the primary examples of Tolkien writing English in Cirth (see the Mellonath Daeron Index
of Certh Specimina). For each of these, the English text, a phoneticized version that can be inputted into Glaemscribe, and the resulting Cirth characters are given:
The Lord of the Riŋs translat+ʌd from the red book/
!/ I ab@9 I b4 I !/ I @lhg I 8@cugac8?9 I 3@b6 I !/ I @z9 I 2ne}
Balin|sʌn|ov|Fundin|Lord|ov|Moria
2calugZub43Sdluab@9b46b@lc
It currently takes an approach of trying to include all the character combinations presented in the tables, even those that are not useful for English. It uses some IPA symbols where there isn't a standard alphabetic character (notably for the schwa vowel representations) or to differentiate between English sounds (so
ŋ
whenng
is not pronounced followed with a hardg
sound, butng
otherwise).Where there are alternative characters available, these are selected by prefixing them with a
+
. I'm not sure if there is a better way to do this but it does make it possible to represent everything fairly easily...Anglo-Saxon Runes / The Hobbit
This mapping closely follows the table at https://ring-lord.tripod.com/runes/dwafrune.htm (itself based on the Hobbit)
It maps to the Unicode Runic range; some of these mappings are a slight compromise as appropriate letters are not available (see the comments in the mapping)
It can be used to represent the primary examples of Tolkien writing English in Futhorc (see the Mellonath Daeron Runes Specimina index, and Some comprehensive analysis of the usage of these runes). For each of these, a mildly phoneticized version of the English text that can be inputted into Glaemscribe, and the resulting Unicode Runic characters are given: