ISO 639-1 is not sufficient for language fields #582

schneidermic0 · 2024-01-19T14:00:08Z

Currently, it is specified that language fields follow ISO 639-1.

See:

SAP language codes distinguish not only between languages but also between countries. ISO 639-1 does not carry any country information in the language code.

For example, SAP systems distinguish several country variants of English. SAP language code "EN" represents English (United States), while SAP language code "6N" represents English (United Kingdom). There are further country-specific SAP language codes for English, but all of them are represented by the ISO 639-1 language code "en".

The same issue exists for other languages (Arabic, Chinese, Dutch, French, German, and Spanish).

See SAP Note https://launchpad.support.sap.com/#/notes/73606.

During serialisation, when the SAP language is transformed into the ISO 639-1 code, the country information is lost (or the wrong language code might be stored in the system).

schneidermic0 · 2024-01-19T14:12:01Z

Instead of ISO 639-1, we could use one of the following options to represent the SAP language:

SAP language code like "EN", "6N" ...
Locale like "en_US", "en_GB", ...
Combination of ISO 639-1 language and ISO 3166-1 country code separated by an underscore*

*) Remark: I saw many examples where the locale is rendered with a hyphen instead of an underscore (e.g., "en-US", "en-GB", ...). SAP's APIs serialize it with an underscore.
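
The separator difference mentioned in the remark can be illustrated with a small Python sketch. The function names are hypothetical, and the sketch only handles a language code with an optional country code, not scripts or variants.

```python
def locale_to_tag(locale: str) -> str:
    """Turn an underscore-separated locale ("en_US") into a hyphenated tag ("en-US")."""
    language, _, country = locale.partition("_")
    if not country:
        return language.lower()
    return f"{language.lower()}-{country.upper()}"


def tag_to_locale(tag: str) -> str:
    """Turn a hyphenated tag ("en-GB") into an underscore-separated locale ("en_GB")."""
    language, _, country = tag.partition("-")
    if not country:
        return language.lower()
    return f"{language.lower()}_{country.upper()}"


if __name__ == "__main__":
    for value in ("en_US", "en_GB", "en"):
        print(value, "->", locale_to_tag(value))
```

Either form carries the same information, so the choice mainly affects which consumers can read the value without conversion.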

wurzka · 2024-03-20T13:00:48Z

See also:

RFC 5646, section 4.1, states the following:

Use as precise a tag as possible, but no more specific than is
justified. Avoid using subtags that are not important for
distinguishing content in an application.

As far as I understand this section, we could stay with our existing language tags (e.g., "en" for SAP language "EN" representing English (US)), but can add additional information as soon as it is needed. That is, if it should be English (Great Britain), we could use the language tag "en-GB". The same would apply to any other region or script of the English language.

schneidermic0 · 2024-03-28T11:59:57Z

I have also checked how SAP's I18N converter class (cl_i18n_languages) works for BCP47:

If you convert "en" or "en-US" to a SAP1-language, it returns the same SAP1-language in both cases. If you do the same for "en-GB", it returns a different language.

If you convert from a SAP1-language to a BCP47 language tag, it will always return the full tag (e.g., "E" will be converted to "en-US"). However, here we could (not sure yet whether we should) shorten the tag to "en".

I also tested the behavior described above for English with several other languages, such as German and Chinese. It was the same.
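
The conversion behavior described in this comment can be sketched in Python. This is only an illustration, not the logic of cl_i18n_languages: the lookup tables and function names are hypothetical, the two-character SAP codes "EN" and "6N" from the issue description stand in for the SAP1-languages, and only the English entries named in this thread are included.

```python
# Hypothetical lookup table; only these two entries come from the discussion.
SAP_TO_FULL_TAG = {
    "EN": "en-US",
    "6N": "en-GB",
}

# Assumption: the bare language tag resolves to the same SAP language as this full tag.
DEFAULT_FULL_TAG = {"en": "en-US"}


def tag_to_sap(tag: str) -> str:
    """Map a language tag to a SAP language; "en" and "en-US" give the same result."""
    full_tag = DEFAULT_FULL_TAG.get(tag, tag)
    for sap_language, candidate in SAP_TO_FULL_TAG.items():
        if candidate == full_tag:
            return sap_language
    raise KeyError(f"no SAP language known for tag {tag!r}")


def sap_to_tag(sap_language: str, shorten: bool = True) -> str:
    """Map a SAP language to a tag, optionally shortening the default region away."""
    full_tag = SAP_TO_FULL_TAG[sap_language]
    language = full_tag.split("-")[0]
    if shorten and DEFAULT_FULL_TAG.get(language) == full_tag:
        return language
    return full_tag


if __name__ == "__main__":
    print(tag_to_sap("en"), tag_to_sap("en-US"), tag_to_sap("en-GB"))
    print(sap_to_tag("EN"), sap_to_tag("EN", shorten=False), sap_to_tag("6N"))
```

With shortening enabled, the round trip stays lossless because the short tag and the full default tag resolve to the same SAP language.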

schneidermic0 · 2024-03-28T12:04:02Z

Necessary steps to address this issue:

Decide which approach to follow
Adapt schema generator in the tools repository (see Removed requirements for language field abap-file-formats-tools#310)
Adapt all schemas in this repository (see Removed maxLength and pattern for language field #607)
Update documentation in this repository (see updated documentation for the language field #611)
Add list of all supported languages based on SAP Note https://launchpad.support.sap.com/#/notes/73606 (see updated documentation for the language field #611)

schneidermic0 · 2024-04-08T11:02:16Z

Decision: We plan to follow the approach of BCP47 language tags (see above). Whenever possible, we stick to short language tags using the main language only.

schneidermic0 · 2024-04-09T14:20:52Z

Theoretically, we could replace the existing pattern ("^[a-z]+$") in the schema with "^[a-z]{2,3}(?:-[A-Z][a-z]{3})?(?:-[A-Z]{2})?$" to cover all languages supported by SAP (which are a subset of BCP47 language tags).

We think this would be somewhat over-engineered. We don't have patterns for other fields so far. Any objections?

This means the schema will only have the addition "minLength": 2.
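
For reference, the stricter pattern considered above can be tried out with a short Python sketch. The sample values are chosen only to show which shapes the pattern accepts (language, optional script, optional region); they are not a list of languages supported by SAP, and the helper name is hypothetical.

```python
import re

# The stricter pattern quoted in this comment (not the one adopted in the schema).
STRICT_PATTERN = re.compile(r"^[a-z]{2,3}(?:-[A-Z][a-z]{3})?(?:-[A-Z]{2})?$")


def matches_strict_pattern(value: str) -> bool:
    """Return True if the value has the shape language[-Script][-REGION]."""
    return STRICT_PATTERN.fullmatch(value) is not None


if __name__ == "__main__":
    for sample in ("en", "en-GB", "zh-Hans", "EN", "en_GB", "en-gb", "e"):
        print(f"{sample!r}: {matches_strict_pattern(sample)}")
```

The pattern rejects SAP language codes such as "EN" and underscore-separated locales such as "en_GB", which the "minLength"-only schema would let through.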

Old code for original language

        "originalLanguage": {
          "title": "Original Language",
          "description": "Original language of the ABAP object",
          "type": "string",
          "minLength": 2,
          "maxLength": 2,
          "pattern": "^[a-z]+$"
        },

New code for original language

        "originalLanguage": {
          "title": "Original Language",
          "description": "Original language of the ABAP object",
          "type": "string",
          "minLength": 2
        },
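
The effect of the change can be sketched in Python by re-implementing the two sets of constraints by hand. This is an illustration only, not a JSON Schema validator, and the function names are hypothetical.

```python
import re


def valid_under_old_schema(value: str) -> bool:
    """Old constraints: exactly two characters, lowercase letters only."""
    return len(value) == 2 and re.fullmatch(r"[a-z]+", value) is not None


def valid_under_new_schema(value: str) -> bool:
    """New constraints: at least two characters, no pattern."""
    return len(value) >= 2


if __name__ == "__main__":
    for sample in ("en", "en-GB", "EN", "e"):
        print(
            f"{sample!r}: old={valid_under_old_schema(sample)} "
            f"new={valid_under_new_schema(sample)}"
        )
```

The new schema accepts longer tags such as "en-GB", but it no longer rejects values such as "EN", so the exact set of allowed tags has to come from the documentation rather than from the schema.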

schneidermic0 · 2024-04-09T14:21:43Z

Maybe it would be more helpful to list all supported languages, based on SAP Note https://launchpad.support.sap.com/#/notes/73606, in our documentation.

schneidermic0 · 2024-04-12T16:15:44Z

I think all necessary steps for the repository are done. @Markus1812 Thanks for your contributions.

I close this issue :)

schneidermic0 mentioned this issue Jan 31, 2024

Translation for INTF in AFF abapGit/abapGit#6774

Merged

schneidermic0 added the labels "decided" (Design decision made. Implementation by SAP is open) and "bug" (Something isn't working) Apr 8, 2024

This was referenced Apr 9, 2024

Removed requirements for language field SAP/abap-file-formats-tools#310

Merged

Removed maxLength and pattern for language field #607

Merged

schneidermic0 added a commit to micotto/abap-file-formats that referenced this issue Apr 10, 2024

Fix schema due to changes in issue SAP#582

c72cc2e

schneidermic0 added a commit to mseich/abap-file-formats that referenced this issue Apr 10, 2024

Fix schema due to changes in issue SAP#582

328b92b

schneidermic0 added a commit to aaronbruchsap/abap-file-formats that referenced this issue Apr 10, 2024

Fix schema due to changes in issue SAP#582

b0d6f6b

schneidermic0 added a commit to Bomberus/abap-file-formats that referenced this issue Apr 10, 2024

Fix schema due to changes in issue SAP#582

3bc02fc

Markus1812 mentioned this issue Apr 11, 2024

updated documentation for the language field #611

Merged

schneidermic0 closed this as completed Apr 12, 2024

wurzka mentioned this issue Apr 19, 2024

Use bcp47 language code for AFF abapGit/abapGit#6915

Merged

albertmink mentioned this issue Apr 22, 2024

Update JSON Schema abaplint/vscode-abap-artifacts#6

Merged

albertmink mentioned this issue Jul 2, 2024

Adding en-GB abapGit-tests/INTF_i18n#1

Merged
