You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the process of generating an importjson file strongly depends on the contents of the CW dictionary. We intend to make the following changes:
There's repeated code to process each dictionary. Isolate code that processes complete dictionaries in one place
Encapsulate source specific changes into each source's class
Separate special initialization processes currently done while converting CW into a general class so that any source can be "the first to appear"
Turn aggregation from a global operation to a one-to-one process so that source priority can be easily changed.
Ensure entries that have no mapping on previously aggregated entries are still included (depends on previous issues)\
Refactor instructions to use _altlab versions of alternative dictionaries instead of main (mostly immutable) sources.
Change documentation and dependencies on use of FSTs: Currently we only use the relaxed analyzer as a way to account for spelling differences between dictionaries.
It is currently expected that finalizing this process will expand the crkeng_dictionary.importjson file with around 8k senses that are currently being discarded by the matching process.
The text was updated successfully, but these errors were encountered:
Currently, the process of generating an
importjson
file strongly depends on the contents of the CW dictionary. We intend to make the following changes:_altlab
versions of alternative dictionaries instead of main (mostly immutable) sources.It is currently expected that finalizing this process will expand the
crkeng_dictionary.importjson
file with around 8k senses that are currently being discarded by the matching process.The text was updated successfully, but these errors were encountered: