You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I found a few examples of numbers which are conceptually one number, separated into two tokens:
В доменна пещ бяха изгорени и 157 615 пиратски компактдиска, хванати от митниците.
7 157 157 NUM Mc-pi Definite=Ind|Number=Plur|NumType=Card 10 nummod 10:nummod _
8 615 615 NUM Mc-pi Definite=Ind|Number=Plur|NumType=Card 7 flat 7:flat _
Даваме на Югославия помощи за $ 50 000 помощ
7 50 петдесет NUM Mc-pi Definite=Ind|Number=Plur|NumType=Card 9 nummod 9:nummod _
8 000 000 NUM Mc-pi Definite=Ind|Number=Plur|NumType=Card 7 flat 7:flat _
This is different from the treatment in FR_GSD, for example:
# text = À partir du XXIe siècle, les recensements réels des communes de moins de 10 000 habitants ont lieu tous les cinq ans.
17 10 000 10 000 NUM _ Number=Plur 18 nummod _ _
Conceptually this looks like it should be one token, although I don't know if that's standardized in UD or exactly what implications that would have for downstream tools.
The text was updated successfully, but these errors were encountered:
I found a few examples of numbers which are conceptually one number, separated into two tokens:
This is different from the treatment in FR_GSD, for example:
Conceptually this looks like it should be one token, although I don't know if that's standardized in UD or exactly what implications that would have for downstream tools.
The text was updated successfully, but these errors were encountered: