Numbers, Numeral Classifiers and more

Kanji Numbers

These are parsed using the normal grammar, as described in Bender (2002).

Hindu-Arabic numbers

What we should do is parse numbers and pass them through using SMAF.

ChaSen settings

We set ChaSen to join numbers together:

(連結品詞 ((名詞 数))
          ((記号 アルファベット)))
(COMPOSIT_POS ((名詞 数))
          ((記号 アルファベット)))

Work Around

Until we have a proper preprocessor, we just add all numbers from 0–999, using the script utils/make-num.perl. We add fullwidth and halfwidth (marked -a for ascii) versions. We could add more as needed.

n_card-42 := card_2_no_valence &
 [ STEM < "42" >,
   SYNSEM.LKEYS.KEYREL.CARG "42" ].

n_card-42-a := card_2_no_valence &
 [ STEM < "42" >,
   SYNSEM.LKEYS.KEYREL.CARG "42" ].

References

Last update: 2011-10-09 by anonymous [edit]