Testing the Text Search Configuration

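    • alias: text. Short name of the token type.
    • description: text. Description of the token type.
    • token: text. The text of the token.
    • dictionaries: regdictionary[]. The dictionaries selected by the configuration for this token type.
    • dictionary: regdictionary. The dictionary that recognized the token, or NULL if none did.
    • lexemes: text[]. The lexemes produced by the dictionary that recognized the token, or NULL if none did. An empty array ({}) means the token was recognized as a stop word.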
       alias   |   description   | token |  dictionaries  |  dictionary  | lexemes
    -----------+-----------------+-------+----------------+--------------+---------
     asciiword | Word, all ASCII | a     | {english_stem} | english_stem | {}
     asciiword | Word, all ASCII | fat   | {english_stem} | english_stem | {fat}
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | cat   | {english_stem} | english_stem | {cat}
     blank     | Space symbols   |       | {}             |              |
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | on    | {english_stem} | english_stem | {}
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | a     | {english_stem} | english_stem | {}
     blank     | Space symbols   |       | {}             |              |
     blank     | Space symbols   |       | {}             |              |
     blank     | Space symbols   | -     | {}             |              |
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | ate   | {english_stem} | english_stem | {ate}
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | a     | {english_stem} | english_stem | {}
     blank     | Space symbols   |       | {}             |              |
     asciiword | Word, all ASCII | fat   | {english_stem} | english_stem | {fat}
     blank     | Space symbols   |       | {}             |              |
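
The three possible states of the lexemes column (NULL, empty array, non-empty array) are easy to confuse when reading the output above. The following is a minimal Python sketch of how a client might interpret them. The function name classify_lexemes and the sample rows are hypothetical, and the sketch assumes the client library maps SQL NULL to None and text[] to a Python list.

```python
from typing import List, Optional


def classify_lexemes(lexemes: Optional[List[str]]) -> str:
    """Interpret the lexemes column of one output row (hypothetical helper)."""
    if lexemes is None:
        # NULL: no dictionary recognized the token, so no lexemes were produced.
        return "unrecognized"
    if len(lexemes) == 0:
        # Empty array ({}): a dictionary recognized the token as a stop word.
        return "stop word"
    # Non-empty array: the lexemes the dictionary produced for the token.
    return "indexed as " + ", ".join(lexemes)


# Hypothetical rows mirroring the sample output: (alias, token, lexemes).
rows = [
    ("asciiword", "a", []),
    ("asciiword", "fat", ["fat"]),
    ("blank", " ", None),
]

for alias, token, lexemes in rows:
    print(f"{alias:<9} {token!r:<6} {classify_lexemes(lexemes)}")
```

In the sample output, "a" and "on" fall into the stop-word case, "fat", "cat", and "ate" produce lexemes, and every blank token is unrecognized because no dictionary is assigned to that token type.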