Replies: 1 comment
-
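First confirm whether the slowdown comes from the tokenizer or from the Kafka data import: 1. Switch to the default tokenizer and see how the import rate changes. 2. See the realtime feature documentation: by default, import starts from the beginning of the topic, so if the topic already holds a lot of data older than the full build, that data is imported and then filtered out. You can set the import start time with kafka_start_timestamp.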
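To make the comparison in step 1 concrete, here is a minimal sketch for turning two index count readings into an import rate. The helper names `import_rate` and `eta_days` are hypothetical and not part of any library; the sample numbers are the rough figures from the question (about 30,000 documents in half an hour, 50 million in total).

```python
def import_rate(count_start, count_end, elapsed_seconds):
    """Documents indexed per second between two count readings."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (count_end - count_start) / elapsed_seconds


def eta_days(total_docs, rate_per_second):
    """Days needed to import total_docs at a constant rate."""
    return total_docs / rate_per_second / 86400


# Rough figures from the question: ~30,000 docs after 30 minutes.
rate = import_rate(0, 30_000, 30 * 60)
days = eta_days(50_000_000, rate)
print(f"{rate:.1f} docs/s, about {days:.1f} days for 50 million docs")
```

Taking the same two readings with the default tokenizer and again with jieba should show whether the tokenizer accounts for the difference.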
-
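On version 0.3.0, I am using the realtime case from the example package. I modified the table fields and replaced the tokenizer with the jieba tokenizer, leaving the rest of the configuration unchanged. I sent 50 million records to Kafka, but after half an hour a count on the index shows only a little over 30,000 records imported. The machine has 32 cores and 128 GB of memory, and its load shows it is almost 100% idle. Which settings should I change to speed up the import?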