Replies: 1 comment
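First determine whether the slowdown comes from the tokenizer or from the Kafka data import:

1. Switch back to the default tokenizer and see how the import rate changes.
2. See the real-time feature documentation. By default, import starts from the beginning of the topic, so if the topic already holds a lot of data older than the full build, that data is imported and then filtered out. You can set the import start time with `kafka_start_timestamp`.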
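To compare import rates between the jieba and default tokenizers, it may help to turn two count samples into a rate and a rough time to completion. The following is only a sketch: `import_rate` and `eta_seconds` are hypothetical helper names, not part of Havenask, and the figures are approximations of those reported in the question (about 30,000 of 50 million documents in 30 minutes).

```python
def import_rate(count_start, count_end, elapsed_seconds):
    """Documents imported per second between two count samples."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return (count_end - count_start) / elapsed_seconds


def eta_seconds(total_docs, imported, rate):
    """Seconds needed to import the remaining documents at a constant rate."""
    if rate <= 0:
        return float("inf")
    return (total_docs - imported) / rate


# Approximate figures from the question (assumed exact here for illustration).
jieba_rate = import_rate(0, 30_000, 30 * 60)
remaining_days = eta_seconds(50_000_000, 30_000, jieba_rate) / 86_400

print(f"{jieba_rate:.1f} docs/s")                # 16.7 docs/s
print(f"{remaining_days:.1f} days remaining")    # 34.7 days remaining
```

Running the same calculation on a sample taken with the default tokenizer gives a direct comparison: if the rate barely changes, the tokenizer is probably not the bottleneck.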
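Using version 0.3.0 with the realtime case from the example package, I modified the table fields and replaced the tokenizer with the jieba tokenizer, leaving all other configuration unchanged. I sent 50 million records to Kafka, but after half an hour a count on the index shows only a little over 30,000 documents imported. The machine has 32 cores and 128 GB of memory, and its load shows it is almost 100% idle. Which settings should I change to speed up the import?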