Is there any doc on how to setup kafka on confluen...
# all-things-deployment
s
Is there any doc on how to set up Kafka on Confluent Cloud (https://confluent.cloud/) with DataHub? I read https://github.com/linkedin/datahub/tree/master/datahub-kubernetes/datahub and it just mentions the URLs under the global.kafka.* settings. There is no mention of any API key to authenticate with Confluent Cloud.
Found something at https://github.com/linkedin/datahub/issues/2504. Hopefully this works.
It is not clear what to do about ZooKeeper with Confluent Cloud. I don't think DataHub needs direct ZooKeeper access. I am going to try to run it using values edited from https://github.com/linkedin/datahub/blob/master/datahub-kubernetes/datahub/values.yaml
Getting this error in datahub-kafka-setup-job when trying to do this setup on GCP using Confluent Kafka:
[kafka-admin-client-thread | adminclient-1] ERROR org.apache.kafka.common.utils.KafkaThread - Uncaught exception in thread 'kafka-admin-client-thread | adminclient-1':
java.lang.OutOfMemoryError: Java heap space
    at java.nio.HeapByteBuffer.<init>(HeapByteBuffer.java:57)
    at java.nio.ByteBuffer.allocate(ByteBuffer.java:335)
    at org.apache.kafka.common.memory.MemoryPool$1.tryAllocate(MemoryPool.java:30)
    at org.apache.kafka.common.network.NetworkReceive.readFrom(NetworkReceive.java:112)
    at org.apache.kafka.common.network.KafkaChannel.receive(KafkaChannel.java:424)
    at org.apache.kafka.common.network.KafkaChannel.read(KafkaChannel.java:385)
    at org.apache.kafka.common.network.Selector.attemptRead(Selector.java:651)
    at org.apache.kafka.common.network.Selector.pollSelectionKeys(Selector.java:572)
    at org.apache.kafka.common.network.Selector.poll(Selector.java:483)
    at org.apache.kafka.clients.NetworkClient.poll(NetworkClient.java:540)
    at org.apache.kafka.clients.admin.KafkaAdminClient$AdminClientRunnable.run(KafkaAdminClient.java:1196)
    at java.lang.Thread.run(Thread.java:748)
This does not make any sense as memory and CPU seem to be enough as per metrics
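One possible explanation, offered as a guess rather than a confirmed diagnosis of this job: a Kafka client that speaks plaintext to a TLS-only listener reads the first four bytes of the broker's reply as a message size and tries to allocate a buffer that large, which would fit the NetworkReceive.readFrom frame in the trace above. The Python sketch below only illustrates that arithmetic. It assumes the broker answers with a TLS alert record, and `claimed_size` is a hypothetical helper, not part of any Kafka library.

```python
import struct


def claimed_size(first_bytes: bytes) -> int:
    """Read a length prefix the way a Kafka client does:
    a signed, big-endian, 32-bit integer taken from the first four bytes."""
    (size,) = struct.unpack(">i", first_bytes[:4])
    return size


# Assumption: the reply is a TLS alert record. Such a record starts with
# content type 0x15, protocol version 0x03 0x03, then a two-byte length.
tls_alert = bytes([0x15, 0x03, 0x03, 0x00, 0x02, 0x02, 0x28])

size = claimed_size(tls_alert)
print(f"{size} bytes (~{size / 2**20:.0f} MiB)")  # 352518912 bytes (~336 MiB)
```

If that is what is happening here, adding pod memory would not help, because the requested buffer size comes from misread bytes and not from real load.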
Any help on this would be great. Still seeing this with chart version 0.2.6.
I have this in my values.yml:
kafka:
  bootstrap:
    server: "KEY_ID:KEY_SECRET@CLUSTER_IDENTIFIER.asia-southeast1.gcp.confluent.cloud:9092"
  zookeeper:
    server: "prerequisites-zookeeper:2181"
  schemaregistry:
    url: "KEY_ID:KEY_SECRET@CLUSTER_IDENTIFIER.australia-southeast1.gcp.confluent.cloud"
As far as I can tell, Confluent Cloud does not provide a ZooKeeper URL, so I am not sure what to do with this setting.
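A note on the bootstrap server value in the config above: Kafka clients expect `bootstrap.servers` to be plain `host:port`, and an API key for SASL/PLAIN normally goes into separate client properties instead of the URL. The Python sketch below shows that split using standard Kafka client property names. The helper names are hypothetical, and how these properties are passed through the DataHub Helm chart is not covered in this thread, so treat it as an illustration only.

```python
def split_credentials(endpoint: str):
    """Split 'KEY:SECRET@host:port' into (key, secret, 'host:port')."""
    creds, _, host = endpoint.rpartition("@")
    key, _, secret = creds.partition(":")
    return key, secret, host


def confluent_client_properties(endpoint: str) -> dict:
    """Build standard Kafka client properties for SASL/PLAIN over TLS."""
    key, secret, host = split_credentials(endpoint)
    return {
        "bootstrap.servers": host,  # host:port only, no credentials
        "security.protocol": "SASL_SSL",
        "sasl.mechanism": "PLAIN",
        "sasl.jaas.config": (
            "org.apache.kafka.common.security.plain.PlainLoginModule required "
            f'username="{key}" password="{secret}";'
        ),
    }


# Placeholder endpoint copied from the values.yml snippet above.
props = confluent_client_properties(
    "KEY_ID:KEY_SECRET@CLUSTER_IDENTIFIER.asia-southeast1.gcp.confluent.cloud:9092"
)
for name, value in props.items():
    print(f"{name}={value}")
```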
@mammoth-bear-12532 @early-lamp-41924 any suggestions for this?
I see that the job is simply creating the 4 topics. I can do that manually in Confluent Cloud and disable this job.
m
Thanks for letting us know, @square-activity-64562. There is probably a way to do this using the admin client, which uses the broker connection instead of ZK. Will have to research that.
s
s
Hi @square-activity-64562 - Trying the same. Did you get it to work on Confluent Cloud without ZK?
s
Yes, I was able to do that @straight-salesclerk-65619. I had to disable the Kafka setup job and change some settings in the Helm config. @big-carpet-38439 is going to write a doc for that soon, which should help you out.
s
Thanks @square-activity-64562!
b
Yes! Hoping to get that out shortly
I'll send you what I have so far @straight-salesclerk-65619
s
Much appreciated @big-carpet-38439! Thanks!