Hi,
My temporal server is deployed on GKE cluster. We are using Cassandra for it’s db layer which is also deployed on another GKE cluster. We are continously observing following errors in temporal history service logs but our workflows are being executed successfully:
"cluster-name":"active", "component":"timer-queue-processor", "error":"Failed to update shard. previous_range_id: 9, columns: (range_id=11)", "level":"error", "logging-call-at":"timerQueueAckMgr.go:402", "msg":"Error updating timer ack level for shard", "service":"history", "shard-id":555,
"component":"visibility-queue-processor", "error":"Failed to update shard. previous_range_id: 14, columns: (range_id=16)", "level":"error", "logging-call-at":"queueAckMgr.go:225", "msg":"Error updating ack level for shard", "operation-result":"OperationFailed", "service":"history", "shard-id":441
"cluster-name":"active", "component":"transfer-queue-processor", "error":"Failed to update shard. previous_range_id: 11, columns: (range_id=15)", "level":"error", "logging-call-at":"queueAckMgr.go:225", "msg":"Error updating ack level for shard", "operation-result":"OperationFailed", "service":"history", "shard-id":378,
We have kept our numHistoryShards value to 4096. We tried to install temporal with fresh schema, but still seeing these errors in our logs.
We also observed one error in matching service:
"component":"matching-engine", "error":"Failed to update task queue. name: /_sys/DebitCardIntegrationActivityTaskQueue/1, type: Activity, rangeID: 4, columns: (range_id=5)", "level":"error", "logging-call-at":"taskReader.go:187", "msg":"Persistent store operation failure", "service":"matching"
What could be the issue? Please help.