Class: Karafka::Instrumentation::LoggerListener
- Inherits:
-
Object
- Object
- Karafka::Instrumentation::LoggerListener
- Defined in:
- lib/karafka/instrumentation/logger_listener.rb
Overview
Default listener that hooks up to our instrumentation and uses its events for logging It can be removed/replaced or anything without any harm to the Karafka app flow.
Instance Method Summary collapse
-
#initialize(log_polling: true) ⇒ LoggerListener
constructor
A new instance of LoggerListener.
- #on_app_quiet(_event) ⇒ Object
- #on_app_quieting(_event) ⇒ Object
-
#on_app_running(_event) ⇒ Object
Logs info that we’re running Karafka app.
-
#on_app_stopped(_event) ⇒ Object
Logs info that we stopped the Karafka server.
-
#on_app_stopping(_event) ⇒ Object
Logs info that we’re going to stop the Karafka server.
-
#on_client_pause(event) ⇒ Object
Prints info about a consumer pause occurrence.
-
#on_client_resume(event) ⇒ Object
Prints information about resuming of processing of a given topic partition.
-
#on_connection_listener_fetch_loop(event) ⇒ Object
Logs each messages fetching attempt.
-
#on_connection_listener_fetch_loop_received(event) ⇒ Object
Logs about messages that we’ve received from Kafka.
-
#on_consumer_consuming_retry(event) ⇒ Object
Prints info about retry of processing after an error.
-
#on_dead_letter_queue_dispatched(event) ⇒ Object
Logs info when we have dispatched a message the the DLQ.
-
#on_error_occurred(event) ⇒ Object
There are many types of errors that can occur in many places, but we provide a single handler for all of them to simplify error instrumentation.
- #on_filtering_seek(event) ⇒ Object
-
#on_filtering_throttled(event) ⇒ Object
Logs info about throttling event.
-
#on_process_notice_signal(event) ⇒ Object
Logs info about system signals that Karafka received and prints backtrace for threads in case of ttin.
- #on_swarm_manager_before_fork(event) ⇒ Object
- #on_swarm_manager_control(event) ⇒ Object
- #on_swarm_manager_stopping(event) ⇒ Object
- #on_swarm_manager_terminating(event) ⇒ Object
- #on_swarm_node_after_fork(_event) ⇒ Object
-
#on_worker_process(event) ⇒ Object
Prints info about the fact that a given job has started.
-
#on_worker_processed(event) ⇒ Object
Prints info about the fact that a given job has finished.
Constructor Details
#initialize(log_polling: true) ⇒ LoggerListener
Returns a new instance of LoggerListener.
23 24 25 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 23 def initialize(log_polling: true) @log_polling = log_polling end |
Instance Method Details
#on_app_quiet(_event) ⇒ Object
172 173 174 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 172 def on_app_quiet(_event) info 'Reached quiet mode. No messages will be processed anymore' end |
#on_app_quieting(_event) ⇒ Object
167 168 169 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 167 def on_app_quieting(_event) info 'Switching to quiet mode. New messages will not be processed' end |
#on_app_running(_event) ⇒ Object
Logs info that we’re running Karafka app.
157 158 159 160 161 162 163 164 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 157 def on_app_running(_event) info "Running in #{RUBY_DESCRIPTION}" info "Running Karafka #{Karafka::VERSION} server" return if Karafka.pro? info 'See LICENSE and the LGPL-3.0 for licensing details' end |
#on_app_stopped(_event) ⇒ Object
Logs info that we stopped the Karafka server.
186 187 188 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 186 def on_app_stopped(_event) info 'Stopped Karafka server' end |
#on_app_stopping(_event) ⇒ Object
Logs info that we’re going to stop the Karafka server.
179 180 181 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 179 def on_app_stopping(_event) info 'Stopping Karafka server' end |
#on_client_pause(event) ⇒ Object
There may be no offset provided in case user wants to pause on the consecutive offset position. This can be beneficial when not wanting to purge the buffers.
Prints info about a consumer pause occurrence. Irrelevant if user or system initiated.
88 89 90 91 92 93 94 95 96 97 98 99 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 88 def on_client_pause(event) topic = event[:topic] partition = event[:partition] offset = event[:offset] client = event[:caller] info <<~MSG.tr("\n", ' ').strip! [#{client.id}] Pausing on topic #{topic}/#{partition} on #{offset ? "offset #{offset}" : 'the consecutive offset'} MSG end |
#on_client_resume(event) ⇒ Object
Prints information about resuming of processing of a given topic partition
104 105 106 107 108 109 110 111 112 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 104 def on_client_resume(event) topic = event[:topic] partition = event[:partition] client = event[:caller] info <<~MSG.tr("\n", ' ').strip! [#{client.id}] Resuming on topic #{topic}/#{partition} MSG end |
#on_connection_listener_fetch_loop(event) ⇒ Object
Logs each messages fetching attempt
30 31 32 33 34 35 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 30 def on_connection_listener_fetch_loop(event) return unless log_polling? listener = event[:caller] debug "[#{listener.id}] Polling messages..." end |
#on_connection_listener_fetch_loop_received(event) ⇒ Object
Logs about messages that we’ve received from Kafka
40 41 42 43 44 45 46 47 48 49 50 51 52 53 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 40 def on_connection_listener_fetch_loop_received(event) return unless log_polling? listener = event[:caller] time = event[:time].round(2) = event[:messages_buffer].size = "[#{listener.id}] Polled #{} messages in #{time}ms" # We don't want the "polled 0" in dev as it would spam the log # Instead we publish only info when there was anything we could poll and fail over to the # zero notifications when in debug mode .zero? ? debug() : info() end |
#on_consumer_consuming_retry(event) ⇒ Object
Prints info about retry of processing after an error
117 118 119 120 121 122 123 124 125 126 127 128 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 117 def on_consumer_consuming_retry(event) topic = event[:topic] partition = event[:partition] offset = event[:offset] consumer = event[:caller] timeout = event[:timeout] info <<~MSG.tr("\n", ' ').strip! [#{consumer.id}] Retrying of #{consumer.class} after #{timeout} ms on topic #{topic}/#{partition} from offset #{offset} MSG end |
#on_dead_letter_queue_dispatched(event) ⇒ Object
Logs info when we have dispatched a message the the DLQ
193 194 195 196 197 198 199 200 201 202 203 204 205 206 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 193 def on_dead_letter_queue_dispatched(event) consumer = event[:caller] topic = consumer.topic.name = event[:message] offset = .offset dlq_topic = consumer.topic.dead_letter_queue.topic partition = .partition info <<~MSG.tr("\n", ' ').strip! [#{consumer.id}] Dispatched message #{offset} from #{topic}/#{partition} to DLQ topic: #{dlq_topic} MSG end |
#on_error_occurred(event) ⇒ Object
There are many types of errors that can occur in many places, but we provide a single handler for all of them to simplify error instrumentation.
272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 272 def on_error_occurred(event) type = event[:type] error = event[:error] details = (error.backtrace || []).join("\n") case type when 'consumer.consume.error' error "Consumer consuming error: #{error}" error details when 'consumer.revoked.error' error "Consumer on revoked failed due to an error: #{error}" error details when 'consumer.idle.error' error "Consumer idle failed due to an error: #{error}" error details when 'consumer.shutdown.error' error "Consumer on shutdown failed due to an error: #{error}" error details when 'consumer.tick.error' error "Consumer on tick failed due to an error: #{error}" error details when 'consumer.after_consume.error' error "Consumer on after_consume failed due to an error: #{error}" error details when 'worker.process.error' fatal "Worker processing failed due to an error: #{error}" fatal details when 'connection.listener.fetch_loop.error' error "Listener fetch loop error: #{error}" error details when 'swarm.supervisor.error' fatal "Swarm supervisor crashed due to an error: #{error}" fatal details when 'runner.call.error' fatal "Runner crashed due to an error: #{error}" fatal details when 'app.stopping.error' # Counts number of workers and listeners that were still active when forcing the # shutdown. Please note, that unless all listeners are closed, workers will not finalize # their operations as well. # We need to check if listeners and workers are assigned as during super early stages of # boot they are not. listeners = Server.listeners ? Server.listeners.count(&:active?) : 0 workers = Server.workers ? Server.workers.count(&:alive?) : 0 = <<~MSG.tr("\n", ' ').strip! Forceful Karafka server stop with: #{workers} active workers and #{listeners} active listeners MSG error when 'app.forceful_stopping.error' error "Forceful shutdown error occurred: #{error}" error details when 'librdkafka.error' error "librdkafka internal error occurred: #{error}" error details # Those can occur when emitted statistics are consumed by the end user and the processing # of statistics fails. The statistics are emitted from librdkafka main loop thread and # any errors there crash the whole thread when 'callbacks.statistics.error' error "callbacks.statistics processing failed due to an error: #{error}" error details when 'callbacks.error.error' error "callbacks.error processing failed due to an error: #{error}" error details # Those will only occur when retries in the client fail and when they did not stop after # back-offs when 'connection.client.poll.error' error "Data polling error occurred: #{error}" error details when 'connection.client.rebalance_callback.error' error "Rebalance callback error occurred: #{error}" error details when 'connection.client.unsubscribe.error' error "Client unsubscribe error occurred: #{error}" error details # This handles any custom errors coming from places like Web-UI, etc else error "#{type} error occurred: #{error}" error details end end |
#on_filtering_seek(event) ⇒ Object
227 228 229 230 231 232 233 234 235 236 237 238 239 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 227 def on_filtering_seek(event) consumer = event[:caller] topic = consumer.topic.name # Message to which we seek = event[:message] partition = .partition offset = .offset info <<~MSG.tr("\n", ' ').strip! [#{consumer.id}] Post-filtering seeking to message #{offset} on #{topic}/#{partition} MSG end |
#on_filtering_throttled(event) ⇒ Object
Logs info about throttling event
211 212 213 214 215 216 217 218 219 220 221 222 223 224 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 211 def on_filtering_throttled(event) consumer = event[:caller] topic = consumer.topic.name # Here we get last message before throttle = event[:message] partition = .partition offset = .offset info <<~MSG.tr("\n", ' ').strip! [#{consumer.id}] Throttled and will resume from message #{offset} on #{topic}/#{partition} MSG end |
#on_process_notice_signal(event) ⇒ Object
Logs info about system signals that Karafka received and prints backtrace for threads in case of ttin
134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 134 def on_process_notice_signal(event) info "Received #{event[:signal]} system signal" # We print backtrace only for ttin return unless event[:signal] == :SIGTTIN # Inspired by Sidekiq Thread.list.each do |thread| tid = (thread.object_id ^ ::Process.pid).to_s(36) warn "Thread TID-#{tid} #{thread.name}" if thread.backtrace warn thread.backtrace.join("\n") else warn '<no backtrace available>' end end end |
#on_swarm_manager_before_fork(event) ⇒ Object
254 255 256 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 254 def on_swarm_manager_before_fork(event) debug "Swarm manager starting node with id: #{event[:node].id}" end |
#on_swarm_manager_control(event) ⇒ Object
264 265 266 267 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 264 def on_swarm_manager_control(event) pids = event[:caller].nodes.map(&:pid).join(', ') debug "Swarm manager checking nodes: #{pids}" end |
#on_swarm_manager_stopping(event) ⇒ Object
242 243 244 245 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 242 def on_swarm_manager_stopping(event) node = event[:node] error "Swarm manager detected unhealthy node #{node.pid}. Sending TERM signal..." end |
#on_swarm_manager_terminating(event) ⇒ Object
248 249 250 251 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 248 def on_swarm_manager_terminating(event) node = event[:node] error "Swarm manager detected unresponsive node #{node.pid}. Sending KILL signal..." end |
#on_swarm_node_after_fork(_event) ⇒ Object
259 260 261 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 259 def on_swarm_node_after_fork(_event) info "Swarm node #{::Process.pid} forked from #{::Process.ppid}" end |
#on_worker_process(event) ⇒ Object
Prints info about the fact that a given job has started
58 59 60 61 62 63 64 65 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 58 def on_worker_process(event) job = event[:job] job_type = job.class.to_s.split('::').last consumer = job.executor.topic.consumer topic = job.executor.topic.name partition = job.executor.partition info "[#{job.id}] #{job_type} job for #{consumer} on #{topic}/#{partition} started" end |
#on_worker_processed(event) ⇒ Object
Prints info about the fact that a given job has finished
70 71 72 73 74 75 76 77 78 79 80 81 |
# File 'lib/karafka/instrumentation/logger_listener.rb', line 70 def on_worker_processed(event) job = event[:job] time = event[:time].round(2) job_type = job.class.to_s.split('::').last consumer = job.executor.topic.consumer topic = job.executor.topic.name partition = job.executor.partition info <<~MSG.tr("\n", ' ').strip! [#{job.id}] #{job_type} job for #{consumer} on #{topic}/#{partition} finished in #{time} ms MSG end |