Class: Karafka::Swarm::Supervisor

Inherits:

Object

Object
Karafka::Swarm::Supervisor

show all

Includes:: Core::Helpers::Time

Defined in:: lib/karafka/swarm/supervisor.rb

Overview

Note:

Technically speaking supervisor is never in the running state because we do not want to have any sockets or anything else on it that could break under forking. It has its own “supervising” state from which it can go to the final shutdown.

Supervisor that starts forks and uses monitor to monitor them. Also handles shutdown of all the processes including itself.

In case any node dies, it will be restarted.

Instance Method Summary collapse

#initialize ⇒ Supervisor constructor

A new instance of Supervisor.
#run ⇒ Object

Creates needed number of forks, installs signals and starts supervision.

Constructor Details

#initialize ⇒ `Supervisor`

Returns a new instance of Supervisor.

# File 'lib/karafka/swarm/supervisor.rb', line 37

def initialize
  @mutex = Mutex.new
  @queue = Processing::TimedQueue.new
end

Instance Method Details

#run ⇒ `Object`

Creates needed number of forks, installs signals and starts supervision

# File 'lib/karafka/swarm/supervisor.rb', line 43

def run
  # Validate the CLI provided options the same way as we do for the regular server
  cli_contract.validate!(
    activity_manager.to_h,
    scope: %w[swarm cli]
  )

  # Close producer just in case. While it should not be used, we do not want even a
  # theoretical case since librdkafka is not thread-safe.
  # We close it prior to forking just to make sure, there is no issue with initialized
  # producer (should not be initialized but just in case)
  Karafka.producer.close

  # Ensure rdkafka stuff is loaded into memory pre-fork. This will ensure, that we save
  # few MB on forking as this will be already in memory.
  Rdkafka::Bindings.rd_kafka_global_init

  Karafka::App.warmup

  manager.start

  process.on_sigint { stop }
  process.on_sigquit { stop }
  process.on_sigterm { stop }
  process.on_sigtstp { quiet }
  process.on_sigttin { signal('TTIN') }
  # Needed to be registered as we want to unlock on child changes
  process.on_sigchld {}
  process.on_any_active { unlock }
  process.supervise

  Karafka::App.supervise!

  loop do
    return if Karafka::App.terminated?

    lock
    control
  end

# If the cli contract validation failed reraise immediately and stop the process
rescue Karafka::Errors::InvalidConfigurationError => e
  raise e
# If anything went wrong during supervision, signal this and die
# Supervisor is meant to be thin and not cause any issues. If you encounter this case
# please report it as it should be considered critical
rescue StandardError => e
  monitor.instrument(
    'error.occurred',
    caller: self,
    error: e,
    manager: manager,
    type: 'swarm.supervisor.error'
  )

  manager.terminate
  manager.cleanup

  raise e
end