Class: Ignis::AI::BatchProcessor

Inherits:

Object

Object
Ignis::AI::BatchProcessor

show all

Defined in:: lib/nnw/ai/inference.rb

Overview

BatchProcessor — concurrent inference with dynamic batching.

Instance Method Summary collapse

#initialize(model, tokenizer, max_batch_size: 8, max_wait_ms: 50) ⇒ BatchProcessor constructor

A new instance of BatchProcessor.
#start! ⇒ Thread

Start the batch processing loop in a background thread.
#stop! ⇒ void

Stop the processor.
#submit(prompt, **params) ⇒ String

Submit a request for processing.

Constructor Details

#initialize(model, tokenizer, max_batch_size: 8, max_wait_ms: 50) ⇒ `BatchProcessor`

Returns a new instance of BatchProcessor.

Parameters:

model (Transformer::Model)
tokenizer (Tokenizer)
max_batch_size (Integer) (defaults to: 8)
max_wait_ms (Integer) (defaults to: 50) —

max milliseconds to wait for batch fill

# File 'lib/nnw/ai/inference.rb', line 156

def initialize(model, tokenizer, max_batch_size: 8, max_wait_ms: 50)
  @generator = TextGenerator.new(model, tokenizer)
  @max_batch_size = max_batch_size
  @max_wait_ms = max_wait_ms
  @queue = Queue.new
  @running = false
end

Instance Method Details

#start! ⇒ `Thread`

Start the batch processing loop in a background thread.

Returns:

(Thread)

# File 'lib/nnw/ai/inference.rb', line 176

def start!
  @running = true
  @thread = Thread.new { batch_loop }
  @thread
end

#stop! ⇒ `void`

This method returns an undefined value.

Stop the processor.

# File 'lib/nnw/ai/inference.rb', line 184

def stop!
  @running = false
  @thread&.join(5)
end

#submit(prompt, **params) ⇒ `String`

Submit a request for processing.