PatientHttp::SolidQueue

Built for APIs that like to think.

This gem provides a mechanism to offload HTTP requests to a dedicated async I/O processor running in your Solid Queue worker process using the patient_http gem. Worker threads are freed immediately while HTTP requests are in flight so that they can do other work instead of waiting for HTTP responses.

Motivation

Solid Queue is designed with the assumption that jobs are short-lived and complete quickly. Long-running HTTP requests block worker threads from processing other jobs, leading to increased latency and reduced throughput. This is particularly problematic when calling LLM or AI APIs, where requests can take many seconds to complete.

The Problem:

┌────────────────────────────────────────────────────────────────────────┐
│                    Traditional Solid Queue Job                         │
│                                                                        │
│  Worker Thread 1: [████████████ HTTP Request (5s) ████████████████]    │
│  Worker Thread 2: [████████████ HTTP Request (5s) ████████████████]    │
│  Worker Thread 3: [████████████ HTTP Request (5s) ████████████████]    │
│                                                                        │
│  → 3 workers blocked for 5 seconds = 0 jobs processed                  │
└────────────────────────────────────────────────────────────────────────┘

The Solution:

┌────────────────────────────────────────────────────────────────────────┐
│                     With Async HTTP Processor                          │
│                                                                        │
│  Worker Thread 1: [█ Enqueue █][█ Job █][█ Job █][█ Job █][█ Job █]    │
│  Worker Thread 2: [█ Enqueue █][█ Job █][█ Job █][█ Job █][█ Job █]    │
│  Worker Thread 3: [█ Enqueue █][█ Job █][█ Job █][█ Job █][█ Job █]    │
│                                                                        │
│  Async Processor: [═══════════ 100+ concurrent HTTP requests ════════] │
│                                                                        │
│  → Workers immediately free = dozens of jobs processed                 │
└────────────────────────────────────────────────────────────────────────┘

The async processor runs in a dedicated thread within your Solid Queue worker process, using Ruby's Fiber-based concurrency to handle hundreds of concurrent HTTP requests without blocking. When an HTTP request completes, a callback service is invoked for processing.

Quick Start

1. Create a Callback Service

Define a callback service class with on_complete and on_error methods:

class FetchDataCallback
  def on_complete(response)
    user_id = response.callback_args[:user_id]
    if response.success?
      data = response.json
      User.find(user_id).update!(external_data: data)
    else
      Rails.logger.error("HTTP #{response.status} fetching data for user #{user_id}")
    end
  end

  def on_error(error)
    user_id = error.callback_args[:user_id]
    Rails.logger.error("Failed to fetch data for user #{user_id}: #{error.message}")
  end
end

2. Make HTTP Requests

Make HTTP requests from anywhere in your code using PatientHttp:

PatientHttp.get(
  "https://api.example.com/users/#{user_id}",
  headers: {"Authorization" => "Bearer #{ENV['API_KEY']}"},
  callback: FetchDataCallback,
  callback_args: {user_id: user_id}
)

3. That's It!

The request will be enqueued as an Active Job and passed to a PatientHttp processor to execute asynchronously. When the HTTP request completes, your callback's on_complete method is executed in another Active Job.

If an error occurs during the request, the on_error method is called instead.

You can also call PatientHttp.post, PatientHttp.put, PatientHttp.patch, and PatientHttp.delete for other HTTP methods. See the patient_http docs for the full API reference.

The response.callback_args and error.callback_args provide access to the arguments you passed via the callback_args option.

[!IMPORTANT] Do not re-raise errors in the on_error callback as a means to retry the request. That will just retry the error callback job. If you want to retry the original request, you can enqueue a new request from within on_error. Be careful with this approach, though, as it can lead to infinite retry loops if the error condition is not resolved.

Also note that the error callback is only called when an exception occurs during the HTTP request (timeout, connection failure, etc). HTTP error status codes (4xx, 5xx) do not trigger the error callback by default. Instead, they are treated as completed requests and passed to the on_complete callback. See the "Handling HTTP Error Responses" section below for how to treat HTTP errors as exceptions.

Handling HTTP Error Responses

By default, HTTP error status codes (4xx, 5xx) are treated as successful responses and passed to the on_complete callback. You can check the status using response.success?, response.client_error?, or response.server_error?:

class ApiCallback
  def on_complete(response)
    if response.success?
      process_data(response.json)
    elsif response.client_error?
      handle_client_error(response.status, response.body)
    elsif response.server_error?
      handle_server_error(response.status, response.body)
    end
  end

  def on_error(error)
    Rails.logger.error("Request failed: #{error.message}")
  end
end

PatientHttp.get(
  "https://api.example.com/data/#{id}",
  callback: ApiCallback
)

If you prefer to treat HTTP errors as exceptions, you can use the raise_error_responses option. When enabled, non-2xx responses will call the on_error callback with an HttpError instead:

class ApiCallback
  def on_complete(response)
    # Only called for 2xx responses
    process_data(response.json)
  end

  def on_error(error)
    # Called for exceptions AND HTTP errors when using raise_error_responses
    if error.is_a?(PatientHttp::HttpError)
      # Access the response via error.response
      Rails.logger.error("HTTP #{error.status} from #{error.url}: #{error.response.body}")
    else
      # Regular request errors (timeout, connection, etc)
      Rails.logger.error("Request failed: #{error.message}")
    end
  end
end

PatientHttp.get(
  "https://api.example.com/data/#{id}",
  callback: ApiCallback,
  raise_error_responses: true
)

The HttpError provides convenient access to the response:

def on_error(error)
  if error.is_a?(PatientHttp::HttpError)
    puts error.status              # HTTP status code
    puts error.url                 # Request URL
    puts error.http_method         # HTTP method
    puts error.response.body       # Response body
    puts error.response.headers    # Response headers
    puts error.response.json       # Parse JSON response (if applicable)
  end
end

Usage Patterns

Making Requests

The primary interface for making requests is through the PatientHttp module, which provides convenience methods for all HTTP verbs:

# GET request
PatientHttp.get("https://api.example.com/users/123",
  callback: MyCallback, callback_args: {user_id: 123})

# POST request with JSON body
PatientHttp.post("https://api.example.com/users",
  json: {name: "John", email: "john@example.com"},
  callback: MyCallback)

# PUT request
PatientHttp.put("https://api.example.com/users/123",
  json: {name: "Updated Name"},
  callback: MyCallback)

# PATCH request
PatientHttp.patch("https://api.example.com/users/123",
  json: {status: "active"},
  callback: MyCallback)

# DELETE request
PatientHttp.delete("https://api.example.com/users/123",
  callback: MyCallback)

Available request options:

callback: - (required) Callback service class or class name
callback_args: - Hash of arguments passed to callback via response/error
headers: - Request headers
body: - Request body (for POST/PUT/PATCH)
json: - Object to serialize as JSON body (cannot use with body)
params: - Query parameters to append to URL
timeout: - Request timeout in seconds
raise_error_responses: - Treat non-2xx responses as errors

You can also build a PatientHttp::Request object and pass it to PatientHttp.execute for more control:

request = PatientHttp::Request.new(:get, "https://api.example.com/users/123",
  headers: {"Authorization" => "Bearer token"},
  params: {include: "profile"},
  timeout: 30
)
PatientHttp.execute(request: request, callback: MyCallback, callback_args: {user_id: 123})

See the patient_http docs for the full Request and Response API reference.

Using Request Templates

For repeated requests to the same API, use PatientHttp::RequestTemplate to share configuration:

class ApiService
  def initialize
    @template = PatientHttp::RequestTemplate.new(
      base_url: "https://api.example.com",
      headers: {"Authorization" => "Bearer #{ENV['API_KEY']}"},
      timeout: 60
    )
  end

  def fetch_user(user_id)
    request = @template.get("/users/#{user_id}")
    PatientHttp.execute(
      request: request,
      callback: FetchUserCallback,
      callback_args: {user_id: user_id}
    )
  end

  def update_user(user_id, attributes)
    request = @template.patch("/users/#{user_id}", json: attributes)
    PatientHttp.execute(
      request: request,
      callback: UpdateUserCallback,
      callback_args: {user_id: user_id}
    )
  end
end

Using the RequestHelper Module

For classes that make many async HTTP requests, you can include PatientHttp::RequestHelper to get convenient instance methods like async_get, async_post, async_put, async_patch, and async_delete. You can also define a request template at the class level using the request_template class method to set shared options like base_url, headers, and timeout.

When using this gem, the request handler is automatically registered when the processor starts and unregistered when it stops — no manual setup is required.

class NotificationService
  include PatientHttp::RequestHelper

  request_template base_url: "https://api.example.com",
                   headers: {"Authorization" => "Bearer #{ENV['API_KEY']}"},
                   timeout: 30

  def notify_user(user_id, message)
    async_post("/notifications",
      json: {user_id: user_id, message: message},
      callback: NotificationCallback,
      callback_args: {user_id: user_id}
    )
  end

  def fetch_user(user_id)
    async_get("/users/#{user_id}",
      callback: FetchUserCallback,
      callback_args: {user_id: user_id}
    )
  end
end

The async_* methods accept the same options as PatientHttp.get, PatientHttp.post, etc. Paths are resolved relative to the base_url defined in the request template.

See the patient_http gem for the full RequestHelper documentation.

Callback Arguments

Pass custom data to your callbacks using the callback_args option:

class FetchDataCallback
  def on_complete(response)
    # Access callback_args using symbol or string keys
    user_id = response.callback_args[:user_id]
    request_timestamp = response.callback_args[:request_timestamp]

    User.find(user_id).update!(
      external_data: response.json,
      fetched_at: request_timestamp
    )
  end

  def on_error(error)
    user_id = error.callback_args[:user_id]
    request_timestamp = error.callback_args[:request_timestamp]

    Rails.logger.error(
      "Failed to fetch data for user #{user_id} at #{request_timestamp}: #{error.message}"
    )
  end
end

# Pass data via callback_args option
PatientHttp.get(
  "https://api.example.com/users/#{user_id}",
  callback: FetchDataCallback,
  callback_args: {
    user_id: user_id,
    request_timestamp: Time.now.iso8601
  }
)

Important details about callback_args:

Must be a Hash (or respond to to_h) containing only JSON-native types: nil, true, false, String, Integer, Float, Array, or Hash
Hash keys will be converted to strings for serialization
Nested hashes and hashes in arrays also have their keys converted to strings
You can access callback_args using either symbol or string keys: callback_args[:user_id] or callback_args["user_id"]

Sensitive Data Handling

Requests and responses from asynchronous HTTP requests may be stored in your queue backend (and optionally external storage) in order to execute completion callbacks. This can raise security concerns if they contain sensitive data since the data will be stored in plain text.

Encryption is configured on the parent patient_http gem. You can set an encryption_key to automatically encrypt and decrypt request and response data using ActiveSupport::MessageEncryptor:

PatientHttp::SolidQueue.configure do |config|
  config.encryption_key = Rails.application.credentials.patient_http_secret
end

See the patient_http gem for full documentation on encryption options, including key rotation and custom encryption callables.

Configuration

The gem can be configured globally in an initializer:

PatientHttp::SolidQueue.configure do |config|
  # Maximum concurrent HTTP requests (default: 256)
  config.max_connections = 256

  # Default timeout for HTTP requests in seconds (default: 60)
  config.request_timeout = 60

  # Maximum number of host clients to pool (default: 100)
  config.connection_pool_size = 100

  # Connection timeout in seconds (default: nil, uses request_timeout)
  config.connection_timeout = 10

  # Number of retries for failed requests (default: 3)
  config.retries = 3

  # Handler called when a callback job exhausts all Sidekiq retries
  config.on_retries_exhausted { |error| MyAlertService.notify(error) }

  # HTTP/HTTPS proxy URL (default: nil)
  # Supports authentication: "http://user:pass@proxy.example.com:8080"
  config.proxy_url = "http://proxy.example.com:8080"

  # Default User-Agent header for all requests (default: "PatientHttp")
  config.user_agent = "MyApp/1.0"

  # Timeout for graceful shutdown in seconds
  # (default: SolidQueue.shutdown_timeout - 2)
  # This should be less than your worker shutdown timeout
  config.shutdown_timeout = 23

  # Maximum response body size in bytes (default: 1MB)
  # Responses larger than this will trigger ResponseTooLargeError
  config.max_response_size = 1024 * 1024

  # Maximum number of redirects to follow (default: 5, 0 disables)
  config.max_redirects = 5

  # Whether to raise HttpError for non-2xx responses by default (default: false)
  config.raise_error_responses = false

  # Heartbeat interval for crash recovery in seconds (default: 60)
  config.heartbeat_interval = 60

  # Orphan detection threshold in seconds (default: 300)
  # Requests older than this without a heartbeat will be re-enqueued
  config.orphan_threshold = 300

  # Size threshold in bytes for external payload storage (default: 64KB)
  # Payloads larger than this will be stored externally when a payload
  # store is configured.
  config.payload_store_threshold = 64 * 1024

  # Queue name for RequestJob and CallbackJob (default: nil, Active Job default)
  config.queue_name = "async_http"

  # Custom logger (defaults to SolidQueue.logger)
  config.logger = Rails.logger

  # Encryption key for sensitive data (see Sensitive Data Handling)
  # Accepts a string or an array of strings for key rotation.
  # Encryption is provided by the parent patient_http gem.
  config.encryption_key = Rails.application.credentials.patient_http_secret
end

See the Configuration class for all available options.

Tuning Tips

max_connections: Adjust this based on your system's resources. Each connection uses memory and file descriptors. A tuned system with sufficient resources can handle thousands of concurrent connections.
request_timeout: Set this based on the expected response times of the APIs you are calling. AI APIs might sometimes take minutes to respond as they generate content.
connection_pool_size: Controls how many connections to different hosts are kept alive. Increase for applications calling many different API endpoints.
connection_timeout: Set this if you need to fail fast on connection establishment. Useful for detecting network issues quickly.
retries: Number of times to retry a failed request before calling the error callback.
max_response_size: Set this to limit the maximum size of HTTP responses. This helps prevent excessive memory usage from unexpectedly large responses. Responses need to be serialized as Active Job arguments and very large responses may cause performance issues. If a response body is text content, it will be compressed to save space. However, binary content needs to be Base64 encoded which increases size by ~33%.
payload_store_threshold: Lower this if your queue backend struggles with large payloads; higher values avoid extra external storage reads/writes.
heartbeat_interval and orphan_threshold: For high-churn workloads, keep heartbeat_interval as large as your recovery SLO allows (while still less than orphan_threshold) to reduce write/update pressure on monitoring tables. If Solid Queue uses PostgreSQL and request volume is high, tune autovacuum for the queue database tables because inflight_requests is intentionally insert/update/delete heavy.

[!IMPORTANT]

One difference between using this gem and making synchronous HTTP requests from a Solid Queue job is that if max_connections is reached due to slow asynchronous requests, new requests will trigger an error on the Active Job. The Active Job retry mechanism will handle re-enqueuing the job.

In contrast, slow synchronous HTTP requests will fill up the worker pool and block new jobs from being dequeued until a worker thread becomes free.

In general, the former behavior is preferable because it allows Solid Queue to continue processing other jobs and prevents getting into a state with 1000's of jobs stuck in the queue.

Metrics and Monitoring

Callbacks for Custom Monitoring

You can register callbacks to integrate with your monitoring system using the after_completion and after_error hooks:

PatientHttp::SolidQueue.after_completion do |response|
  StatsD.timing("patient_http.duration", response.duration * 1000)
  StatsD.increment("patient_http.status.#{response.status}")
end

PatientHttp::SolidQueue.after_error do |error|
  error_type = error.is_a?(PatientHttp::Error) ? error.error_type : "exception"
  StatsD.increment("patient_http.error.#{error_type}")
  Rails.logger.error("Async HTTP error: #{error.class.name} - #{error.message}")
end

You can register multiple callbacks; they will be called in the order registered.

Shutdown Behavior

The async HTTP processor automatically hooks in with Solid Queue's lifecycle events.

Startup: Processor starts automatically when Solid Queue starts a worker
Shutdown: Processor waits up to shutdown_timeout seconds for in-flight requests to complete

Incomplete Request Handling

If requests are still in-flight when shutdown times out:

In-flight requests are interrupted
The original Active Job is automatically re-enqueued
Re-enqueued jobs will be processed again when workers are available

This ensures no work is lost during deployments or restarts.

Crash Recovery

The gem includes crash recovery to handle process failures:

Heartbeat Tracking: Every heartbeat_interval seconds, the processor updates heartbeat timestamps for all in-flight requests in the database
Orphan Detection: One processor periodically checks for requests that haven't received a heartbeat update in orphan_threshold seconds
Automatic Re-enqueue: Orphaned requests have their original Active Jobs re-enqueued

This ensures that if a worker process crashes, its in-flight requests will be retried by another process.

Testing

The gem supports testing with Active Job test adapters. When in test mode (PatientHttp.testing?), async HTTP requests are executed immediately within the worker thread, blocking until completion. This allows you to write tests that verify the full request/response cycle without needing the async processor to be running.

Installation

Add this line to your application's Gemfile:

gem "patient_http-solid_queue"

Then execute:

bundle install

Install and run the gem migrations:

bin/rails patient_http_solid_queue:install:migrations
bin/rails db:migrate

The database tables are used for crash recovery and monitoring of in-flight requests and need to be added to the same database that Solid Queue uses.

By default, this install task copies migrations to the queue database migration path (typically db/queue_migrate). If your Solid Queue database name is different, override it with DATABASE=your_database_name.

For a typical multi-database setup, ensure your queue database config defines its own migration path:

development:
  primary:
    adapter: sqlite3
    database: storage/development.sqlite3
  queue:
    adapter: sqlite3
    database: storage/development_queue.sqlite3
    migrations_paths:
      - db/queue_migrate

PostgreSQL example:

development:
  primary:
    adapter: postgresql
    database: my_app_development
  queue:
    adapter: postgresql
    database: my_app_queue_development
    migrations_paths:
      - db/queue_migrate

Then run:

bin/rails patient_http_solid_queue:install:migrations
bin/rails db:queue:migrate

If your Solid Queue database is not named queue, pass its name explicitly when installing migrations:

bin/rails patient_http_solid_queue:install:migrations DATABASE=solid_queue
bin/rails db:migrate:solid_queue

Contributing

Open a pull request on GitHub.

Please use the standardrb syntax and lint your code with standardrb --fix before submitting.

Run the test suite with:

bundle exec rake

There is also a bundled test app in the test_app directory that can be used for manual testing and experimentation.

To run the test app, first install the dependencies:

bundle exec rake test_app:bundle

The server will run on http://localhost:9292 and can be started with:

bundle exec rake test_app

License

The gem is available as open source under the terms of the MIT License.