Class: Woods::Retriever

Inherits:

Object

Object
Woods::Retriever

show all

Defined in:: lib/woods/retriever.rb

Overview

Retriever orchestrates the full retrieval pipeline: classify, execute, rank, and assemble context from a natural language query.

Coordinates four internal components:

Woods::Retrieval::QueryClassifier — determines intent, scope, target type
Woods::Retrieval::SearchExecutor — maps classification to search strategy
Woods::Retrieval::Ranker — re-ranks candidates with weighted signals
Woods::Retrieval::ContextAssembler — builds token-budgeted context string

Optionally builds a structural context overview (codebase unit counts by type) that is prepended to the assembled context.

Examples:

retriever = Woods::Retriever.new(
  vector_store: vector_store,
  metadata_store: metadata_store,
  graph_store: graph_store,
  embedding_provider: embedding_provider
)
result = retriever.retrieve("How does the User model work?")
result.context        # => "Codebase: 42 units (10 models, ...)\n\n---\n\n## User (model)..."
result.strategy       # => :vector
result.tokens_used    # => 4200

Defined Under Namespace

Classes: RetrievalResult, RetrievalTrace

Constant Summary collapse

OLLAMA_EMBEDDING_MODELS = BERT / WordPiece-family embedders Ollama commonly serves. Matched against ‘provider.model_name` to decide whether to use the 1.5 chars/token ratio and wire in an exact Embedding::TokenCounter. Extend this list when new WordPiece-family models become popular —the tiktoken 4.0 default remains the safe fallback for unknowns.

Regexp.union(
  /\Anomic-embed/, /\Abge-/, /\Amxbai-embed/,
  /\Asnowflake-arctic/, /\Aall-minilm/, /\Aparaphrase-/,
  /\Ae5-/, /\Agte-/, /\Astella/,
  /\Agranite-embedding/, /\Ajina-embeddings/
).freeze

STRUCTURAL_TYPES = Unit types queried for the structural context overview.

%w[model controller service job mailer component graphql].freeze

DEFAULT_EXCLUDE_TYPES = Unit types excluded from retrieval by default. test_mapping units make up ~33% of a typical index and lexically dominate semantic rank for production queries (“stripe webhook” often surfaces stripe_webhook_spec.rb above the actual controller). Callers can override by passing types: (include-only) or an explicit exclude_types:.

%w[test_mapping].freeze

Instance Attribute Summary collapse

#graph_store ⇒ Object readonly

Direct handles to the injected stores.
#metadata_store ⇒ Object readonly

Direct handles to the injected stores.
#vector_store ⇒ Object readonly

Direct handles to the injected stores.

Instance Method Summary collapse

#initialize(vector_store:, metadata_store:, graph_store:, embedding_provider:, formatter: nil) ⇒ Retriever constructor

A new instance of Retriever.
#retrieve(query, budget: 8000, types: nil, exclude_types: nil) ⇒ RetrievalResult

Execute the full retrieval pipeline for a natural language query.

Constructor Details

#initialize(vector_store:, metadata_store:, graph_store:, embedding_provider:, formatter: nil) ⇒ `Retriever`

Returns a new instance of Retriever.

Parameters:

vector_store (Storage::VectorStore::Interface) —

Vector store adapter
metadata_store (Storage::MetadataStore::Interface) —

Metadata store adapter
graph_store (Storage::GraphStore::Interface) —

Graph store adapter
embedding_provider (Embedding::Provider::Interface) —

Embedding provider
formatter (#call, nil) (defaults to: nil) —

Optional callable to post-process the context string

# File 'lib/woods/retriever.rb', line 106

def initialize(vector_store:, metadata_store:, graph_store:, embedding_provider:, formatter: nil)
  @vector_store = vector_store
  @metadata_store = metadata_store
  @graph_store = graph_store
  @formatter = formatter

  @classifier = Retrieval::QueryClassifier.new
  @executor = Retrieval::SearchExecutor.new(
    vector_store: vector_store,
    metadata_store: metadata_store,
    graph_store: graph_store,
    embedding_provider: embedding_provider
  )
  @ranker = Retrieval::Ranker.new(metadata_store: metadata_store, graph_store: graph_store)
  # Match truncation sizing to the embedding provider's tokenizer so
  # Ollama-indexed corpora (ratio ~1.5) don't get over-truncated by
  # an OpenAI-sized default (4.0). Unknown/missing providers fall
  # back to the OpenAI-friendly default.
  chars_per_token = infer_chars_per_token(embedding_provider)
  @assembler = Retrieval::ContextAssembler.new(
    metadata_store: metadata_store,
    chars_per_token: chars_per_token,
    token_counter: infer_token_counter(embedding_provider)
  )
end

Instance Attribute Details

#graph_store ⇒ `Object` (readonly)

Direct handles to the injected stores. The sub-components (Woods::Retrieval::SearchExecutor, Woods::Retrieval::Ranker, Woods::Retrieval::ContextAssembler) hold their own references too, but those are implementation details — callers that want to mutate store contents (e.g. the MCP reload tool) read through these accessors. All three refer to the same Ruby objects the sub-components were initialised with, so in-place #clear! + #bulk_load propagates through the entire pipeline without re-instantiating sub-components.



99
100
101

# File 'lib/woods/retriever.rb', line 99

def graph_store
  @graph_store
end

#metadata_store ⇒ `Object` (readonly)



99
100
101

# File 'lib/woods/retriever.rb', line 99

def metadata_store
  @metadata_store
end

#vector_store ⇒ `Object` (readonly)



99
100
101

# File 'lib/woods/retriever.rb', line 99

def vector_store
  @vector_store
end

Instance Method Details

#retrieve(query, budget: 8000, types: nil, exclude_types: nil) ⇒ `RetrievalResult`

Execute the full retrieval pipeline for a natural language query.

Pipeline: classify -> execute -> rank -> filter -> (fallback within-type when filter emptied everything) -> assemble -> format.

When types: is set, the response carries type_rank_context —per-type rank metadata the caller uses to tell a strong match from a weak one without Woods imposing a score threshold.