Class: Phronomy::VectorStore::Pgvector

Inherits:
Base
  • Object
show all
Defined in:
lib/phronomy/vector_store/pgvector.rb

Overview

PostgreSQL-backed vector store using the pgvector extension.

Requires:

  • The +pgvector+ gem (add to your Gemfile)
  • An ActiveRecord model class with the following columns: id (string / uuid) embedding (vector — from the pgvector column type) metadata (text or jsonb — stores arbitrary metadata as JSON)

Examples:

Usage

store = Phronomy::VectorStore::Pgvector.new(model_class: VectorDocument)
store.add(id: "doc1", embedding: [0.1, 0.9], metadata: {text: "hello"})
results = store.search(query_embedding: [0.1, 0.8], k: 5)

Instance Method Summary collapse

Constructor Details

#initialize(model_class:, dimension: nil) ⇒ Pgvector

Returns a new instance of Pgvector.

Parameters:

  • model_class (Class)

    ActiveRecord model with id/embedding/metadata columns

  • dimension (Integer, nil) (defaults to: nil)

    expected embedding dimension for Phronomy-side pre-validation. When nil, dimension enforcement is delegated to the database schema; no pre-validation is performed by Phronomy.



25
26
27
28
29
30
31
32
33
34
35
# File 'lib/phronomy/vector_store/pgvector.rb', line 25

def initialize(model_class:, dimension: nil)
  begin
    require "pgvector"
  rescue LoadError
    raise LoadError,
      "pgvector gem is required for Phronomy::VectorStore::Pgvector. " \
      "Add `gem 'pgvector'` to your Gemfile."
  end
  @model_class = model_class
  @dimension = dimension
end

Instance Method Details

#add(id:, embedding:, metadata: {}) ⇒ Object

Parameters:

  • id (String)
  • embedding (Array<Float>)
  • metadata (Hash) (defaults to: {})


40
41
42
43
44
45
46
47
# File 'lib/phronomy/vector_store/pgvector.rb', line 40

def add(id:, embedding:, metadata: {})
  validate_embedding_dimension!(embedding, @dimension)
  @model_class.upsert(
    {id: id, embedding: safe_vector(embedding), metadata: .to_json},
    unique_by: :id
  )
  self
end

#clearObject



77
78
79
80
# File 'lib/phronomy/vector_store/pgvector.rb', line 77

def clear
  @model_class.delete_all
  self
end

#remove(id:) ⇒ Object



72
73
74
75
# File 'lib/phronomy/vector_store/pgvector.rb', line 72

def remove(id:)
  @model_class.where(id: id).delete_all
  self
end

#search(query_embedding:, k: 5) ⇒ Array<Hash>

Returns sorted by descending similarity score.

Parameters:

  • query_embedding (Array<Float>)
  • k (Integer) (defaults to: 5)

Returns:

  • (Array<Hash>)

    sorted by descending similarity score



52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
# File 'lib/phronomy/vector_store/pgvector.rb', line 52

def search(query_embedding:, k: 5)
  validate_embedding_dimension!(query_embedding, @dimension)
  vec = safe_vector_literal(query_embedding)
  k_safe = Integer(k)
  conn = @model_class.connection
  quoted_vec = "#{conn.quote(vec)}::vector"

  @model_class
    .select("id, metadata, 1 - (embedding <=> #{quoted_vec}) AS score")
    .order("embedding <=> #{quoted_vec}")
    .limit(k_safe)
    .map do |r|
      {
        id: r.id.to_s,
        score: r.score.to_f,
        metadata: JSON.parse(r..to_s, symbolize_names: true)
      }
    end
end