Class: Phronomy::VectorStore::InMemory
- Defined in:
- lib/phronomy/vector_store/in_memory.rb
Overview
Pure-Ruby in-memory vector store using cosine similarity.
Intended for tests, short-lived agents, and Retrieval::Semantic scenarios where the message count is small enough that a linear scan is fast enough.
Instance Method Summary collapse
- #add(id:, embedding:, metadata: {}, cancellation_token: nil) ⇒ Object
- #clear ⇒ Object
-
#initialize(dimension: nil) ⇒ InMemory
constructor
A new instance of InMemory.
- #remove(id:) ⇒ Object
-
#search(query_embedding:, k: 5, cancellation_token: nil) ⇒ Array<Hash>
Sorted by descending score.
-
#size ⇒ Integer
Number of documents stored.
Constructor Details
#initialize(dimension: nil) ⇒ InMemory
Returns a new instance of InMemory.
20 21 22 23 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 20 def initialize(dimension: nil) @documents = {} @expected_dimension = dimension end |
Instance Method Details
#add(id:, embedding:, metadata: {}, cancellation_token: nil) ⇒ Object
30 31 32 33 34 35 36 37 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 30 def add(id:, embedding:, metadata: {}, cancellation_token: nil) cancellation_token&.raise_if_cancelled! # Establish expected dimension on first add, then validate. @expected_dimension ||= .size (, @expected_dimension) @documents[id] = {embedding: , metadata: } self end |
#clear ⇒ Object
67 68 69 70 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 67 def clear @documents.clear self end |
#remove(id:) ⇒ Object
62 63 64 65 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 62 def remove(id:) @documents.delete(id) self end |
#search(query_embedding:, k: 5, cancellation_token: nil) ⇒ Array<Hash>
Returns sorted by descending score.
44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 44 def search(query_embedding:, k: 5, cancellation_token: nil) cancellation_token&.raise_if_cancelled! k = validate_k!(k) # search never establishes dimension; validate only when dimension is known. (, @expected_dimension) # Take an atomic snapshot before iterating. Hash#dup is a C-level # call that completes without releasing the GVL, so it is atomic with # respect to any other Ruby thread. Iterating the copy instead of # @documents directly prevents "can't add a new key into hash during # iteration" when a concurrent thread calls #add. snapshot = @documents.dup results = snapshot.map do |id, doc| score = cosine_similarity(, doc[:embedding]) {id: id, score: score, metadata: doc[:metadata]} end results.sort_by { |r| -r[:score] }.first(k) end |
#size ⇒ Integer
Returns number of documents stored.
74 75 76 |
# File 'lib/phronomy/vector_store/in_memory.rb', line 74 def size @documents.size end |