Class: LLM::LlamaCpp
Overview
The LlamaCpp class implements a provider for [llama.cpp](https://github.com/ggml-org/llama.cpp) through the OpenAI-compatible API exposed by the llama-server binary. Like the Ollama provider, it supports a wide range of models and is straightforward to run on your own hardware.
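As a rough sketch of typical usage, assuming a llama-server instance is already listening locally, and assuming the LLM.llamacpp entry point and #complete call shape from the wider llm.rb API (neither is documented on this page):

```ruby
require "llm"

# Sketch: llama-server is assumed to be running already, e.g.
#   llama-server --port 8080 -m qwen3.gguf
llm = LLM.llamacpp(host: "localhost", port: 8080, ssl: false)

# Provider#complete is inherited (see the method summary below); the
# exact call shape and response accessors are assumptions here.
res = llm.complete("Hello, world!")
res.choices.each { |message| puts message.content }
```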
Constant Summary
Constants inherited from OpenAI
OpenAI::HOST
Instance Method Summary
Methods inherited from OpenAI

#adapt, #assistant_role, #complete, #embed, #models, #server_tools, #web_search
Methods inherited from Provider
#assistant_role, #chat, #complete, #developer_role, #embed, #inspect, #interrupt!, #models, #persist!, #respond, #schema, #server_tool, #server_tools, #streamable?, #system_role, #tool_role, #tracer, #tracer=, #user_role, #web_search, #with
Constructor Details
#initialize(host: "localhost", port: 8080, ssl: false) ⇒ LLM::LlamaCpp
```ruby
# File 'lib/llm/providers/llamacpp.rb', line 26

def initialize(host: "localhost", port: 8080, ssl: false, **)
  super
end
```
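For example, to point the provider at llama-server on another machine (a sketch; the host and port are hypothetical, and LLM.llamacpp is assumed to forward its keywords to this constructor):

```ruby
# Hypothetical host and port; the keywords are assumed to reach #initialize.
llm = LLM.llamacpp(host: "192.168.1.50", port: 8081, ssl: false)
```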
Instance Method Details
#audio ⇒ Object

```ruby
# File 'lib/llm/providers/llamacpp.rb', line 51

def audio
  raise NotImplementedError
end
```
#default_model ⇒ String
Returns the default model for chat completions.

```ruby
# File 'lib/llm/providers/llamacpp.rb', line 77

def default_model
  "qwen3"
end
```
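Since llama-server typically serves whichever model it was launched with, this default is mostly a placeholder name sent in requests. A sketch of overriding it, assuming a model: option is forwarded in the request body:

```ruby
# Sketch: the model: option is assumed to replace the "qwen3" default in
# the request body; llama-server may ignore it and use its loaded model.
llm.complete("Hello, world!", model: "llama-3.1-8b-instruct")
```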
#files ⇒ Object

```ruby
# File 'lib/llm/providers/llamacpp.rb', line 39

def files
  raise NotImplementedError
end
```
#images ⇒ Object

```ruby
# File 'lib/llm/providers/llamacpp.rb', line 45

def images
  raise NotImplementedError
end
```
#moderations ⇒ Object
```ruby
# File 'lib/llm/providers/llamacpp.rb', line 57

def moderations
  raise NotImplementedError
end
```
#name ⇒ Symbol
Returns the provider’s name.

```ruby
# File 'lib/llm/providers/llamacpp.rb', line 33

def name
  :llamacpp
end
```
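Given a provider instance such as llm from the earlier sketches:

```ruby
llm.name # => :llamacpp
```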
#responses ⇒ Object
```ruby
# File 'lib/llm/providers/llamacpp.rb', line 63

def responses
  raise NotImplementedError
end
```
#vector_stores ⇒ Object
```ruby
# File 'lib/llm/providers/llamacpp.rb', line 69

def vector_stores
  raise NotImplementedError
end
```
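The endpoints above (#audio, #files, #images, #moderations, #responses, #vector_stores) have no llama-server counterpart, so each raises NotImplementedError. A sketch of guarding a call:

```ruby
# Sketch: any of the unimplemented endpoints raises NotImplementedError.
begin
  llm.vector_stores
rescue NotImplementedError
  warn "vector stores are not available with the llamacpp provider"
end
```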