Class: OllamaChat::Utils::Fetcher
- Inherits:
-
Object
- Object
- OllamaChat::Utils::Fetcher
- Defined in:
- lib/ollama_chat/utils/fetcher.rb
Overview
A fetcher implementation that handles retrieval and caching of HTTP resources.
This class provides functionality to fetch content from URLs, with support for caching responses and their metadata. It handles various content types and integrates with different cache backends to improve performance by avoiding redundant network requests.
Defined Under Namespace
Modules: ResponseMetadata Classes: RetryWithoutStreaming
Class Method Summary collapse
-
.execute(command) {|tmpfile| ... } ⇒ Object
The execute method runs a shell command and processes its output.
-
.get(url, headers: {}, **options) {|tmp| ... } ⇒ Object
Fetches the content located at
urland optionally caches it. -
.normalize_url(url) ⇒ Object
The normalize_url method processes a URL by converting it to a string, decoding any URI components, removing anchors, and then escaping the URL to ensure it is properly formatted.
-
.read(filename) {|file| ... } ⇒ nil, Object
The read method opens a file and extends it with header extension metadata.
Instance Method Summary collapse
-
#get(url, **opts) {|tmp| ... } ⇒ Object
Fetches the content located at
url. -
#initialize(debug: false, http_options: {}) ⇒ Fetcher
constructor
The initialize method sets up the fetcher instance with debugging and HTTP configuration options.
Constructor Details
#initialize(debug: false, http_options: {}) ⇒ Fetcher
The initialize method sets up the fetcher instance with debugging and HTTP configuration options.
231 232 233 234 235 236 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 231 def initialize(debug: false, http_options: {}) @debug = debug @started = false @streaming = true @http_options = end |
Class Method Details
.execute(command) {|tmpfile| ... } ⇒ Object
The execute method runs a shell command and processes its output.
It captures the command’s standard output and error streams, writes them to a temporary file, and yields the file to the caller. If an exception occurs during execution, it reports the error and yields a failed temporary file instead.
204 205 206 207 208 209 210 211 212 213 214 215 216 217 218 219 220 221 222 223 224 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 204 def self.execute(command, &block) command.is_a?(String) or command = Shellwords.join(command) Tempfile.create do |tmp| unless command =~ /2>&1/ command += ' 2>&1' end IO.popen(command) do |io| until io.eof? tmp.write io.read(1 << 14) end tmp.rewind tmp.extend(OllamaChat::Utils::Fetcher::ResponseMetadata) tmp.content_type = MIME::Types['text/plain'].first block.(tmp) end end rescue => e msg = "Could not execute #{command.inspect} (#{e})" @debug && !e.is_a?(RuntimeError) and msg += "\n#{e.backtrace * ?\n}" raise OllamaChat::ExecuteError, msg end |
.get(url, headers: {}, **options) {|tmp| ... } ⇒ Object
Fetches the content located at url and optionally caches it.
This is a convenience wrapper around an instance of ‘OllamaChat::Utils::Fetcher`. It accepts the same options as the instance method, with the following additions:
-
‘:cache` – an object that responds to `get` and `put` (see `OllamaChat::Utils::CacheFetcher`). If a cached value exists, it is returned immediately and the network is not contacted.
The method streams the HTTP response into a temporary file (or a ‘StringIO` in the event of a failure). The temporary file is extended with `ResponseMetadata`, so the block can inspect `tmp.content_type` and `tmp.ex`. After the block returns the temporary file is closed and discarded. If a cache is supplied and the response is not a `StringIO`, the temporary file is written back to the cache for future requests.
141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 141 def self.get(url, headers: {}, **, &block) cache = .delete(:cache) and cache = OllamaChat::Utils::CacheFetcher.new(cache) cache and .puts "Getting #{url.to_s.inspect} via cache…" if result = cache&.get(url, &block) content_type = result&.content_type || 'unknown' .puts "…hit, found #{content_type} content in cache." return result else new(**).send(:get, url, headers:) do |tmp| result = block.(tmp) if cache && !tmp.is_a?(StringIO) tmp.rewind cache.put(url, tmp) end result end end end |
.normalize_url(url) ⇒ Object
The normalize_url method processes a URL by converting it to a string, decoding any URI components, removing anchors, and then escaping the URL to ensure it is properly formatted.
164 165 166 167 168 169 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 164 def self.normalize_url(url) url = url.to_s url = URI.decode_uri_component(url) url = url.sub(/#.*/, '') URI::Parser.new.escape(url).to_s end |
.read(filename) {|file| ... } ⇒ nil, Object
The read method opens a file and extends it with header extension metadata. It then yields the file to the provided block for processing. If the file does not exist, it outputs an error message to standard error.
182 183 184 185 186 187 188 189 190 191 192 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 182 def self.read(filename, &block) if File.exist?(filename) File.open(filename) do |file| file.extend(OllamaChat::Utils::Fetcher::ResponseMetadata) file.content_type = MIME::Types.type_for(filename).first block.(file) end else STDERR.puts "File #{filename.to_s.inspect} doesn't exist." end end |
Instance Method Details
#get(url, **opts) {|tmp| ... } ⇒ Object
Fetches the content located at url.
The method first checks an optional cache (passed via the ‘:cache` option). If a cached response is found, it is returned immediately. Otherwise the URL is fetched over HTTP using Excon. Two modes are supported:
-
Streaming – the response body is streamed directly into a temporary file. Progress is reported via ‘infobar`. If the first request fails with a non‑200 status or the streaming mode is not supported, the method falls back to a non‑streaming request.
-
**Non‑streaming** – the entire body is read into memory before being written to the temporary file.
The temporary file is yielded to the caller. The file is extended with ‘ResponseMetadata`, so the block can inspect `content_type` and `ex` (cache‑expiry in seconds). After the block returns, the temporary file is closed and discarded.
If a cache is supplied and the response is not a ‘StringIO`, the temporary file is written back to the cache for future requests.
289 290 291 292 293 294 295 296 297 298 299 300 301 302 303 304 305 306 307 308 309 310 311 312 313 314 315 316 317 318 319 320 321 322 323 324 |
# File 'lib/ollama_chat/utils/fetcher.rb', line 289 def get(url, **opts, &block) opts.delete(:response_block) and raise ArgumentError, 'response_block not allowed' middlewares = (self.middlewares | Array((opts.delete(:middlewares)))).uniq headers = opts.delete(:headers) || {} headers |= self.headers headers = headers.stringify_keys_recursive response = nil Tempfile.create do |tmp| .label = 'Getting' if @streaming response = excon(url, headers:, response_block: callback(tmp), **opts).request(method: :get) response.status != 200 || !@started and raise RetryWithoutStreaming decorate_io(tmp, response) .finish block.(tmp) else response = excon(url, headers:, middlewares:, **opts).request(method: :get) if status = response.status and status != 200 = "request failed with status %u" % status response.reason_phrase.full? { << " #{_1.inspect}" } error = OllamaChat::HTTPError.new() error.status = status raise error end body = response.body tmp.print body .update(message: (body.size, body.size), force: true) decorate_io(tmp, response) .finish block.(tmp) end end rescue RetryWithoutStreaming @streaming = false retry end |