Module: Pikuri::Tool::Scraper

Defined in:
lib/pikuri/tool/scraper/simple.rb,
lib/pikuri/tool/scraper/pdf.rb,
lib/pikuri/tool/scraper/html.rb,
lib/pikuri/tool/scraper/fetch_error.rb

Overview

Namespace for the URL-to-Markdown scraping stack used by WEB_SCRAPE and FETCH: a content-type-dispatching fetcher (Simple), pure content extractors (HTML, PDF), and a shared error type (FetchError). Nothing here knows about the LLM — the tools that wrap these layers turn rendered Markdown (or FetchError) into the next observation.

Defined Under Namespace

Modules: HTML, PDF, Simple Classes: FetchError