omniflash-sdk (Ruby)

Ruby client for Gemini Omni Flash — generate short video clips (with synchronized audio) and images using Google's Gemini Omni Flash family of models.

Gemini Omni Flash wraps the Omni Flash family (seedance-2 for text/image → video + audio, gpt-image-2 and nano-banana-2 for text/image → image) behind one simple REST API.

Install

gem install omniflash-sdk
gem "omniflash-sdk"

Get an API key

Sign in at Gemini Omni Flash, open the account page, then create a sk-… token.

export OMNIFLASH_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx

Quick start

require "omniflash_sdk"

client = OmniflashSdk::Client.new  # reads ENV["OMNIFLASH_API_KEY"]

task = client.run(
  model_id: "seedance-2",
  prompt: "a kettle whistles as steam rises, cozy kitchen, warm light",
  aspect_ratio: "16:9"
)
puts task.video_url
puts task.audio_url   # synchronized audio

Lower level

task = client.create_task(
  model_id: "gpt-image-2",
  prompt: "cyberpunk corgi, neon rim light",
  aspect_ratio: "1:1"
)

until task.done?
  sleep 3
  task = client.get_task(task.task_id)
end
puts task.image_url

Models

model_id Modality Output
seedance-2 text/image → video video_url + audio_url
gpt-image-2 text/image → image image_url
nano-banana-2 text/image → image image_url

See current models and pricing on Gemini Omni Flash.

License

MIT