Red Arrow Format - Apache Arrow Format Ruby
Red Arrow Format is the pure Ruby Apache Arrow format serializer and deserializer implementation. This provides only serialize/deserialize features. If you want to process Apache Arrow data not only serialize/desrialize Apache Arrow data, you should use Red Arrow not Red Arrow Format.
Apache Arrow is an in-memory columnar data store. It's used by many products for data analytics.
Install
If you want to install Red Arrow Format by Bundler, you can add the followings to your Gemfile:
gem "red-arrow-format"
If you want to install Red Arrow Format by RubyGems, you can use the following command line:
$ gem install red-arrow-format
Usage
require "arrow-format"
File.open("/dev/shm/data.arrow", "rb") do |input|
reader = ArrowFormat::FileReader.new(input)
reader.each do |record_batch|
# Use record_batch
end
end
Development
You can run tests by the following command lines:
$ cd ruby/red-arrow-format
$ bundle install
$ bundle exec rake test