Rack, CSV streaming, and Ruby's Enumerator

Rack, CSV streaming, and Ruby's Enumerator

September 22, 2014

When a response body to a HTTP request gets big it's a good idea to stream it. A classic example of this is a CSV download--while you might get away without streaming for smaller response bodies as soon as the CSV file size is over a few megabytes you're going to see timeouts and performance issues.

We saw some of these issues with Mailmatch, and I'm going to take you through how we solved these by adding streaming support to our CSV downloads. Note that this tutorial is fairly Ruby, Rack and Sinatra specific although you should be able to apply the principals to your Rack based framework of choice.

It turns out that streaming is baked into Rack's protocol. The body section of Rack's array spec can be anything that responds to each(). In practice this often an array containing the string response body. However we can take advantage of this behavior by providing our own streaming object that responds to each.

Our Sinatra route is going to lookup a record, set up the content disposition headers, and return List#as_csv (which we'll define later).

get '/lists/:id/csv' do
  @list = List.first!(id: params[:id])

  attachment 'list.csv'

Our as_csv method is going to return a Enumerator. We're doing a little magic at the start of the method with enum_for to instantiate the Enumerator.

class List
  def as_csv
    return enum_for(:as_csv) unless block_given?

    emails.each do |email|
      yield CSV.generate_line(email.as_csv)

That's it! The Enumerator responds to each and make sure our response is streamed to the client one CSV line at a time.

While in this case emails is just an array we're iterating over, in practice you'll want to paginate over large datasets so you don't have to load them all into memory.

Company Logo API

Engineeringby Alex MacCaw on January 01, 2021

Clearbit's free Logo API is still available here in 2021 — and still completely free. We never found anything that catered well to company logos. And yet there's a lot of clear use-cases ranging from setting an organization's default image on signup to pulling in logos next to job listings. Clearbit Logo API The API is incredibly simple, taking a company's domain and returning an image. GET https://logo.clearbit.com/:domain Behind the scenes we're using Clearbit's Company API [https://clear

Introducing ultimate parent to Clearbit Enrichment API

Engineeringby Emily Brown on April 16, 2019

We've added a new data attribute to Clearbit's Company Enrichment API: ultimate parent. Bring full context of your accounts' hierarchy into view — so your team can stay up to speed on new acquisitions and know when they're in conversation with the same parent company.

Join our newsletter

Engaging stories and exclusive data, designed for our best customers. One useful issue each month.