Posts

Building Spelling Correction at Etsy

Top K paths in Viterbi algorithm

Limiting Hadoop Part Files using 'shard' and a Common Gotcha

Camus for Scalding on cdh4 without Avro

Notes on Installing Kafka

Installing avronode on Centos

Run Scalding Tests Using Ant

Chaining Jobs in Scalding

Using indexed Lzo files in Scalding

Running Crunch with CDH4

Hadoop Counters in Scalding