<?xml version="1.0" encoding="utf-8"?>
<feed xml:lang="en-us" xmlns="http://www.w3.org/2005/Atom"><title>Simon Willison's Weblog: mrjob</title><link href="http://simonwillison.net/" rel="alternate"/><link href="http://simonwillison.net/tags/mrjob.atom" rel="self"/><id>http://simonwillison.net/</id><updated>2010-10-29T23:55:00+00:00</updated><author><name>Simon Willison</name></author><entry><title>mrjob: Distributed Computing for Everybody</title><link href="https://simonwillison.net/2010/Oct/29/yelp/#atom-tag" rel="alternate"/><published>2010-10-29T23:55:00+00:00</published><updated>2010-10-29T23:55:00+00:00</updated><id>https://simonwillison.net/2010/Oct/29/yelp/#atom-tag</id><summary type="html">
    
&lt;p&gt;&lt;strong&gt;&lt;a href="http://engineeringblog.yelp.com/2010/10/mrjob-distributed-computing-for-everybody.html"&gt;mrjob: Distributed Computing for Everybody&lt;/a&gt;&lt;/strong&gt;&lt;/p&gt;
Yelp use MapReduce with Hadoop (running on Amazon’s EMR service) to power all sorts of interesting features on the site, including spelling suggestions, review highlights, top searches and “people who viewed X also viewed...”. mrjob is their new open source Python framework for writing MapReduce jobs against the Hadoop streaming API.


    &lt;p&gt;Tags: &lt;a href="https://simonwillison.net/tags/yelp"&gt;yelp&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/recovered"&gt;recovered&lt;/a&gt;, &lt;a href="https://simonwillison.net/tags/mrjob"&gt;mrjob&lt;/a&gt;&lt;/p&gt;



</summary><category term="yelp"/><category term="recovered"/><category term="mrjob"/></entry></feed>