<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The MalStone Benchmark, TeraSort and Clouds For Data Intensive Computing</title>
	<atom:link href="http://rgrossman.com/2009/05/25/malstone-benchmark/feed/" rel="self" type="application/rss+xml" />
	<link>http://rgrossman.com/2009/05/25/malstone-benchmark/</link>
	<description>analytics, analytic strategy and analytic infrastructure</description>
	<lastBuildDate>Wed, 25 Aug 2010 14:16:49 -0500</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: DataRush posts 2 TB per hour on MalStone B &#124; insideHPC.com</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-327</link>
		<dc:creator>DataRush posts 2 TB per hour on MalStone B &#124; insideHPC.com</dc:creator>
		<pubDate>Wed, 25 Aug 2010 14:16:49 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-327</guid>
		<description>[...] on MalStone B (a stylized benchmark for data intensive computing, Robert Grossman describes it here) using a 32-core Intel Xeon 7550 server. MalStone B10 has 10 billion records, which equates to just [...]</description>
		<content:encoded><![CDATA[<p>[...] on MalStone B (a stylized benchmark for data intensive computing, Robert Grossman describes it here) using a 32-core Intel Xeon 7550 server. MalStone B10 has 10 billion records, which equates to just [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: HIMSS 2010 - DataRush in Health IT making a difference in patient care - Pervasive DataRush</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-137</link>
		<dc:creator>HIMSS 2010 - DataRush in Health IT making a difference in patient care - Pervasive DataRush</dc:creator>
		<pubDate>Thu, 11 Mar 2010 13:41:50 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-137</guid>
		<description>[...] cyber-security, Pervasive DataRush presented results from a benchmark study based on Malstone algorithm for web exploits. The Malstone DataRush implementation is capable of processing 1Tb of [...]</description>
		<content:encoded><![CDATA[<p>[...] cyber-security, Pervasive DataRush presented results from a benchmark study based on Malstone algorithm for web exploits. The Malstone DataRush implementation is capable of processing 1Tb of [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Open Source Cloud Computing Software at SC 09 &#171; From Data to Decisions</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-26</link>
		<dc:creator>Open Source Cloud Computing Software at SC 09 &#171; From Data to Decisions</dc:creator>
		<pubDate>Thu, 12 Nov 2009 17:51:57 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-26</guid>
		<description>[...] measured by the MalStone Benchmark, Sector was over 2x fast as Hadoop. Sector was one of six technologies selected by SC 09 as a [...]</description>
		<content:encoded><![CDATA[<p>[...] measured by the MalStone Benchmark, Sector was over 2x fast as Hadoop. Sector was one of six technologies selected by SC 09 as a [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Large Data Clouds FAQ &#171; From Data to Decisions</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-25</link>
		<dc:creator>Large Data Clouds FAQ &#171; From Data to Decisions</dc:creator>
		<pubDate>Fri, 17 Jul 2009 01:38:21 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-25</guid>
		<description>[...] How do I get started?  The easiest way to get started is to download one of the applications and to work through some basic examples. The example that most people work through is word count. Another common example is the terasort example (soring 10 billion 100 byte records where the first 10 bytes is the key that is sorted and the remaining 90 bytes is the payload). A simple analytic to try is MalStone, which I have described in another post. [...]</description>
		<content:encoded><![CDATA[<p>[...] How do I get started?  The easiest way to get started is to download one of the applications and to work through some basic examples. The example that most people work through is word count. Another common example is the terasort example (soring 10 billion 100 byte records where the first 10 bytes is the key that is sorted and the remaining 90 bytes is the payload). A simple analytic to try is MalStone, which I have described in another post. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Test Drive the Sector Public Cloud &#171; From Data to Decisions</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-24</link>
		<dc:creator>Test Drive the Sector Public Cloud &#171; From Data to Decisions</dc:creator>
		<pubDate>Tue, 23 Jun 2009 17:28:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-24</guid>
		<description>[...] is some information on the performance of Sector/Sphere in my post on the MalStone Benchmark, a benchmark for clouds that support data intensive computing. Possibly related posts: [...]</description>
		<content:encoded><![CDATA[<p>[...] is some information on the performance of Sector/Sphere in my post on the MalStone Benchmark, a benchmark for clouds that support data intensive computing. Possibly related posts: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rgrossman</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-22</link>
		<dc:creator>rgrossman</dc:creator>
		<pubDate>Thu, 04 Jun 2009 13:58:19 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-22</guid>
		<description>Sample implementations of the benchmark code can now be found on the MalGen web site: &lt;a href=&quot;http://code.google.com/p/malgen&quot; rel=&quot;nofollow&quot;&gt;code.google.com/p/malgen&lt;/a&gt;.

Both a technical report about MalStone and sample implementations are available in the &quot;Feature Downloads&quot; section along the right hand edge.

We view both the code implementations and the timings as preliminary and look forward to working with the community on flushing out and improving the benchmark and optimizing the particular implementations.

A more detailed technical report about MalStone is in preparation.</description>
		<content:encoded><![CDATA[<p>Sample implementations of the benchmark code can now be found on the MalGen web site: <a href="http://code.google.com/p/malgen" rel="nofollow">code.google.com/p/malgen</a>.</p>
<p>Both a technical report about MalStone and sample implementations are available in the &#8220;Feature Downloads&#8221; section along the right hand edge.</p>
<p>We view both the code implementations and the timings as preliminary and look forward to working with the community on flushing out and improving the benchmark and optimizing the particular implementations.</p>
<p>A more detailed technical report about MalStone is in preparation.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rgrossman</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-21</link>
		<dc:creator>rgrossman</dc:creator>
		<pubDate>Wed, 27 May 2009 01:52:41 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-21</guid>
		<description>Doug,

Thanks for your note.  Please consider the posted results preliminary.  We&#039;ll be making the code available shortly along with a technical document that provides more details about MalStone.  I&#039;ll update the post when we do.   We also found the results suprising and it could well be that with more work optimising the code these numbers would change.

--Bob</description>
		<content:encoded><![CDATA[<p>Doug,</p>
<p>Thanks for your note.  Please consider the posted results preliminary.  We&#8217;ll be making the code available shortly along with a technical document that provides more details about MalStone.  I&#8217;ll update the post when we do.   We also found the results suprising and it could well be that with more work optimising the code these numbers would change.</p>
<p>&#8211;Bob</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Doug Cutting</title>
		<link>http://rgrossman.com/2009/05/25/malstone-benchmark/comment-page-1/#comment-20</link>
		<dc:creator>Doug Cutting</dc:creator>
		<pubDate>Tue, 26 May 2009 21:51:46 +0000</pubDate>
		<guid isPermaLink="false">http://blog.rgrossman.com/?p=146#comment-20</guid>
		<description>Have you posted your benchmark code publicly?  It doesn&#039;t appear to be with the MalGen code.

I can&#039;t think why &quot;Hadoop MapReduce&quot; is slower than &quot;Hadoop Streams&quot;.  I assume the former is a Java implementation, which is nearly always faster than Hadoop Streaming, yet, in your case it is 5x slower.  That seems strange.</description>
		<content:encoded><![CDATA[<p>Have you posted your benchmark code publicly?  It doesn&#8217;t appear to be with the MalGen code.</p>
<p>I can&#8217;t think why &#8220;Hadoop MapReduce&#8221; is slower than &#8220;Hadoop Streams&#8221;.  I assume the former is a Java implementation, which is nearly always faster than Hadoop Streaming, yet, in your case it is 5x slower.  That seems strange.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
