<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Richard Jones, Esq.</title>
	<atom:link href="http://www.metabrew.com/feed" rel="self" type="application/rss+xml" />
	<link>http://www.metabrew.com</link>
	<description>Erlang, PHP, C, C++, Java, PostgreSQL, MySQL, Hadoop, Linux, awk, bash, sed, grep, screen, vim, irc, ssh etc...</description>
	<lastBuildDate>Sun, 20 Dec 2009 18:59:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Rewriting Playdar: C++ to Erlang, massive savings</title>
		<link>http://www.metabrew.com/article/rewriting-playdar-c-to-erlang-massive-savings/</link>
		<comments>http://www.metabrew.com/article/rewriting-playdar-c-to-erlang-massive-savings/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 21:29:15 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[playdar]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[c]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[rewrite]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=279</guid>
		<description><![CDATA[I&#8217;ve heard many anecdotes and claims about how many lines of code are saved when you write in Erlang instead of [C++/other language]. I&#8217;m happy to report that I now have first-hand experience and some data to share. I initially wrote Playdar in C++ (using Boost and Asio libraries), starting back in February this year. [...]]]></description>
			<content:encoded><![CDATA[<p>
I&#8217;ve heard many anecdotes and claims about how many lines of code are saved when you write in Erlang instead of [C++/other language]. I&#8217;m happy to report that I now have first-hand experience and some data to share.
</p>
<p>
I initially wrote Playdar in C++ (using Boost and Asio libraries), starting back in February this year. I was fortunate to be working with some experienced developers who helped me come to terms with C++. There were three of us hacking on it regularly up until a few months ago, and despite being relatively new to C++, I&#8217;ll say that we ended up with a well designed and robust codebase, all things considered.
</p>
<h2>On Feeling Smug</h2>
<p>
I&#8217;ll admit I felt rather smug making it all work in C++ with Boost and ASIO. Getting it to build on all three platforms and dynamically load extensions (DLLs etc) at runtime in a cross-platform way was also quite satisfying (I had plenty of help with that side of things).  I learned a lot about C++, Boost, ASIO and CMake. But, as the codebase grew, I began to seriously question my decision to use C++.
</p>
<p>
My initial reasons for choosing C++ were twofold:</p>
<ul>
<li>Distribution &#8211; shipping the Erlang VM didn&#8217;t sound like fun</li>
<li><a href="http://developer.kde.org/~wheeler/taglib.html" target="taglib">Taglib</a>  &#8211; *the* library to read metadata from audio files (mp3, m4a, ogg etc) is C++</li>
</ul>
<p>It turns out Playdar is naturally a good fit for Erlang &#8211; it does lots in parallel, and lots of stuff it does is asynchronous  and event based. Even with all the stuff you get with Boost, multithreaded stuff in C++ is inelegant, to put it kindly.
</p>
<h2>SLOCed and Loaded</h2>
<p>
Anyway, a couple of weeks ago I sat down to re-implement Playdar from scratch in Erlang. I thrashed out the guts of it in a couple of days, and by the end of the week I almost had it 1:1 features with the C++ codebase. There&#8217;s still a bit of C++ left &#8211; code to interface with taglib.</p>
<p>Using the SLOCCount tool (SLOC = source lines of code), I counted the lines of code in various modules from both codebases. Here are the results:<br />
<br/></p>
<style type="text/css">
#matrix td{ font-size:90%; vertical-align:top; padding: 3px; } #matrix tr { background: #f0f0f0; } #matrix tr.odd { background: #ddd; }
#matrix td.b {font-size:100%; font-weight:bold;}
</style>
<table id="matrix" border="0">
<tbody>
<tr>
<td class="b"></td>
<td class="b">Erlang Version</td>
<td class="b">C++ Version</td>
<td class="b">Savings</td>
</tr>
<tr class="odd">
<td class="b">Core Daemon</td>
<td>1,100</td>
<td>4,491</td>
<td>75%</td>
</tr>
<tr>
<td class="b">Library + Scanner</td>
<td>197 + 167 (C++)</td>
<td>1,355</td>
<td>73%</td>
</tr>
<tr class="odd">
<td class="b">LAN Resolver</td>
<td>105</td>
<td>427</td>
<td>75%</td>
</tr>
<tr>
<td class="b">P2P</td>
<td>463</td>
<td>1,762</td>
<td>74%</td>
</tr>
<tr class="odd b">
<td class="b">TOTAL</td>
<td><em>2,032</em></td>
<td><em>8,035</em></td>
<td><em>75%</em></td>
</tr>
</tbody>
</table>
<p><strong><br />
75% fewer lines of code using Erlang compared to C++ to implement the same thing &#8211; not too shabby :)<br />
</strong><br />
Writing it in Erlang the second time around, I knew exactly what I was building, so it&#8217;s unfair to compare the development time of the two codebases. Still, given how fast I can type, I reckon I saved a good few hours of just pounding the keyboard to input the code (and countless hours of debugging: Erlang tends to work first time, really). Well, I&#8217;m not sure &#8220;saved&#8221; is the right word, considering it was working in C++ already, but it&#8217;s my time to waste :)
</p>
<p>
If you count the third-party code bundled with both codebases (excluding Boost/Asio!) then the Erlang codebase saves a whopping 92%. I&#8217;m more interested in the savings in code I had to write, however.
</p>
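<p>
For anyone who wants to check the arithmetic, the savings column is one minus the ratio of Erlang SLOC to C++ SLOC. Here is a minimal sketch of that calculation. The module and function names are made up for illustration, and the Library + Scanner row counts the 167 lines of remaining C++ alongside the 197 lines of Erlang:
</p>
<pre>-module(sloc_savings).
-export([savings/2, report/0]).

%% Percentage of lines saved going from CppLines to ErlLines.
savings(ErlLines, CppLines) -&gt;
    100 * (1 - ErlLines / CppLines).

%% Print one line per row of the table above, to one decimal place.
report() -&gt;
    Rows = [{"Core Daemon", 1100, 4491},
            {"Library + Scanner", 197 + 167, 1355},
            {"LAN Resolver", 105, 427},
            {"P2P", 463, 1762},
            {"TOTAL", 2032, 8035}],
    lists:foreach(
      fun({Name, Erl, Cpp}) -&gt;
              io:format("~s: ~.1f%~n", [Name, savings(Erl, Cpp)])
      end, Rows).</pre>
<p>
Calling sloc_savings:report() from the shell should give figures within a point of the whole-number percentages in the table, with the total coming out at roughly 74.7%.
</p>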
<h2>Memory and CPU Usage</h2>
<p>
I&#8217;ve done some preliminary comparisons between the two projects, and when it comes to CPU and memory usage they are pretty similar. The Erlang codebase uses slightly more memory than C++ at the moment, but I&#8217;m convinced I can get that down to at least as low as the C++ project was. I picked up a few optimization tricks from my three-part <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Million-user comet experiment</a> in Erlang earlier this year. I&#8217;ll post more about this if I learn any new tricks.
</p>
<p>
One thing I&#8217;ve realised about the Erlang codebase is that I&#8217;ve used processes to encapsulate state (active queries, specifically) where I didn&#8217;t really need to. It seemed sensible at the time, but it&#8217;s probably just a waste of memory. I&#8217;m going to change it to spawn processes to get the work done (i.e. a process that runs the query) but not necessarily just to maintain state.</p>
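<p>As a rough illustration of that idea (a sketch only, not code from the Playdar repository; resolve/1 is a hypothetical stand-in for whatever actually runs a query), a short-lived worker process can do the work, send the result back and exit, so nothing lingers just to hold state:</p>
<pre>-module(query_sketch).
-export([run_query/1]).

%% Spawn a throwaway process to run the query, then wait for its reply.
run_query(Query) -&gt;
    Parent = self(),
    Ref = make_ref(),            % unique tag so we only match our own reply
    spawn(fun() -&gt; Parent ! {Ref, resolve(Query)} end),
    receive
        {Ref, Result} -&gt; Result
    after 1000 -&gt; timeout        % give up after one second
    end.

%% Hypothetical placeholder for the real resolver work.
resolve(Query) -&gt;
    {resolved, Query}.</pre>
<p>The worker exits as soon as it has sent its reply, so the only memory held per query is whatever the caller chooses to keep.</p>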
<h2>Distribution to the desktop</h2>
<h3>C++</h3>
<p>You just have to make sure that you build everything and ship any DLLs, along with checks in the installer for system libraries needed (runtime DLLs). Oh, and make sure you don&#8217;t change the plugin binary interface in the main app, or new plugins will crash and burn when you load them. Add a check for that. Oh, and be careful about compiling taglib and stuff with mingw and the rest with VC++, or things might mysteriously crash. Also I heard a horror story about allocating memory in plugin code but deallocating it in the main app when the plugin was compiled against a different stdlib than the main app. This is all par for the course, and the experienced C++ developers I asked for help had no trouble making it work. <strong>Size of installable package: 2.5MB</strong>
</p>
<h3>Erlang</h3>
<p>
Compiling, and building/loading plugins in the Erlang codebase is straightforward on all platforms, as is often the way with VMs. I was against shipping the Erlang VM originally because I figured it would be a lot of hassle and increase the download size substantially. Packaging an Erlang app for the desktop involves taking the installed VM directory structure and stripping out all the docs, source and parts of the Erlang stdlib we don&#8217;t use, then packaging it along with the compiled Playdar code. <a href="http://couchdb.apache.org/" target="cdb">CouchDB</a> does something like this too, and <a href="http://www.rabbitmq.com/" target="rabbit">RabbitMQ</a> ships the Erlang VM without stripping unneeded libs. We&#8217;ll work on packaging some more (for all platforms), but to date <a href="http://twitter.com/mxcl">Max</a> has crafted a package that contains the necessary bits of the Erlang VM, a sexy Prefpane to start/stop the daemon on OS X, and the compiled Playdar code all <strong>weighing in under 10MB.</strong>
</p>
<p>
We&#8217;ll put together a Windows installer soon that&#8217;ll probably be around the same size. A 10MB download isn&#8217;t so bad nowadays, and I expect we can optimize the packaging process some more. Linux users will get a package that depends on the Erlang VM in their package manager.<br />
Seems like shipping Erlang apps to the desktop isn&#8217;t so hard after all.
</p>
<h2>tl;dr</h2>
<p>
Someone rewrote a C++ app in Erlang: 75% fewer lines of code for the same functionality.
</p>
<p>
You should read this <a href="http://musicmachinery.com/2009/10/18/playing-with-playdar/">blog post about Playdar, by Paul Lamere</a>,  and take a look at the <a href="http://www.playdar.org/">Playdar website</a>.
</p>
<p>
<a href="http://github.com/RJ/playdar">C++ codebase (deprecated)</a><br />
<a href="http://github.com/RJ/playdar-core">Erlang codebase</a>
</p>
<p>
<strong>Playdar is the future, and the future is written in Erlang :)</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/rewriting-playdar-c-to-erlang-massive-savings//feed</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Erlang talk at London Hackspace</title>
		<link>http://www.metabrew.com/article/erlang-talk-at-london-hackspace/</link>
		<comments>http://www.metabrew.com/article/erlang-talk-at-london-hackspace/#comments</comments>
		<pubDate>Thu, 08 Oct 2009 10:31:01 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[hackspace]]></category>
		<category><![CDATA[london]]></category>
		<category><![CDATA[playdar]]></category>
		<category><![CDATA[talk]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=268</guid>
		<description><![CDATA[Last night I gave an &#8220;Intro to Erlang&#8221; talk at a London Hackspace meetup. I did a quick audience survey first: About 75% did &#8220;web programming&#8221; (Ruby, Python, PHP, etc.).  Around 30% admitted to regularly using C/C++/Java or doing desktop/mobile app development.  Fewer than 10% had much experience with functional programming. I wanted to impress upon the audience that [...]]]></description>
			<content:encoded><![CDATA[<p>Last night I gave an &#8220;Intro to Erlang&#8221; talk at a London Hackspace meetup. I did a quick audience survey first: About 75% did &#8220;web programming&#8221; (Ruby, Python, PHP, etc.).  Around 30% admitted to regularly using C/C++/Java or doing desktop/mobile app development.  Fewer than 10% had much experience with functional programming.</p>
<p>I wanted to impress upon the audience that Erlang is a practical language, built by Ericsson with a specific purpose in mind. You use Erlang to build useful, scalable and reliable distributed systems in the real world. This was worth pointing out because when many people hear &#8220;functional programming&#8221; they immediately think of eccentric bearded academics proving the validity of their Haskell code and comparing Monads.</p>
<p>I skipped through the basics of sequential programming in Erlang pretty quickly and tried to spend most of the time showing how you handle processes and send messages. I built a basic Erlang server process that kept a count of how many operations it had done, explaining how it passes state to itself on every loop. Hopefully this helped some people grok how you can build servers that keep a global state by using recursion. I also showed off hot code reloading. We added another feature to the server and upgraded it without stopping it.</p>
<p>You can download the code I used (see link at the end) if you want to try out the examples from last night yourself. The last code I showed was an example of doing the same thing using gen_server, so hopefully if you followed along you&#8217;ll have a good understanding of what gen_server is and why it exists.</p>
<h2>Hot code reloading example</h2>
<p>I can&#8217;t write a post about Erlang without including some code, so here&#8217;s the basic example I used showing how hot code reloading works:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>ex09<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">0</span>, loop/<span class="nu0">2</span>, client/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt; <span class="me1">spawn</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, loop, <span class="br0">&#91;</span><span class="nu0">0</span>,<span class="nu0">0</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">loop<span class="br0">&#40;</span><span class="re0">Ops</span>,<span class="re0">Wtfs</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;<span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp;<span class="br0">&#123;</span><span class="re0">Client</span>, double, <span class="re0">Num</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="re0">Client</span> ! <span class="re0">Num</span> * <span class="nu0">2</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp;loop<span class="br0">&#40;</span><span class="re0">Ops</span><span class="nu0">+1</span>, <span class="re0">Wtfs</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp;<span class="br0">&#123;</span><span class="re0">Client</span>, square, <span class="re0">Num</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="re0">Client</span> ! <span class="re0">Num</span> * <span class="re0">Num</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;loop<span class="br0">&#40;</span><span class="re0">Ops</span><span class="nu0">+1</span>, <span class="re0">Wtfs</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp;<span class="br0">&#123;</span><span class="re0">Client</span>, _, _<span class="re0">Num</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="re0">Client</span> ! wtf,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;loop<span class="br0">&#40;</span><span class="re0">Ops</span>, <span class="re0">Wtfs</span><span class="nu0">+1</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp;reload -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Reloading~n&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;?<span class="re0">MODULE</span>:<span class="me2">loop</span><span class="br0">&#40;</span><span class="re0">Ops</span>, <span class="re0">Wtfs</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp;stats -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp;<span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Ops: ~p, Wtfs: ~p ~n&quot;</span>, <span class="br0">&#91;</span><span class="re0">Ops</span>, <span class="re0">Wtfs</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;loop<span class="br0">&#40;</span><span class="re0">Ops</span>, <span class="re0">Wtfs</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;<span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% basic client API:</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">client<span class="br0">&#40;</span><span class="re0">Pid</span>, <span class="re0">Cmd</span>, <span class="re0">Num</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;<span class="re0">Pid</span> ! <span class="br0">&#123;</span>self<span class="br0">&#40;</span><span class="br0">&#41;</span>, <span class="re0">Cmd</span>, <span class="re0">Num</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp;<span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp;<span class="re0">Ans</span> -&gt; <span class="re0">Ans</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp;<span class="kw1">after</span> <span class="nu0">1000</span> -&gt; <span class="me1">timeout</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;<span class="kw1">end</span>.</div>
</li>
</ol>
</div>
<p>And if you were following along you saw something like this:</p>
<pre>1&gt; c(ex09).
{ok,ex09}
2&gt; Pid = ex09:start().
&lt;0.38.0&gt;
3&gt; Pid ! stats.
Ops: 0, Wtfs: 0
stats
4&gt; ex09:client(Pid, double, 10).
20
5&gt; ex09:client(Pid, triple, 10).
wtf</pre>
<p>At this point we added support for &#8220;triple&#8221; to the example and showed how the fully-qualified call to loop (using the modulename:fun() instead of fun() syntax) causes the newest version of the module to be used:</p>
<pre>6&gt; c(ex09).
{ok,ex09}
7&gt; ex09:client(Pid, triple, 10).
wtf
8&gt; Pid ! reload.
Reloading
reload
9&gt; ex09:client(Pid, triple, 10).
30
10&gt; Pid ! stats.
Ops: 2, Wtfs: 2
stats</pre>
<p>You can see from the stats at the end that the global state was kept &#8211; the server process stayed running throughout the code upgrade.</p>
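<p>The change made between steps 5 and 6 isn&#8217;t shown in the listing. A plausible version, assuming it was simply one more clause in loop/2, would look like the snippet here. It has to sit before the catch-all {Client, _, _Num} clause, otherwise it would never match:</p>
<pre>   {Client, triple, Num} -&gt;
     Client ! Num * 3,
     loop(Ops+1, Wtfs);</pre>
<p>That would also explain the transcript. Step 7 still returns wtf because the running process keeps making local calls to loop/2, which stay in the old version of the module. Only when it receives reload and makes the fully-qualified ?MODULE:loop(Ops, Wtfs) call does it jump into the newly compiled code, after which step 9 returns 30.</p>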
<h2>Download</h2>
<p>The slides, example code and basic mochiweb comet project we saw last night can be downloaded <a title="Slides and example code" href="http://www.metabrew.com/misc/erlang-hackspace-talk.tar.gz">here</a>. I should warn you that unless you saw my talk and the various explanations and disclaimers that went along with the code, it&#8217;s probably not a good place to start or learn from. Have a look at <a href="http://www.learnyousomeerlang.com/" target="_blank">www.learnyousomeerlang.com</a> or get one of the two excellent Erlang books.</p>
<h2>London Hackspace</h2>
<p>If you live in London you should know about this. <a href="http://russ.garrett.co.uk/" target="_blank">Russ</a> and <a href="http://jonty.co.uk/" target="_blank">Jonty</a> (who I worked with at Last.fm for years) started <a href="http://london.hackspace.org.uk/" target="_blank">London Hackspace</a>: &#8220;<strong>We run a dedicated space for people to learn and build things in London.&#8221;</strong> There are workshops at hackspace meetups on topics ranging from Arduino and electronics hacking, to iPhone development, to Erlang and beyond. Their unofficial slogan could be &#8220;Beer &amp; Hacking&#8221; &#8211; it&#8217;s a great place to meet people doing interesting things in London, and to learn new things.</p>
<p><a href="http://london.hackspace.org.uk/" target="_blank">http://london.hackspace.org.uk/</a></p>
<h2>Playdar</h2>
<p><a href="http://www.playdar.org/" target="_blank">Playdar</a> is my pet project at the moment. I talked about this last night too. I wrote it in C++ using Boost, mainly as an excuse to do something serious in C++. I&#8217;ve <a href="http://twitter.com/metabrew/status/4494402561" target="_blank">since seen the error of my masochistic ways</a> and in the last week I&#8217;ve tossed out the 10,000 lines of C++ and rewritten it in Erlang. I&#8217;m not quite finished, but once I have feature parity between the two codebases I&#8217;ll write an article comparing the two.  As you might expect, the Erlang codebase is far superior in almost every way.</p>
<p><a href="http://www.playdar.org/" target="_blank">http://www.playdar.org/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/erlang-talk-at-london-hackspace//feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Anti-RDBMS: A list of distributed key-value stores</title>
		<link>http://www.metabrew.com/article/anti-rdbms-a-list-of-distributed-key-value-stores/</link>
		<comments>http://www.metabrew.com/article/anti-rdbms-a-list-of-distributed-key-value-stores/#comments</comments>
		<pubDate>Mon, 19 Jan 2009 19:38:43 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[databases]]></category>
		<category><![CDATA[dht]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[hashing]]></category>
		<category><![CDATA[java]]></category>
		<category><![CDATA[nosql]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=209</guid>
		<description><![CDATA[Please Note: this was written January 2009 &#8211; see the comments for updates and additional information. A lot has changed since I wrote this. - RJ Perhaps you&#8217;re considering using a dedicated key-value or document store instead of a traditional relational database. Reasons for this might include: You&#8217;re suffering from Cloud-computing Mania. You need an [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Please Note: this was written January 2009 &#8211; see the comments for updates and additional information. A lot has changed since I wrote this.<br />
- RJ</strong> <br/></p>
<p>Perhaps you&#8217;re considering using a dedicated key-value or document store instead of a traditional relational database. Reasons for this might include:</p>
<ol>
<li>You&#8217;re suffering from Cloud-computing Mania.</li>
<li>You need an excuse to &#8216;get your Erlang on&#8217;</li>
<li>You heard CouchDB was cool.</li>
<li>You hate MySQL, and although PostgreSQL is much better, it still doesn&#8217;t have decent replication. There&#8217;s no chance you&#8217;re buying Oracle licenses.</li>
<li>Your data is stored and retrieved mainly by primary key, without complex joins.</li>
<li>You have a non-trivial amount of data, and the thought of managing lots of RDBMS shards and replication failure scenarios gives you the fear.</li>
</ol>
<p>Whatever your reasons, there are a lot of options to choose from. At Last.fm we do a lot of batch computation in Hadoop, then dump it out to other machines where it&#8217;s indexed and served up over HTTP and <a href="http://developers.facebook.com/thrift/">Thrift</a> as an internal service (stuff like &#8216;most popular songs in London, UK this week&#8217; etc). Presently we&#8217;re using a home-grown index format which points into large files containing lots of data spanning many keys, similar to the Haystack approach mentioned in <a href="http://perspectives.mvdirona.com/2008/06/30/FacebookNeedleInAHaystackEfficientStorageOfBillionsOfPhotos.aspx">this article about Facebook photo storage</a>. It works, but rather than build our own replication and partitioning system on top of this, we are looking to potentially replace it with a distributed, resilient key-value store for reasons 4, 5 and 6 above.</p>
<p>This article represents my notes and research to date on distributed key-value stores (and some other stuff) that might be suitable as RDBMS replacements under the right conditions. I&#8217;m expecting to try some of these out and investigate further in the coming months.</p>
<h4>Glossary and Background Reading</h4>
<ul>
<li><a href="http://en.wikipedia.org/wiki/Distributed_hash_table">Distributed Hash Table (DHT)</a> and algorithms such as Chord or Kademlia</li>
<li><a href="http://www.allthingsdistributed.com/2007/10/amazons_dynamo.html">Amazon&#8217;s Dynamo Paper</a>, and <a href="http://www.readwriteweb.com/archives/amazon_dynamo.php">this ReadWriteWeb article about Dynamo</a> which explains why such a system is invaluable</li>
<li><a href="http://aws.amazon.com/simpledb/">Amazon&#8217;s SimpleDB Service</a>, and <a href="http://gigaom.com/2007/12/14/amazon-simple-db/">some</a> <a href="http://www.satine.org/archives/2007/12/13/amazon-simpledb/">commentary</a></li>
<li><a href="http://labs.google.com/papers/bigtable.html">Google&#8217;s BigTable paper</a></li>
<li><a href="http://en.wikipedia.org/wiki/Paxos_algorithm">The Paxos Algorithm</a> &#8211; read this page in order to appreciate that knocking up a Paxos implementation isn&#8217;t something you&#8217;d want to do whilst hungover on a Saturday morning.</li>
</ul>
<h3>The Shortlist</h3>
<p>Here is a list of projects that could potentially replace a group of relational database shards. Some of these are much more than key-value stores, and aren&#8217;t suitable for low-latency data serving, but are interesting nonetheless.</p>
<style type="text/css">
#matrix td{ font-size:90%; vertical-align:top; padding: 3px; } #matrix tr { background: #f0f0f0; } #matrix tr.odd { background: #ddd; } 
#matrix td.bigger {font-size:100%;}
</style>
<table id="matrix" border="0">
<tbody>
<tr class="odd" style="font-weight:bold;">
<td class="bigger">Name</td>
<td>Language</td>
<td>Fault-tolerance</td>
<td>Persistence</td>
<td>Client Protocol</td>
<td>Data model</td>
<td>Docs</td>
<td>Community</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://project-voldemort.com/" href="http://project-voldemort.com/">Project Voldemort</a></td>
<td>Java</td>
<td>partitioned, replicated, read-repair</td>
<td>Pluggable: BerkeleyDB, MySQL</td>
<td>Java API</td>
<td>Structured / blob / text</td>
<td>A</td>
<td>LinkedIn, no</td>
</tr>
<tr class="odd">
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://github.com/tuulos/ringo/tree/master" href="http://github.com/tuulos/ringo/tree/master">Ringo</a></td>
<td>Erlang</td>
<td>partitioned, replicated, immutable</td>
<td>Custom on-disk (append only log)</td>
<td>HTTP</td>
<td>blob</td>
<td>B</td>
<td>Nokia, no</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://code.google.com/p/scalaris/" href="http://code.google.com/p/scalaris/">Scalaris</a></td>
<td>Erlang</td>
<td>partitioned, replicated, paxos</td>
<td>In-memory only</td>
<td>Erlang, Java, HTTP</td>
<td>blob</td>
<td>B</td>
<td>OnScale, no</td>
</tr>
<tr class="odd">
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://kai.wiki.sourceforge.net/" href="http://kai.wiki.sourceforge.net/">Kai</a></td>
<td>Erlang</td>
<td>partitioned, replicated?</td>
<td>On-disk Dets file</td>
<td>Memcached</td>
<td>blob</td>
<td>C</td>
<td>no</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://github.com/cliffmoon/dynomite/tree/master" href="http://github.com/cliffmoon/dynomite/tree/master">Dynomite</a></td>
<td>Erlang</td>
<td>partitioned, replicated</td>
<td>Pluggable: couch, dets</td>
<td>Custom ascii, Thrift</td>
<td>blob</td>
<td>D+</td>
<td>Powerset, no</td>
</tr>
<tr class="odd">
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://memcachedb.org/" href="http://memcachedb.org/">MemcacheDB</a></td>
<td>C</td>
<td>replication</td>
<td>BerkeleyDB</td>
<td>Memcached</td>
<td>blob</td>
<td>B</td>
<td>some</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://code.google.com/p/thrudb/" href="http://code.google.com/p/thrudb/">ThruDB</a></td>
<td>C++</td>
<td>Replication</td>
<td>Pluggable: BerkeleyDB, Custom, MySQL, S3</td>
<td>Thrift</td>
<td>Document oriented</td>
<td>C+</td>
<td>Third rail, unsure</td>
</tr>
<tr class="odd">
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://couchdb.apache.org/" href="http://couchdb.apache.org/">CouchDB</a></td>
<td>Erlang</td>
<td>Replication, partitioning?</td>
<td>Custom on-disk</td>
<td>HTTP, json</td>
<td>Document oriented (json)</td>
<td>A</td>
<td>Apache, yes</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://code.google.com/p/the-cassandra-project/" href="http://code.google.com/p/the-cassandra-project/">Cassandra</a></td>
<td>Java</td>
<td>Replication, partitioning</td>
<td>Custom on-disk</td>
<td>Thrift</td>
<td>Bigtable meets Dynamo</td>
<td>F</td>
<td>Facebook, no</td>
</tr>
<tr class="odd">
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://hadoop.apache.org/hbase/" href="http://hadoop.apache.org/hbase/">HBase</a></td>
<td>Java</td>
<td>Replication, partitioning</td>
<td>Custom on-disk</td>
<td>Custom API, Thrift, REST</td>
<td>Bigtable</td>
<td>A</td>
<td>Apache, yes</td>
</tr>
<tr>
<td style="font-weight:bold" class="bigger"><a class="external text" title="http://hypertable.org/" href="http://hypertable.org/">Hypertable</a></td>
<td>C++</td>
<td>Replication, partitioning</td>
<td>Custom on-disk</td>
<td>Thrift, other</td>
<td>Bigtable</td>
<td>A</td>
<td>Zvents, Baidu, yes</td>
</tr>
</tbody>
</table>
<p><br/></p>
<h3>Why 5 of these aren&#8217;t suitable</h3>
<p>What I&#8217;m really looking for is a low latency, replicated, distributed key-value store. Something that scales well as you feed it more machines, and doesn&#8217;t require much setup or maintenance &#8211; it should just work. The API should be that of a simple hashtable: set(key, val), get(key), delete(key). This would dispense with the hassle of managing a sharded / replicated database setup, and hopefully be capable of serving up data by primary key efficiently.</p>
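<p>To make that interface concrete, here is a toy, single-node, in-memory sketch of the three operations in Erlang. It only illustrates the shape of the API: the module name is made up, and there is no replication, partitioning or persistence, which is exactly what the projects on the list are supposed to provide.</p>
<pre>-module(kv_sketch).
-export([new/0, set/3, get/2, delete/2]).

%% An empty store; a plain map stands in for the whole cluster.
new() -&gt; #{}.

%% set(key, val): returns the updated store.
set(Key, Val, Store) -&gt; maps:put(Key, Val, Store).

%% get(key): returns {ok, Val}, or error if the key is missing.
get(Key, Store) -&gt; maps:find(Key, Store).

%% delete(key): returns the store without the key.
delete(Key, Store) -&gt; maps:remove(Key, Store).</pre>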
<p>Five of the projects on the list are far from being simple key-value stores, and as such don&#8217;t meet the requirements &#8211; but they are definitely worth a mention. </p>
<p><b>1)</b> We&#8217;re already heavy users of Hadoop, and have been experimenting with <strong>HBase</strong> for a while. It&#8217;s much more than a KV store, but latency is too great to serve data to the website. We will probably use HBase internally for other stuff though &#8211; we already have stacks of data in HDFS.</p>
<p><b>2)</b> <strong>Hypertable</strong> provides a similar feature set to HBase (both are inspired by Google&#8217;s Bigtable). They recently announced a new sponsor, Baidu &#8211; the biggest Chinese search engine. Definitely one to watch, but it doesn&#8217;t fit the low-latency KV store bill either.</p>
<p><b>3)</b> <strong>Cassandra</strong> sounded very promising when the source was released by Facebook last year. They use it for inbox search. It&#8217;s Bigtable-esque, but uses a DHT so doesn&#8217;t need a central server (one of the Cassandra developers previously worked at Amazon on Dynamo). Unfortunately it&#8217;s languished in relative obscurity since release, because Facebook never really seemed interested in it as an open-source project. From what I can tell there isn&#8217;t much in the way of documentation or a community around the project at present.</p>
<p><b>4)</b> <strong>CouchDB</strong> is an interesting one &#8211; it&#8217;s a &#8220;distributed, fault-tolerant and schema-free document-oriented database accessible via a RESTful HTTP/JSON API&#8221;. Data is stored in &#8216;documents&#8217;, which are essentially key-value maps themselves, using the data types you see in JSON.  Read the <a href="http://couchdb.apache.org/docs/overview.html">CouchDB Technical Overview</a> if you are curious how the web&#8217;s trendiest document database works under the hood. This article on the <a href="http://push.cx/2009/rules-of-database-app-aging">Rules of Database App Aging</a> goes some way to explaining why document-oriented databases make sense. CouchDB can do full-text indexing of your documents, and lets you express views over your data in JavaScript. I could imagine using CouchDB to store lots of data on users: name, age, sex, address, IM name and lots of other fields, many of which could be null, and each site update adds or changes the available fields. In situations like that it quickly gets unwieldy adding and changing columns in a database, and updating versions of your application code to match. Although many people are using CouchDB in production, their FAQ points out they may still make backwards-incompatible changes to the storage format and API before version 1.0.</p>
<p><b>5)</b> <strong>ThruDB</strong> is a document storage and indexing system made up of four components: a document storage service, indexing service, message queue and proxy. It uses Thrift for communication, and has a pluggable storage subsystem, including an Amazon S3 option. It&#8217;s designed to scale well horizontally, and might be a better option than CouchDB if you are running on EC2. I&#8217;ve heard a lot more about CouchDB than ThruDB recently, but it&#8217;s definitely worth a look if you need a document database. It&#8217;s not suitable for our needs for the same reasons as CouchDB. </p>
<h3>Distributed key-value stores</h3>
<p>The rest are much closer to being &#8216;simple&#8217; key-value stores with low enough latency to be used for serving data used to build dynamic pages. Latency will be dependent on the environment, and whether or not the dataset fits in memory. If it does, I&#8217;d expect sub-10ms response time, and if not, it all depends on how much money you spent on spinning rust.</p>
<p><b>MemcacheDB</b> is essentially just memcached that saves stuff to disk using a Berkeley database. As useful as this may be for some situations, it doesn&#8217;t deal with replication and partitioning (sharding), so it would still require a lot of work to make it scale horizontally and be tolerant of machine failure. Other memcached derivatives such as <a href="http://repcached.lab.klab.org/">repcached</a> go some way to addressing this by giving you the ability to replicate entire memcache servers (async master-slave setup), but without partitioning it&#8217;s still going to be a pain to manage.</p>
<p><b>Project Voldemort</b> looks <i>awesome</i>. Go and read the <a href="http://project-voldemort.com/">rather splendid website</a>, which explains how it works, and includes pretty diagrams and a good description of how consistent hashing is used in the Design section. (If consistent hashing butters your muffin, check out <a href="http://www.last.fm/user/RJ/journal/2007/04/10/rz_libketama_-_a_consistent_hashing_algo_for_memcache_clients">libketama &#8211; a consistent hashing library</a> and the <a href="http://www.metabrew.com/article/erlang-libketama-driver-consistent-hashing/">Erlang libketama driver</a>). Project-Voldemort handles replication and partitioning of data, and appears to be well written and designed. It&#8217;s reassuring to read in the docs how easy it is to swap out and mock different components for testing. It&#8217;s non-trivial to add nodes to a running cluster, but according to the mailing-list this is being worked on. It sounds like this would fit the bill if we ran it with a Java load-balancer service (see their Physical Architecture Options diagram) that exposed a Thrift API so all our non-Java clients could use it.</p>
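<p>To make the consistent hashing idea a bit more concrete, here is a minimal Python sketch of a hash ring. It is not Project Voldemort&#8217;s or libketama&#8217;s actual code: the <code>HashRing</code> class, the node names, the choice of MD5 and the 100 points per node are all assumptions made for illustration.</p>

```python
import bisect
import hashlib


class HashRing:
    """Toy consistent-hash ring: each node owns several points on a circle."""

    def __init__(self, nodes, points_per_node=100):
        self._ring = []  # sorted list of (point, node) pairs
        for node in nodes:
            for i in range(points_per_node):
                self._ring.append((self._hash(f"{node}-{i}"), node))
        self._ring.sort()
        self._points = [point for point, _ in self._ring]

    @staticmethod
    def _hash(value):
        # First 8 bytes of MD5 as an integer; any stable hash would do.
        digest = hashlib.md5(value.encode("utf-8")).digest()
        return int.from_bytes(digest[:8], "big")

    def node_for(self, key):
        # Walk clockwise to the first point at or after the key's hash,
        # wrapping round to the start of the ring.
        idx = bisect.bisect_left(self._points, self._hash(key))
        return self._ring[idx % len(self._ring)][1]


# Hypothetical cluster: compare key placement before and after adding a node.
ring = HashRing(["node-a", "node-b", "node-c"])
bigger = HashRing(["node-a", "node-b", "node-c", "node-d"])
keys = [f"user:{n}" for n in range(1000)]
moved = sum(1 for k in keys if ring.node_for(k) != bigger.node_for(k))
print(f"{moved} of {len(keys)} keys moved after adding a node")
```

<p>The point of the exercise is the last few lines: when a fourth node joins, only the keys that now belong to the new node change owner, rather than nearly everything being reshuffled as it would be with a plain hash-modulo-N scheme.</p>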
<p><b>Scalaris</b> is probably the most face-meltingly awesome thing you could build in Erlang. CouchDB, Ejabberd and RabbitMQ are cool, but Scalaris packs by far the most impressive collection of sexy technologies. Scalaris is a key-value store &#8211; it uses a modified version of the Chord algorithm to form a DHT, and stores the keys in lexicographical order, so range queries are possible. Although I didn&#8217;t see this explicitly mentioned, this should open up all sorts of interesting options for batch processing &#8211; map-reduce for example.  On top of the DHT they use an improved version of <a href="http://en.wikipedia.org/wiki/Paxos_algorithm">Paxos</a> to guarantee ACID properties when dealing with multiple concurrent transactions. So it&#8217;s a key-value store, but it can guarantee the ACID properties and do proper distributed transactions over multiple keys. </p>
<p>Oh, and to demonstrate how you can scale a webservice based on such a system, the Scalaris folk implemented their own version of Wikipedia on Scalaris, loaded in the Wikipedia data, and benchmarked their setup to prove it can do more transactions/sec on equal hardware than the classic PHP/MySQL combo that Wikipedia use. Yikes. </p>
<p>From what I can tell, Scalaris is only memory-resident at the moment and doesn&#8217;t persist data to disk. This makes it entirely impractical to actually run a service like Wikipedia on Scalaris for real &#8211; but it sounds like they tackled the hard problems first, and persisting to disk should be a walk in the park after you rolled your own version of Chord and made Paxos your bitch. Take a look at this presentation about Scalaris from the Erlang Exchange conference: <a href="http://video.google.com/videoplay?docid=6981137233069932108&#038;ei=caB0SaPUNIW0iALk-9CMBQ&#038;q=erlang+exchange">Scalaris presentation video</a>.</p>
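<p>As a rough illustration of why keeping keys in lexicographical order makes range queries possible, here is a single-process Python sketch. It has nothing to do with Scalaris&#8217; real implementation or API: <code>OrderedStore</code> and the example keys are made up, and there is no distribution, replication or transaction handling here.</p>

```python
import bisect


class OrderedStore:
    """Toy in-memory store that keeps its keys sorted."""

    def __init__(self):
        self._keys = []    # sorted list of keys
        self._values = {}  # key -> value

    def set(self, key, value):
        if key not in self._values:
            bisect.insort(self._keys, key)
        self._values[key] = value

    def get(self, key):
        return self._values.get(key)

    def range(self, start, end):
        """Return (key, value) pairs from start (inclusive) to end (exclusive)."""
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, end)
        return [(k, self._values[k]) for k in self._keys[lo:hi]]


store = OrderedStore()
for name in ["artist:radiohead", "user:alice", "user:bob", "venue:koko"]:
    store.set(name, len(name))

# ';' sorts directly after ':', so this selects every key starting with "user:".
print(store.range("user:", "user;"))
```

<p>With a plain hashed keyspace the same query would mean touching every key; with ordered keys it is two binary searches and a slice, which is what makes batch jobs over a key prefix attractive.</p>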
<p>The remaining projects, <b>Dynomite</b>, <b>Ringo</b> and <b>Kai</b> are all, more or less, trying to be Dynamo. Of the three, <b>Ringo</b> looks to be the most specialist &#8211; it makes a distinction between small (less than 4KB) and medium-size data items (less than 100MB). Medium-sized items are stored in individual files, whereas small items are all stored in an append-log, the index of which is read into memory at startup. From what I can tell, Ringo can be used in conjunction with the Erlang map-reduce framework Nokia are working on called <a href="http://discoproject.org">Disco</a>.</p>
<p>I didn&#8217;t find out much about <b>Kai</b> other than that it&#8217;s rather new, plus some mentions in Japanese. You can choose either Erlang ets or dets as the storage system (memory or disk, respectively), and it uses the memcached protocol, so it will already have client libraries in many languages.</p>
<p><b>Dynomite</b> doesn&#8217;t have great documentation, but it seems to be more capable than Kai, and is under active development. It has pluggable backends including the storage mechanism from CouchDB, so the 2GB file limit in dets won&#8217;t be an issue. Also I heard that Powerset are using it, so that&#8217;s encouraging. </p>
<h3>Summary</h3>
<p>Scalaris is fascinating, and I hope I can find the time to experiment more with it, but it needs to save stuff to disk before it&#8217;d be useful for the kind of things we might use it for at Last.fm. </p>
<p>I&#8217;m keeping an eye on Dynomite &#8211; hopefully more information will surface about what Powerset are doing with it, and how it performs at a large scale. </p>
<p>Based on my research so far, Project-Voldemort looks like the most suitable for our needs. I&#8217;d love to hear more about how it&#8217;s used at LinkedIn, and how many nodes they are running it on. </p>
<h3>What else is there?</h3>
<p>Here are some other related projects:</p>
<ul>
<li><a href="http://www.hazelcast.com/">Hazelcast</a> &#8211; Java DHT/clustering library</li>
<li><a href="http://blitiri.com.ar/p/nmdb/">nmdb</a> &#8211; a network database (dbm-style)</li>
<li><a href="http://open-chord.sourceforge.net/">Open Chord</a> &#8211; Java DHT</li>
</ul>
<p>If you know of anything I&#8217;ve missed off the list, or have any feedback/suggestions, please post a comment. I&#8217;m especially interested in hearing about people who&#8217;ve tested or are using KV-stores in lieu of relational databases.</p>
<p><b>UPDATE 1:</b> Corrected table: memcachedb does replication, as per BerkeleyDB.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/anti-rdbms-a-list-of-distributed-key-value-stores//feed</wfw:commentRss>
		<slash:comments>157</slash:comments>
		</item>
		<item>
		<title>How we use IRC at Last.fm</title>
		<link>http://www.metabrew.com/article/how-we-use-irc-at-lastfm/</link>
		<comments>http://www.metabrew.com/article/how-we-use-irc-at-lastfm/#comments</comments>
		<pubDate>Thu, 08 Jan 2009 20:45:05 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[irc]]></category>
		<category><![CDATA[java]]></category>
		<category><![CDATA[last.fm]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=207</guid>
		<description><![CDATA[Everyone that works at Last.fm is typically connected to our IRC server. We have different channels per team, as well as a company-wide channel, and a few channels dedicated to automated monitoring. Sometimes it makes much more sense to discuss / ask questions on IRC instead of email, and it&#8217;s useful to be able to [...]]]></description>
			<content:encoded><![CDATA[<p>Everyone that works at Last.fm is typically connected to our IRC server. We have different channels per team, as well as a company-wide channel, and a few channels dedicated to automated monitoring.</p>
<p>Sometimes it makes much more sense to discuss / ask questions on IRC instead of email, and it&#8217;s useful to be able to raise people who are not in the office. That said, the main reason I&#8217;m writing this post is to mention the dev-support bot we use: irccat.</p>
<p><strong>IRCCat &#8211; Development support bot</strong></p>
<p>The irccat bot joins all your channels, and waits for messages on a specified ip:port on your internal network. Anything you send to that port will be sent to IRC by the bot. IRCCat &#8211; as in, `cat` to IRC.</p>
<p>Using netcat, you can easily send events to irc from shell scripts:</p>
<blockquote><p>$  echo &#8220;Something just happened&#8221; | nc -q0 somemachine 12345</p></blockquote>
<p>That will send to the default channel only (first in the config file). You can direct messages to specific combinations of channels (#) or users (@) like so:</p>
<blockquote><p>$  echo &#8220;#syschan Starting backup job&#8221; | nc -q0 somemachine 12345</p>
<p>$  echo &#8220;#musicteam,#legal,@alice New album uploaded: &#8230;&#8221; | nc -q0 somemachine 12345</p></blockquote>
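<p>If netcat isn&#8217;t to hand, the same thing can be done from a script. The following is a small Python sketch, assuming only what is described above: the bot accepts one line of text over TCP, optionally prefixed with comma-separated #channel and @user targets. The helper names <code>build_line</code> and <code>send_to_irccat</code> are made up, and the host and port are the placeholders from the examples above.</p>

```python
import socket


def build_line(text, channels=(), users=()):
    """Prefix text with '#chan,@user' targets; no targets means the default channel."""
    targets = [f"#{c}" for c in channels] + [f"@{u}" for u in users]
    prefix = ",".join(targets)
    return f"{prefix} {text}" if prefix else text


def send_to_irccat(line, host="somemachine", port=12345, timeout=5.0):
    """Open a TCP connection, write one line, then close (roughly what nc -q0 does)."""
    with socket.create_connection((host, port), timeout=timeout) as conn:
        conn.sendall(line.encode("utf-8") + b"\n")


if __name__ == "__main__":
    # Only builds and prints the line; call send_to_irccat(...) to actually send it.
    print(build_line("New album uploaded", channels=["musicteam", "legal"], users=["alice"]))
```

<p>Keeping the line-building separate from the socket code makes it easy to check what would be sent without needing a running bot.</p>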
<p>Some of the things we automatically send to appropriate IRC channels:</p>
<ul>
<li>SVN commits</li>
<li>JIRA issue tracker updates</li>
<li>Nagios alerts for monitored hosts and services</li>
<li>Deployment notices to testing/staging/production</li>
<li>Results of automated tests if something bad happens</li>
<li>Links to pics from security camfeed when someone opens the office door out of hours</li>
</ul>
<p>We also post messages from automated backup jobs etc, which helps correlate such events with any unusual load spikes or glitches in usually-smooth graphs.</p>
<p>In addition to providing a cat-to-irc conduit, irccat will also hand off commands to a script you can provide. We use this to expose lookup tools and some admin functions to our support staff and developers. The handler script we use is PHP, and has access to our core website libs. Typing &#8220;?pokereleasenode&#8221;, &#8220;?lookup user RJ&#8221; or &#8220;?uncache artist Radiohead&#8221; is faster than writing a throw-away script, more accessible to non-developers, less hassle than a web interface and creates a public log so people can see what&#8217;s going on.</p>
<p>The bot is written in Java; it&#8217;s easy to build and configure, and all the deps are included:</p>
<p><a title="IRCcat source on GitHub" href="http://github.com/RJ/irccat/tree/master" target="_blank">http://github.com/RJ/irccat/tree/master</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/how-we-use-irc-at-lastfm//feed</wfw:commentRss>
		<slash:comments>54</slash:comments>
		</item>
		<item>
		<title>Getting to know ejabberd and writing modules</title>
		<link>http://www.metabrew.com/article/getting-to-know-ejabberd-and-writing-modules/</link>
		<comments>http://www.metabrew.com/article/getting-to-know-ejabberd-and-writing-modules/#comments</comments>
		<pubDate>Sun, 23 Nov 2008 21:53:17 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[ejabberd]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[mnesia]]></category>
		<category><![CDATA[thrift]]></category>
		<category><![CDATA[xmpp]]></category>
		<category><![CDATA[yaws]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=203</guid>
		<description><![CDATA[I started poking around in the ejabberd source code to see what I could learn. I couldn&#8217;t find much in the way of high level documentation that talks about how the various bits of ejabberd talk to each other, so I&#8217;m starting to piece it together myself. After compiling ejabberd I made a php script [...]]]></description>
			<content:encoded><![CDATA[<p>I started poking around in the ejabberd source code to see what I could learn. I couldn&#8217;t find much in the way of high level documentation that talks about how the various bits of ejabberd talk to each other, so I&#8217;m starting to piece it together myself.</p>
<p>After compiling ejabberd I made a php script I could use with the external authentication system. Here&#8217;s a version that supports just two hardcoded users:</p>
<p>ejabberd.cfg:<br />
<code>{auth_method, external}.<br />
{extauth_program, "/tmp/auth.php"}.</code><br />
<br/></p>
<p>auth.php:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="co2">#!/usr/bin/php</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw2">&lt;?</span></div>
</li>
<li class="li1">
<div class="de1"><span class="re0">$fh</span> &nbsp;= <a href="http://www.php.net/fopen"><span class="kw3">fopen</span></a><span class="br0">&#40;</span><span class="st0">&quot;php://stdin&quot;</span>, <span class="st0">&#8216;r&#8217;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw1">if</span><span class="br0">&#40;</span>!<span class="re0">$fh</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <a href="http://www.php.net/die"><span class="kw3">die</span></a><span class="br0">&#40;</span><span class="st0">&quot;Cannot open STDIN<span class="es0">\n</span>&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="re0">$users</span> = <a href="http://www.php.net/array"><span class="kw3">array</span></a><span class="br0">&#40;</span><span class="st0">&#8216;user1&#8242;</span>=&gt;<span class="st0">&#8216;password1&#8242;</span>, <span class="st0">&#8216;user2&#8242;</span>=&gt;<span class="st0">&#8216;password2&#8242;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw1">do</span><span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">$lenBytes</span> = <a href="http://www.php.net/fgets"><span class="kw3">fgets</span></a><span class="br0">&#40;</span><span class="re0">$fh</span>, <span class="nu0">3</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">$len</span> = <a href="http://www.php.net/unpack"><span class="kw3">unpack</span></a><span class="br0">&#40;</span><span class="st0">&#8216;n&#8217;</span>, <span class="re0">$lenBytes</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">$len</span> = <span class="re0">$len</span><span class="br0">&#91;</span><span class="nu0">1</span><span class="br0">&#93;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span><span class="re0">$len</span>&lt;<span class="nu0">1</span><span class="br0">&#41;</span> <span class="kw1">continue</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">$msg</span> = <a href="http://www.php.net/fgets"><span class="kw3">fgets</span></a><span class="br0">&#40;</span><span class="re0">$fh</span>, <span class="re0">$len</span><span class="nu0">+1</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">$toks</span>=<a href="http://www.php.net/explode"><span class="kw3">explode</span></a><span class="br0">&#40;</span><span class="st0">&#8216;:&#8217;</span>,<span class="re0">$msg</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">$method</span> = <a href="http://www.php.net/array_shift"><span class="kw3">array_shift</span></a><span class="br0">&#40;</span><span class="re0">$toks</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">switch</span><span class="br0">&#40;</span><span class="re0">$method</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="st0">&#8216;auth&#8217;</span>:</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/list"><span class="kw3">list</span></a><span class="br0">&#40;</span><span class="re0">$username</span>, <span class="re0">$server</span>, <span class="re0">$password</span><span class="br0">&#41;</span> = <span class="re0">$toks</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>@<span class="re0">$users</span><span class="br0">&#91;</span><span class="re0">$username</span><span class="br0">&#93;</span> == <span class="re0">$password</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/print"><span class="kw3">print</span></a> <a href="http://www.php.net/pack"><span class="kw3">pack</span></a><span class="br0">&#40;</span><span class="st0">&quot;nn&quot;</span>, <span class="nu0">2</span>, <span class="nu0">1</span><span class="br0">&#41;</span>; <span class="co1">// ok</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span><span class="kw1">else</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/print"><span class="kw3">print</span></a> <a href="http://www.php.net/pack"><span class="kw3">pack</span></a><span class="br0">&#40;</span><span class="st0">&quot;nn&quot;</span>, <span class="nu0">2</span>, <span class="nu0">0</span><span class="br0">&#41;</span>; <span class="co1">// fail</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">break</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="st0">&#8216;isuser&#8217;</span>:</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/list"><span class="kw3">list</span></a><span class="br0">&#40;</span><span class="re0">$username</span>, <span class="re0">$server</span><span class="br0">&#41;</span> = <span class="re0">$toks</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span><a href="http://www.php.net/isset"><span class="kw3">isset</span></a><span class="br0">&#40;</span><span class="re0">$users</span><span class="br0">&#91;</span><span class="re0">$username</span><span class="br0">&#93;</span><span class="br0">&#41;</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/print"><span class="kw3">print</span></a> <a href="http://www.php.net/pack"><span class="kw3">pack</span></a><span class="br0">&#40;</span><span class="st0">&quot;nn&quot;</span>, <span class="nu0">2</span>, <span class="nu0">1</span><span class="br0">&#41;</span>; <span class="co1">// yes</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span><span class="kw1">else</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/print"><span class="kw3">print</span></a> <a href="http://www.php.net/pack"><span class="kw3">pack</span></a><span class="br0">&#40;</span><span class="st0">&quot;nn&quot;</span>, <span class="nu0">2</span>, <span class="nu0">0</span><span class="br0">&#41;</span>; <span class="co1">// nope</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">break</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw2">default</span>:</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.php.net/print"><span class="kw3">print</span></a> <a href="http://www.php.net/pack"><span class="kw3">pack</span></a><span class="br0">&#40;</span><span class="st0">&quot;nn&quot;</span>, <span class="nu0">2</span>, <span class="nu0">0</span><span class="br0">&#41;</span>;<span class="co1">// fail</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span><span class="kw1">while</span><span class="br0">&#40;</span><span class="kw2">true</span><span class="br0">&#41;</span>;</div>
</li>
</ol>
</div>
<p> <br/></p>
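<p>The wire format that auth.php speaks is easy to miss among the syntax highlighting, so here is a Python sketch of just the framing, inferred from the script above: a 2-byte big-endian length, then a colon-separated payload, answered with two 16-bit big-endian integers. The function names are made up for illustration, and this only mirrors what the PHP does; it is not taken from the ejabberd source.</p>

```python
import struct


def encode_request(*fields):
    """Frame a request the way auth.php expects to read it."""
    payload = ":".join(fields).encode("utf-8")
    return struct.pack(">H", len(payload)) + payload


def decode_request(packet):
    """Split a framed request into (method, args)."""
    (length,) = struct.unpack(">H", packet[:2])
    method, *args = packet[2:2 + length].decode("utf-8").split(":")
    return method, args


def encode_response(ok):
    """Equivalent of PHP's pack("nn", 2, 1) for success or pack("nn", 2, 0) for failure."""
    return struct.pack(">HH", 2, 1 if ok else 0)


# Hypothetical credentials, matching the hardcoded users in auth.php.
packet = encode_request("auth", "user1", "localhost", "password1")
print(packet)
print(decode_request(packet))
print(encode_response(True))
```

<p>One thing the sketch shares with the PHP version: splitting on every colon means a password containing &#8216;:&#8217; would be cut short, so a real handler would want to limit the number of splits.</p>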
<p>I stripped down the ejabberd config to just load what I considered the bare essentials. Here is the modules section I&#8217;m testing with:</p>
<p>From ejabberd.cfg:<br />
<code>{modules,<br />
 [<br />
  {mod_caps,     []},<br />
  {mod_disco,    []},<br />
  {mod_roster,   []},<br />
  {mod_pubsub,   [ % requires mod_caps<br />
                  {access_createnode, pubsub_createnode},<br />
                  {plugins, ["default", "pep"]}<br />
                 ]},<br />
  {mod_mnesiaweb,     []},<br />
  {mod_thriftctl,     []}<br />
 ]}.</code><br/></p>
<p>mod_disco deals with discovery, so clients can find out what the server supports. mod_roster deals with rosters (buddy lists etc) using mnesia. mod_pubsub is enabled because I want to use <a href="http://xmpp.org/extensions/xep-0118.html">User Tune</a>, an extension that lets you broadcast the name of the song you are playing to everyone in your roster. mod_caps provides <a href="http://xmpp.org/extensions/xep-0115.html">XEP-0115</a> &#8211; an extension for broadcasting and dynamically discovering client, device, or generic entity capabilities. mod_caps is a requirement of mod_pubsub.</p>
<p>I&#8217;ve removed the module that allows users to register, although I made a few accounts first whilst testing. The last two modules, mod_mnesiaweb and mod_thriftctl are modules I wrote.</p>
<h2>mod_mnesiaweb</h2>
<p>To help figure out what&#8217;s going on inside of ejabberd, it&#8217;s useful to be able to easily browse the mnesia database. <a href="http://yaws.hyber.org/">Yaws</a> comes with an appmod that does this, called ymnesia. This ejabberd module will start yaws in embedded mode and run this appmod, enabling you to explore the mnesia database from a web browser.</p>
<p><i><b>Yaws observation:</b> yaws didn&#8217;t appear to build ymnesia by default, so I edited the Makefile in src and added &#8220;ymnesia&#8221; to the module list. Also, if ./configure fails, the package you are probably missing is libpam0g-dev.</i></p>
<p>mod_mnesiaweb:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="co1">% Ejabberd module that runs yaws in embedded mode,</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% and loads the ymnesia appmod for browsing mnesia.</span></div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>mod_mnesiaweb<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">author</span><span class="br0">&#40;</span><span class="st0">&#8216;rj@last.fm&#8217;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;/usr/local/lib/yaws/include/yaws.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">behaviour</span><span class="br0">&#40;</span>gen_mod<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">2</span>, stop/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span>_<span class="re0">Host</span>, <span class="re0">Opts</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Port</span> = gen_mod:<span class="me2">get_opt</span><span class="br0">&#40;</span>port, <span class="re0">Opts</span>, <span class="nu0">8001</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; code:<span class="me2">add_path</span><span class="br0">&#40;</span><span class="st0">&quot;/usr/local/lib/yaws/ebin&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; application:<span class="me2">set_env</span><span class="br0">&#40;</span>yaws, embedded, true<span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; application:<span class="me2">start</span><span class="br0">&#40;</span>yaws<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">GC</span> = yaws_config:<span class="me2">make_default_gconf</span><span class="br0">&#40;</span>false,<span class="st0">&quot;yawstest&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">SC</span> = #sconf<span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; port = <span class="re0">Port</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; servername = <span class="st0">&quot;ejabnesia&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw3">listen</span> = <span class="br0">&#123;</span><span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">0</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; appmods = <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;showdb&quot;</span>, ymnesia<span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; docroot = <span class="st0">&quot;wwwroot&quot;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; yaws_api:<span class="me2">setconf</span><span class="br0">&#40;</span><span class="re0">GC</span>, <span class="br0">&#91;</span><span class="br0">&#91;</span><span class="re0">SC</span><span class="br0">&#93;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">stop<span class="br0">&#40;</span>_<span class="re0">Host</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">application</span>:<span class="me2">stop</span><span class="br0">&#40;</span>yaws<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
</ol>
</div>
<p><br/></p>
<p>To compile it:<br />
<code>erlc -pa ${EJAB_SRC} -I ${EJAB_SRC} mod_mnesiaweb.erl</code><br />
where EJAB_SRC is the ejabberd-2.X.X/src directory, after you&#8217;ve compiled from source (so the beams are there too).</p>
<p>Copy the resulting mod_mnesiaweb.beam to /var/lib/ejabberd/ebin so ejabberd finds it, and it should work. Hit up http://localhost:8001/showdb/ in your browser and you can explore the mnesia database.</p>
<p>Use the match syntax to filter tables. For example to find everyone in my roster, I use this in the input box next to roster:<br />
<code>{roster,{"RJ",'_', {'_','_',[]}}, '_','_','_','_','_','_','_','_'}</code><br/><br />
Not pretty, but it gets the job done. You can just view the entire table, copy a record then replace fields with &#8216;_&#8217; to build queries.</p>
<h2>mod_thriftctl</h2>
<p>Next up I wanted to try the Erlang <a href="http://incubator.apache.org/thrift/">Thrift</a> bindings (written by the folks at <a href="http://amiest-devblog.blogspot.com/2008/01/alternative-erlang-bindings-for-thrift.html">Amie St.</a>), and expose some useful functionality for controlling the server.</p>
<p>If you aren&#8217;t familiar with Thrift, I recommend reading about it first. In a nutshell, you write your API using an IDL (a .thrift file) and the thrift compiler creates client libraries, and server code in various different languages. It&#8217;s an RPC mechanism, and useful in a mixed environment.</p>
<p>mod_thriftctl.thrift:<br />
<code>#!/usr/local/bin/thrift -php -erl</p>
<p>struct JabberUser {<br />
    1: string name,<br />
    2: string server<br />
}</p>
<p>service Ejabthrift {<br />
    /* add ruser to roster of luser, and vice versa. also routes presence to users if online  */<br />
    void add_friend(        1: JabberUser luser,<br />
                            2: JabberUser ruser<br />
                            ),</p>
<p>    /* remove ruser from luser's roster */<br />
    void remove_friend(    1: JabberUser luser, 2: JabberUser ruser ),</p>
<p>    /* make it look like fromuser sent a message to touser */<br />
    void spoof_message( 1: JabberUser fromuser, 2: JabberUser touser, 3: string message, 4: string subject ),<br />
    /* .. or a chat message */<br />
    void spoof_chat(    1: JabberUser fromuser, 2: JabberUser touser, 3: string message, 4: string thread ),</p>
<p>    /* sends PEP usertune message, see http://xmpp.org/extensions/xep-0118.html */<br />
    void publish_np ( 1: JabberUser fromuser, 2: string artist, 3: string album, 4: string track, 5: i32 tracklength, 6: i32 tracknum )<br />
}</code><br/></p>
<p>Run that .thrift file and you get gen-php and gen-erl directories, containing the PHP client code and the Erlang files needed to build a server.</p>
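<p>On the Erlang side, a Thrift struct such as JabberUser arrives as a record. The sketch below assumes the generated header defines a <code>jabberUser</code> record with <code>name</code> and <code>server</code> fields; the real definition lives in the generated mod_thriftctl_types.hrl, and the module and function names here are hypothetical.</p>

```erlang
-module(jabber_user_demo).
-export([new/2, bare_jid/1]).

%% Assumed shape of the record generated from `struct JabberUser`.
%% Defined locally so this sketch compiles without the generated headers.
-record(jabberUser, {name, server}).

%% Build a record the way a Thrift call would hand it to the server code.
new(Name, Server) ->
    #jabberUser{name = Name, server = Server}.

%% Pull the fields out by pattern matching in the function head
%% and join them into a "name@server" string.
bare_jid(#jabberUser{name = Name, server = Server}) ->
    Name ++ "@" ++ Server.
```

<p>For example, <code>jabber_user_demo:bare_jid(jabber_user_demo:new("rj", "example.org"))</code> gives <code>"rj@example.org"</code>.</p>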
<p>Here&#8217;s the ejabberd module, which starts a thrift server:</p>
<p>mod_thriftctl.erl:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="co1">%</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% A module to control ejabberd with a thrift interface.</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%</span></div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>mod_thriftctl<span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-<span class="kw2">author</span><span class="br0">&#40;</span><span class="st0">&#39;rj@last.fm&#39;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% ejabberd headers:</span></div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;ejabberd.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;mod_roster.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-include<span class="br0">&#40;</span><span class="st0">&quot;jlib.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% thrift server headers:</span></div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;thrift.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;transport/tSocket.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-include<span class="br0">&#40;</span><span class="st0">&quot;protocol/tBinaryProtocol.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;server/tErlServer.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;transport/tErlAcceptor.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% we are an ejabberd module:</span></div>
</li>
<li class="li2">
<div class="de2">-<span class="kw2">behaviour</span><span class="br0">&#40;</span>gen_mod<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">2</span>, stop/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% our thrift service:</span></div>
</li>
<li class="li1">
<div class="de1">-include<span class="br0">&#40;</span><span class="st0">&quot;ejabthrift_thrift.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-include<span class="br0">&#40;</span><span class="st0">&quot;mod_thriftctl_types.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span> &nbsp; add_friend/<span class="nu0">2</span>, remove_friend/<span class="nu0">2</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; spoof_message/<span class="nu0">4</span>, spoof_chat/<span class="nu0">4</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; publish_np/<span class="nu0">6</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% convert thrift Jabberuser into ejabberd jid</span></div>
</li>
<li class="li1">
<div class="de1">ju2jid<span class="br0">&#40;</span><span class="re0">Jabberuser</span><span class="br0">&#41;</span> when is_record<span class="br0">&#40;</span><span class="re0">Jabberuser</span>, jabber<span class="re0">User</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; #jid<span class="br0">&#123;</span> user=<span class="re0">Jabberuser</span>#jabber<span class="re0">User</span>.name, server=<span class="re0">Jabberuser</span>#jabber<span class="re0">User</span>.server, resource=<span class="st0">&quot;&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; luser=<span class="re0">Jabberuser</span>#jabber<span class="re0">User</span>.name, lserver=<span class="re0">Jabberuser</span>#jabber<span class="re0">User</span>.server, lresource=<span class="st0">&quot;&quot;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">spoof_message<span class="br0">&#40;</span> <span class="re0">FromU</span>, <span class="re0">ToU</span>, <span class="re0">Msg</span>, <span class="re0">Subject</span> <span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">F</span> = ju2jid<span class="br0">&#40;</span><span class="re0">FromU</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">T</span> = ju2jid<span class="br0">&#40;</span><span class="re0">ToU</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">XmlBody</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;message&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#91;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&quot;from&quot;</span>, jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">F</span><span class="br0">&#41;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&quot;to&quot;</span>, jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">T</span><span class="br0">&#41;</span><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#93;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#91;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;subject&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Subject</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;body&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Msg</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#93;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; ejabberd_router:<span class="me2">route</span><span class="br0">&#40;</span><span class="re0">F</span>, <span class="re0">T</span>, <span class="re0">XmlBody</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">spoof_chat<span class="br0">&#40;</span> <span class="re0">FromU</span>, <span class="re0">ToU</span>, <span class="re0">Msg</span>, <span class="re0">Thread</span> <span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">F</span> = ju2jid<span class="br0">&#40;</span><span class="re0">FromU</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">T</span> = ju2jid<span class="br0">&#40;</span><span class="re0">ToU</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">XmlBody</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;message&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;type&quot;</span>, <span class="st0">&quot;chat&quot;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&quot;from&quot;</span>, jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">F</span><span class="br0">&#41;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&quot;to&quot;</span>, jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">T</span><span class="br0">&#41;</span><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#93;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#91;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;thread&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Thread</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;body&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Msg</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#93;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; ejabberd_router:<span class="me2">route</span><span class="br0">&#40;</span><span class="re0">F</span>, <span class="re0">T</span>, <span class="re0">XmlBody</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">publish_np<span class="br0">&#40;</span> <span class="re0">FromU</span>, <span class="re0">ArtistS</span>, <span class="re0">AlbumS</span>, <span class="re0">TrackS</span>, <span class="re0">LengthI</span>, <span class="re0">TrackNumI</span> <span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">From</span> = ju2jid<span class="br0">&#40;</span><span class="re0">FromU</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% The usertune message must contain binaries, not strings or ints</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">FromStr</span> &nbsp; &nbsp; = jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">From</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Artist</span> &nbsp; &nbsp; &nbsp;= list_to_binary<span class="br0">&#40;</span><span class="re0">ArtistS</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Album</span> &nbsp; &nbsp; &nbsp; = list_to_binary<span class="br0">&#40;</span><span class="re0">AlbumS</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Track</span> &nbsp; &nbsp; &nbsp; = list_to_binary<span class="br0">&#40;</span><span class="re0">TrackS</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Length</span> &nbsp; &nbsp; &nbsp;= list_to_binary<span class="br0">&#40;</span>io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;~w&quot;</span>,<span class="br0">&#91;</span><span class="re0">LengthI</span><span class="br0">&#93;</span><span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">TrackNum</span> &nbsp; &nbsp;= list_to_binary<span class="br0">&#40;</span>io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;~w&quot;</span>,<span class="br0">&#91;</span><span class="re0">TrackNumI</span><span class="br0">&#93;</span><span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Xml</span> = <span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;iq&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;from&quot;</span>, <span class="re0">FromStr</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span><span class="st0">&quot;type&quot;</span>,<span class="st0">&quot;set&quot;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span><span class="st0">&quot;id&quot;</span>,<span class="st0">&quot;pub1&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;pubsub&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;xmlns&quot;</span>,<span class="st0">&quot;http://jabber.org/protocol/pubsub&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;publish&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;node&quot;</span>,<span class="st0">&quot;http://jabber.org/protocol/tune&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;item&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;tune&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;xmlns&quot;</span>,<span class="st0">&quot;http://jabber.org/protocol/tune&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;artist&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Artist</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;length&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,<span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Length</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;source&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Album</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;title&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Track</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlelement,<span class="st0">&quot;track&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span>,<span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">TrackNum</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp; &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span> &nbsp;&quot;</span>&gt;&gt;<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>xmlcdata,&lt;&lt;<span class="st0">&quot;<span class="es0">\n</span>&quot;</span>&gt;&gt;<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% PEP means you act as a pubsub node yourself,</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="co1">% so it&#8217;s addressed to yourself and is broadcast to your friends automatically:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ejabberd_router:<span class="me2">route</span><span class="br0">&#40;</span><span class="re0">From</span>, <span class="re0">From</span>, <span class="re0">Xml</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% adds bi-directional friend relationship immediately for both users.</span></div>
</li>
<li class="li2">
<div class="de2">add_friend<span class="br0">&#40;</span> &nbsp; &nbsp; #jabber<span class="re0">User</span><span class="br0">&#123;</span>name=<span class="re0">LU</span>, server=<span class="re0">LS</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; #jabber<span class="re0">User</span><span class="br0">&#123;</span>name=<span class="re0">RU</span>, server=<span class="re0">RS</span><span class="br0">&#125;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">AskMessage</span> = <span class="st0">&quot;&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Group</span> = <span class="st0">&quot;&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Subtype</span> = both,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; subscribe<span class="br0">&#40;</span><span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">RU</span>, <span class="re0">Group</span>, <span class="re0">Subtype</span>, <span class="re0">AskMessage</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; subscribe<span class="br0">&#40;</span><span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">LU</span>, <span class="re0">Group</span>, <span class="re0">Subtype</span>, <span class="re0">AskMessage</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; route_rosteritem<span class="br0">&#40;</span><span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">RU</span>, <span class="re0">Group</span>, <span class="re0">Subtype</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; route_rosteritem<span class="br0">&#40;</span><span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">LU</span>, <span class="re0">Group</span>, <span class="re0">Subtype</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">remove_friend<span class="br0">&#40;</span> #jabber<span class="re0">User</span><span class="br0">&#123;</span>name=<span class="re0">LU</span>, server=<span class="re0">LS</span><span class="br0">&#125;</span>, #jabber<span class="re0">User</span><span class="br0">&#123;</span>name=<span class="re0">RU</span>, server=<span class="re0">RS</span><span class="br0">&#125;</span> <span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">unsubscribe</span><span class="br0">&#40;</span><span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">RU</span>, <span class="re0">RS</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; unsubscribe<span class="br0">&#40;</span><span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">LU</span>, <span class="re0">LS</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; route_rosteritem<span class="br0">&#40;</span><span class="re0">LU</span>, <span class="re0">LS</span>, <span class="re0">RU</span>, <span class="re0">RS</span>, <span class="st0">&quot;&quot;</span>, <span class="st0">&quot;&quot;</span>, <span class="st0">&quot;remove&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; route_rosteritem<span class="br0">&#40;</span><span class="re0">RU</span>, <span class="re0">RS</span>, <span class="re0">LU</span>, <span class="re0">LS</span>, <span class="st0">&quot;&quot;</span>, <span class="st0">&quot;&quot;</span>, <span class="st0">&quot;remove&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">unsubscribe<span class="br0">&#40;</span><span class="re0">LocalUser</span>, <span class="re0">LocalServer</span>, <span class="re0">RemoteUser</span>, <span class="re0">RemoteServer</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Key</span> = <span class="br0">&#123;</span><span class="br0">&#123;</span><span class="re0">LocalUser</span>,<span class="re0">LocalServer</span>,<span class="br0">&#123;</span><span class="re0">RemoteUser</span>,<span class="re0">RemoteServer</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span><span class="re0">LocalUser</span>,<span class="re0">LocalServer</span><span class="br0">&#125;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; mnesia:<span class="me2">transaction</span><span class="br0">&#40;</span>fun<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt; <span class="me1">mnesia</span>:<span class="me2">delete</span><span class="br0">&#40;</span>roster, <span class="re0">Key</span>, write<span class="br0">&#41;</span> <span class="kw1">end</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">route_rosteritem<span class="br0">&#40;</span><span class="re0">LocalUser</span>, <span class="re0">LocalServer</span>, <span class="re0">RemoteUser</span>, <span class="re0">RemoteServer</span>, <span class="re0">Nick</span>, <span class="re0">Group</span>, <span class="re0">Subscription</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">LJID</span> = jlib:<span class="me2">make_jid</span><span class="br0">&#40;</span><span class="re0">LocalUser</span>, <span class="re0">LocalServer</span>, <span class="st0">&quot;&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">RJID</span> = jlib:<span class="me2">make_jid</span><span class="br0">&#40;</span><span class="re0">RemoteUser</span>, <span class="re0">RemoteServer</span>, <span class="st0">&quot;&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">ToS</span> = jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">LJID</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">ItemJIDS</span> = jlib:<span class="me2">jid_to_string</span><span class="br0">&#40;</span><span class="re0">RJID</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">GroupXML</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;group&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>xmlcdata, <span class="re0">Group</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Item</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;item&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;jid&quot;</span>, <span class="re0">ItemJIDS</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span><span class="st0">&quot;name&quot;</span>, <span class="re0">Nick</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span><span class="st0">&quot;subscription&quot;</span>, <span class="re0">Subscription</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="re0">GroupXML</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Query</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;query&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;xmlns&quot;</span>, ?<span class="re0">NS_ROSTER</span><span class="br0">&#125;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="re0">Item</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">Packet</span> = <span class="br0">&#123;</span>xmlelement, <span class="st0">&quot;iq&quot;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;type&quot;</span>, <span class="st0">&quot;set&quot;</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span><span class="st0">&quot;to&quot;</span>, <span class="re0">ToS</span><span class="br0">&#125;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="re0">Query</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ejabberd_router:<span class="me2">route</span><span class="br0">&#40;</span><span class="re0">LJID</span>, <span class="re0">LJID</span>, <span class="re0">Packet</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">subscribe<span class="br0">&#40;</span><span class="re0">LocalUser</span>, <span class="re0">LocalServer</span>, <span class="re0">RemoteUser</span>, <span class="re0">RemoteServer</span>, <span class="re0">Nick</span>, <span class="re0">Group</span>, <span class="re0">Subscription</span>, <span class="re0">Xattrs</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">R</span> = #roster<span class="br0">&#123;</span>usj = <span class="br0">&#123;</span><span class="re0">LocalUser</span>,<span class="re0">LocalServer</span>,<span class="br0">&#123;</span><span class="re0">RemoteUser</span>,<span class="re0">RemoteServer</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; us = <span class="br0">&#123;</span><span class="re0">LocalUser</span>,<span class="re0">LocalServer</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; jid = <span class="br0">&#123;</span><span class="re0">RemoteUser</span>,<span class="re0">RemoteServer</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; name = <span class="re0">Nick</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; subscription = <span class="re0">Subscription</span>, <span class="co1">% none, to=you see him, from=he sees you, both</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ask = none, <span class="co1">% out=send request, in=somebody requests you, none</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; groups = <span class="br0">&#91;</span><span class="re0">Group</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; askmessage = <span class="re0">Xattrs</span>, <span class="co1">% example: [{&quot;category&quot;,&quot;conference&quot;}]</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; xs = <span class="br0">&#91;</span><span class="br0">&#93;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; mnesia:<span class="me2">transaction</span><span class="br0">&#40;</span>fun<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt; <span class="me1">mnesia</span>:<span class="me2">write</span><span class="br0">&#40;</span><span class="re0">R</span><span class="br0">&#41;</span> <span class="kw1">end</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">Host</span>, <span class="re0">Opts</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ?<span class="re0">INFO</span><span class="br0">&#40;</span><span class="st0">&quot;mod_ejabthrift start().&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">%% get options</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">Port</span> = gen_mod:<span class="me2">get_opt</span><span class="br0">&#40;</span>port, <span class="re0">Opts</span>, <span class="nu0">9000</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; spawn<span class="br0">&#40;</span>fun<span class="br0">&#40;</span><span class="br0">&#41;</span>-&gt; <span class="me1">thrift</span>:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#41;</span> <span class="kw1">end</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ?<span class="re0">INFO</span><span class="br0">&#40;</span><span class="st0">&quot;mod_ejabthrift thrift:start().&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">Handler</span> &nbsp; = ?<span class="re0">MODULE</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Processor</span> = ejabthrift_thrift,</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">TF</span> = t<span class="re0">BufferedTransportFactory</span>:<span class="me2">new</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">PF</span> = t<span class="re0">BinaryProtocolFactory</span>:<span class="me2">new</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">ServerTransport</span> = t<span class="re0">ErlAcceptor</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">ServerFlavor</span> &nbsp; &nbsp;= t<span class="re0">ErlServer</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Server</span> = oop:<span class="me2">start_new</span><span class="br0">&#40;</span><span class="re0">ServerFlavor</span>, <span class="br0">&#91;</span><span class="re0">Port</span>, <span class="re0">Handler</span>, <span class="re0">Processor</span>, <span class="re0">ServerTransport</span>, <span class="re0">TF</span>, <span class="re0">PF</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">case</span> ?<span class="re0">R0</span><span class="br0">&#40;</span><span class="re0">Server</span>, effectful_serve<span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok &nbsp; &nbsp;-&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; ?<span class="re0">INFO</span><span class="br0">&#40;</span><span class="st0">&quot;mod_ejabthrift: Thrift server (~s) listening on port ~w&quot;</span>,<span class="br0">&#91;</span><span class="re0">Host</span>, <span class="re0">Port</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% put Server into process dictionary (needed for clean stop)</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; put<span class="br0">&#40;</span>thrift_server_reference, <span class="re0">Server</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; ok;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Error</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; ?<span class="re0">ERROR_MSG</span><span class="br0">&#40;</span><span class="st0">&quot;mod_ejabthrift: Error starting thrift server: ~w&quot;</span>, <span class="br0">&#91;</span><span class="re0">Error</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Error</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">stop<span class="br0">&#40;</span>_<span class="re0">Host</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ?<span class="re0">C0</span><span class="br0">&#40;</span>get<span class="br0">&#40;</span>thrift_server_reference<span class="br0">&#41;</span>, stop<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
</ol>
</div>
<p><br/></p>
<p>To build, first build the gen-erl code:</p>
<p><code>erlc -pa ${EJAB_SRC} -I ${EJAB_SRC} -I ${ERL_THRIFT}/include -I ./gen-erl -o ./gen-erl ./gen-erl/*.erl</code></p>
<p>Where ERL_THRIFT is the lib/erl directory from the amiethrift code, git://repo.or.cz/amiethrift.git</p>
<p>Then compile the module:</p>
<p><code>erlc -pa ${EJAB_SRC} -I ${EJAB_SRC} -I ${ERL_THRIFT}/include -I ./gen-erl *.erl</code></p>
<p>To install, copy all the beam files to the ejabberd ebin dir:</p>
<p><code>sudo cp *.beam gen-erl/*.beam /var/lib/ejabberd/ebin/</code></p>
<p>This is inspired by mod_xmlrpc, which is in ejabberd-modules. As you can see from the start function, that&#8217;s all it takes to start a thrift server. It&#8217;s now trivial to call into ejabberd from other languages. For example, if you started listening to a song using a flash player on the website, a PHP web service could make a user tune announcement on your behalf, or spoof messages from you boasting how much you love listening to Paris Hilton.</p>
<p>If anyone knows where I can read about the ejabberd architecture / design, so I don&#8217;t have to piece it all together myself, please let me know.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/getting-to-know-ejabberd-and-writing-modules//feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>ssh hack: connect directly to machine via a firewall box</title>
		<link>http://www.metabrew.com/article/ssh-hack-connect-directly-to-machine-via-a-firewall-box/</link>
		<comments>http://www.metabrew.com/article/ssh-hack-connect-directly-to-machine-via-a-firewall-box/#comments</comments>
		<pubDate>Mon, 17 Nov 2008 17:44:44 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[hacks]]></category>
		<category><![CDATA[hack]]></category>
		<category><![CDATA[ssh]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=196</guid>
		<description><![CDATA[UPDATED 23/03/2009: added &#8220;-q0&#8221; option to clean up netcat after session terminates, and left another useful ssh tip in the comments. It&#8217;s common to have to ssh to a firewall / gateway machine, then ssh to the machine you want to work on within a server network. Typically you&#8217;d do this from your local machine: $ [...]]]></description>
			<content:encoded><![CDATA[<p><strong>UPDATED 23/03/2009:</strong> added &#8220;-q0&#8221; option to clean up netcat after session terminates, and left another useful ssh tip in the comments.</p>
<p>It&#8217;s common to have to ssh to a firewall / gateway machine, then ssh to the machine you want to work on within a server network.<br />
Typically you&#8217;d do this from your local machine:<br />
<code>$ ssh firewall.example.com<br />
Password:<br />
$ ssh my-private-host</code></p>
<p>I finally got bored of doing this, and created the following file, <strong><code>/usr/bin/sssh</code></strong></p>
<pre>#!/bin/bash
ssh -o ProxyCommand="ssh -q firewall.example.com nc -q0 %h %p" "$@"</pre>
<p>Now I can use the <code>sssh</code> command to connect to hosts using the firewall machine as a proxy. Like most good hacks, this uses netcat.</p>
<p>Eg:<br />
<code>$ sssh 10.1.2.3</code><br />
Will connect me directly to a machine on the server network, via the firewall box. Seeing as it passes all parameters to ssh (the <code>"$@"</code> bit, quoted so that arguments containing spaces survive intact) you can do port forwards and X-forwarding as usual too:</p>
<pre>$ sssh -L 5432:localhost:5432 my-vm</pre>
<p>This lets me tunnel the port for a PostgreSQL server running on my development vm (<code>my-vm</code>) in a single command. I have all my keys installed, so no passwords are needed &#8211; I estimate this will save me about 60 seconds every day.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/ssh-hack-connect-directly-to-machine-via-a-firewall-box//feed</wfw:commentRss>
		<slash:comments>9</slash:comments>
		</item>
		<item>
		<title>A Million-user Comet Application with Mochiweb, Part 3</title>
		<link>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3/</link>
		<comments>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3/#comments</comments>
		<pubDate>Tue, 04 Nov 2008 16:49:05 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[c]]></category>
		<category><![CDATA[cnode]]></category>
		<category><![CDATA[comet]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[libevent]]></category>
		<category><![CDATA[mochiweb]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=152</guid>
		<description><![CDATA[Part 1 and Part 2 in this series showed how to build a comet application using mochiweb, and how to route messages to connected users. We managed to squeeze application memory down to 8KB per connection. We did ye olde c10k test, and observed what happened with 10,000 connected users. We made graphs. It was [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a> and <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2/">Part 2</a> in this series showed how to build a comet application using mochiweb, and how to route messages to connected users. We managed to squeeze application memory down to 8KB per connection. We did ye olde c10k test, and observed what happened with 10,000 connected users. We made graphs. It was fun, but now it&#8217;s time to make good on the claims made in the title, and turn it up to 1 million connections.</p>
<p>This post covers the following:</p>
<ul>
<li>Add a pubsub-like subscription database using Mnesia</li>
<li>Generate a realistic friends dataset for a million users</li>
<li>Tune mnesia and bulk load in our friends data</li>
<li>Opening a million connections from one machine</li>
<li>Benchmark with 1 Million connected users</li>
<li>Libevent + C for connection handling</li>
<li>Final thoughts</li>
</ul>
<p>One of the challenging parts of this test was actually being able to open 1M connections from a single test machine. Writing a server to accept 1M connections is easier than actually creating 1M connections to test it with, so a fair amount of this article is about the techniques used to open 1M connections from a single machine.</p>
<h2>Getting our pubsub on</h2>
<p>In <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2/">Part 2</a> we used the router to send messages to specific users. This is fine for a chat/IM system, but there are sexier things we could do instead. Before we launch into a large-scale test, let&#8217;s add one more module &#8211; a subscription database. We want the application to store who your friends are, so it can push you all events generated by people on your friends list.</p>
<p>My intention is to use this for Last.fm so I can get a realtime feed of songs <a href="http://www.last.fm/user/RJ/friends">my friends</a> are currently listening to. It could equally apply to other events generated on social networks: Flickr photo uploads, Facebook newsfeed items, Twitter messages etc. FriendFeed even have a realtime API in beta, so this kind of thing is definitely topical. (Although I&#8217;ve not heard of anyone except Facebook using Erlang for this kind of thing.)</p>
<h2>Implementing the subscription-manager</h2>
<p>We&#8217;re implementing a general subscription manager, but we&#8217;ll be subscribing people to everyone on their friends list automatically &#8211; so you could also think of this as a friends database for now.</p>
<p>The subsmanager API:</p>
<ul>
<li>add_subscriptions([{Subscriber, Subscribee},...])</li>
<li>remove_subscriptions([{Subscriber, Subscribee},...])</li>
<li>get_subscribers(User)</li>
</ul>
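<p>As a rough illustration of what these calls boil down to, here is a minimal standalone sketch. The handlers in subsmanager.erl pass each list element straight to Mnesia, so the elements are <code>#subscription{}</code> records. The module name <code>subs_sketch</code>, the <code>demo/0</code> function, the <code>bag</code> table type and the RAM-only table are all assumptions made for this example; only the record definition and the <code>dirty_write</code> / <code>dirty_match_object</code> calls mirror the real module.</p>
<pre>-module(subs_sketch).
-export([demo/0]).

%% Same record shape as in subsmanager.erl.
-record(subscription, {subscriber, subscribee}).

demo() ->
    %% Assumption: a throwaway node with no disc schema, so mnesia
    %% starts with a RAM-only schema and the table lives in memory.
    ok = mnesia:start(),
    %% Assumption: a bag table, so one subscriber can have many subscribees.
    {atomic, ok} = mnesia:create_table(subscription,
        [{type, bag},
         {attributes, record_info(fields, subscription)}]),
    Subs = [#subscription{subscriber = alice, subscribee = rj},
            #subscription{subscriber = bob,   subscribee = rj},
            #subscription{subscriber = rj,    subscribee = alice}],
    %% Roughly what add_subscriptions/1 does: one dirty write per record.
    lists:foreach(fun mnesia:dirty_write/1, Subs),
    %% Roughly what get_subscribers/1 does: match on the subscribee
    %% field, then keep only the subscriber field of each hit.
    Pattern = #subscription{subscriber = '_', subscribee = rj},
    Matches = mnesia:dirty_match_object(Pattern),
    lists:sort(lists:map(fun(S) -> S#subscription.subscriber end, Matches)).</pre>
<p>Calling <code>subs_sketch:demo()</code> once on a fresh node should return <code>[alice,bob]</code>, the two users subscribed to <code>rj</code>. A second call on the same node would fail at <code>create_table</code>, because the table already exists by then.</p>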
<p>subsmanager.erl</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>subsmanager<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">behaviour</span><span class="br0">&#40;</span>gen_server<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-include_lib<span class="br0">&#40;</span><span class="st0">&quot;stdlib/include/qlc.hrl&quot;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>init/<span class="nu0">1</span>, handle_call/<span class="nu0">3</span>, handle_cast/<span class="nu0">2</span>, handle_info/<span class="nu0">2</span>, terminate/<span class="nu0">2</span>, code_change/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>add_subscriptions/<span class="nu0">1</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;remove_subscriptions/<span class="nu0">1</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;get_subscribers/<span class="nu0">1</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;first_run/<span class="nu0">0</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;stop/<span class="nu0">0</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="kw3">start_link</span>/<span class="nu0">0</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-record<span class="br0">&#40;</span>subscription, <span class="br0">&#123;</span>subscriber, subscribee<span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-record<span class="br0">&#40;</span>state, <span class="br0">&#123;</span><span class="br0">&#125;</span><span class="br0">&#41;</span>. <span class="co1">% state is all in mnesia</span></div>
</li>
<li class="li1">
<div class="de1">-define<span class="br0">&#40;</span><span class="re0">SERVER</span>, global:<span class="me2">whereis_name</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#123;</span>global, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, ?<span class="re0">MODULE</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">stop<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>stop<span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">add_subscriptions<span class="br0">&#40;</span><span class="re0">SubsList</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>add_subscriptions, <span class="re0">SubsList</span><span class="br0">&#125;</span>, infinity<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">remove_subscriptions<span class="br0">&#40;</span><span class="re0">SubsList</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>remove_subscriptions, <span class="re0">SubsList</span><span class="br0">&#125;</span>, infinity<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">get_subscribers<span class="br0">&#40;</span><span class="re0">User</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>get_subscribers, <span class="re0">User</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">%%</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">init<span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">ok</span> = mnesia:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Waiting on mnesia tables..<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; mnesia:<span class="me2">wait_for_tables</span><span class="br0">&#40;</span><span class="br0">&#91;</span>subscription<span class="br0">&#93;</span>, <span class="nu0">30000</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Info</span> = mnesia:<span class="me2">table_info</span><span class="br0">&#40;</span>subscription, all<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;OK. Subscription table info: <span class="es0">\n</span>~w<span class="es0">\n</span><span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Info</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, #state<span class="br0">&#123;</span><span class="br0">&#125;</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>stop<span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>stop, normal, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>add_subscriptions, <span class="re0">SubsList</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% Transactionally is slower:</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="co1">% F = fun() -&gt;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% &nbsp; &nbsp; &nbsp; &nbsp; [ ok = mnesia:write(S) || S &lt;- SubsList ]</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% &nbsp; &nbsp; end,</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% mnesia:transaction(F),</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#91;</span> mnesia:<span class="me2">dirty_write</span><span class="br0">&#40;</span><span class="re0">S</span><span class="br0">&#41;</span> || <span class="re0">S</span> &lt;- <span class="re0">SubsList</span> <span class="br0">&#93;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>remove_subscriptions, <span class="re0">SubsList</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">F</span> = fun<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> ok = mnesia:<span class="me2">delete_object</span><span class="br0">&#40;</span><span class="re0">S</span><span class="br0">&#41;</span> || <span class="re0">S</span> &lt;- <span class="re0">SubsList</span> <span class="br0">&#93;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; mnesia:<span class="me2">transaction</span><span class="br0">&#40;</span><span class="re0">F</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>get_subscribers, <span class="re0">User</span><span class="br0">&#125;</span>, <span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">F</span> = fun<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Subs</span> = mnesia:<span class="me2">dirty_match_object</span><span class="br0">&#40;</span>#subscription<span class="br0">&#123;</span>subscriber=<span class="st0">&#39;_&#39;</span>, subscribee=<span class="re0">User</span><span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Users</span> = <span class="br0">&#91;</span><span class="re0">Dude</span> || #subscription<span class="br0">&#123;</span>subscriber=<span class="re0">Dude</span>, subscribee=_<span class="br0">&#125;</span> &lt;- <span class="re0">Subs</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; gen_server:<span class="me2">reply</span><span class="br0">&#40;</span><span class="re0">From</span>, <span class="re0">Users</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; spawn<span class="br0">&#40;</span><span class="re0">F</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_cast<span class="br0">&#40;</span>_<span class="re0">Msg</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">handle_info<span class="br0">&#40;</span>_<span class="re0">Msg</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">terminate<span class="br0">&#40;</span>_<span class="re0">Reason</span>, _<span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">mnesia</span>:<span class="me2">stop</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">code_change<span class="br0">&#40;</span>_<span class="re0">OldVersion</span>, <span class="re0">State</span>, _<span class="re0">Extra</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Reloading code for ~w<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span>?<span class="re0">MODULE</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%%</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">first_run<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">mnesia</span>:<span class="me2">create_schema</span><span class="br0">&#40;</span><span class="br0">&#91;</span>node<span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ok = mnesia:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Ret</span> = mnesia:<span class="me2">create_table</span><span class="br0">&#40;</span>subscription,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#91;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>disc_copies, <span class="br0">&#91;</span>node<span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>attributes, record_info<span class="br0">&#40;</span>fields, subscription<span class="br0">&#41;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>index, <span class="br0">&#91;</span>subscribee<span class="br0">&#93;</span><span class="br0">&#125;</span>, <span class="co1">%index subscribee too</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;<span class="br0">&#123;</span>type, bag<span class="br0">&#125;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Ret</span>.</div>
</li>
</ol>
</div>
<p><br/><br />
Noteworthy points:</p>
<ul>
<li>I&#8217;ve included qlc.hrl, needed for mnesia queries using list comprehensions, via an absolute path. That can&#8217;t be best practice, but it wasn&#8217;t being found otherwise. The usual way to pull it in is <code>-include_lib("stdlib/include/qlc.hrl").</code></li>
<li><code>get_subscribers</code> spawns another process and delegates the job of replying to that process, using <code>gen_server:reply</code>. This means the gen_server loop won&#8217;t block on that call if we throw lots of lookups at it and mnesia slows down.</li>
<li><code>rr("subsmanager.erl").</code> in the example below allows you to use record definitions in the erl shell. Putting your record definitions into a <code>records.hrl</code> file and including that in your modules is considered better style. I inlined it for brevity.</li>
</ul>
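<p>To see the deferred-reply trick on its own, here is a minimal sketch. The module name <code>deferred_reply_demo</code>, the <code>slow_lookup</code> and <code>ping</code> calls, and the 500ms sleep are all made up for illustration; the sleep just stands in for a slow mnesia read.</p>

```erlang
-module(deferred_reply_demo).
-behaviour(gen_server).

-export([start_link/0, slow_lookup/1, ping/0]).
-export([init/1, handle_call/3, handle_cast/2, handle_info/2,
         terminate/2, code_change/3]).

start_link() ->
    gen_server:start_link({local, ?MODULE}, ?MODULE, [], []).

%% Hypothetical expensive call: the answer arrives about 500ms later.
slow_lookup(Key) ->
    gen_server:call(?MODULE, {slow_lookup, Key}).

%% Cheap call, used to show the server loop stays responsive.
ping() ->
    gen_server:call(?MODULE, ping).

init([]) ->
    {ok, nostate}.

handle_call({slow_lookup, Key}, From, State) ->
    %% Hand the work to a throwaway process, which answers the caller itself.
    spawn(fun() ->
                  timer:sleep(500),   % stand-in for a slow database read
                  gen_server:reply(From, {found, Key})
          end),
    %% No reply from the server loop; it moves straight on to the next message.
    {noreply, State};
handle_call(ping, _From, State) ->
    {reply, pong, State}.

handle_cast(_Msg, State) -> {noreply, State}.
handle_info(_Msg, State) -> {noreply, State}.
terminate(_Reason, _State) -> ok.
code_change(_OldVsn, State, _Extra) -> {ok, State}.
```

<p>If you kick off <code>slow_lookup(rj)</code> from one process and call <code>ping()</code> from another straight away, the ping should come back without waiting for the lookup. One thing to keep in mind: if the spawned process crashes before replying, the caller hears nothing and <code>gen_server:call</code> gives up after its default 5 second timeout.</p>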
<p>Now to test it. <code>first_run()</code> creates the mnesia schema, so it&#8217;s important to run that first. Another potential gotcha with mnesia is that (by default) the database can only be accessed by the node that created it, so give the erl shell a name, and stick with it.</p>
<p><code>$ mkdir /var/mnesia_data<br />
$ erl -boot start_sasl -mnesia dir '"/var/mnesia_data"' -sname subsman<br />
(subsman@localhost)1&gt; c(subsmanager).<br />
{ok,subsmanager}<br />
(subsman@localhost)2&gt; subsmanager:first_run().<br />
...<br />
{atomic,ok}<br />
(subsman@localhost)3&gt; subsmanager:start_link().<br />
Waiting on mnesia tables..<br />
OK. Subscription table info:<br />
[{access_mode,read_write},{active_replicas,[subsman@localhost]},{arity,3},{attributes,[subscriber,subscribee]},{checkpoints,[]},{commit_work,[{index,bag,[{3,{ram,57378}}]}]},{cookie,{{1224,800064,900003},subsman@localhost}},{cstruct,{cstruct,subscription,bag,[],[subsman@localhost],[],0,read_write,[3],[],false,subscription,[subscriber,subscribee],[],[],{{1224,863164,904753},subsman@localhost},{{2,0},[]}}},{disc_copies,[subsman@localhost]},{disc_only_copies,[]},{frag_properties,[]},{index,[3]},{load_by_force,false},{load_node,subsman@localhost},{load_order,0},{load_reason,{dumper,create_table}},{local_content,false},{master_nodes,[]},{memory,288},{ram_copies,[]},{record_name,subscription},{record_validation,{subscription,3,bag}},{type,bag},{size,0},{snmp,[]},{storage_type,disc_copies},{subscribers,[]},{user_properties,[]},{version,{{2,0},[]}},{where_to_commit,[{subsman@localhost,disc_copies}]},{where_to_read,subsman@localhost},{where_to_write,[subsman@localhost]},{wild_pattern,{subscription,'_','_'}},{{index,3},57378}]</code><br />
<code><br />
{ok,&lt;0.105.0&gt;}<br />
(subsman@localhost)4&gt; rr("subsmanager.erl").<br />
[state,subscription]<br />
(subsman@localhost)5&gt; subsmanager:add_subscriptions([ #subscription{subscriber=alice, subscribee=rj} ]).<br />
ok<br />
(subsman@localhost)6&gt; subsmanager:add_subscriptions([ #subscription{subscriber=bob, subscribee=rj} ]).<br />
ok<br />
(subsman@localhost)7&gt; subsmanager:get_subscribers(rj).<br />
[bob,alice]<br />
(subsman@localhost)8&gt; subsmanager:remove_subscriptions([ #subscription{subscriber=bob, subscribee=rj} ]).<br />
ok<br />
(subsman@localhost)9&gt; subsmanager:get_subscribers(rj).<br />
[alice]<br />
(subsman@localhost)10&gt; subsmanager:get_subscribers(charlie).<br />
[]</code></p>
<p>We&#8217;ll use integer Ids to represent users for the benchmark &#8211; but for this test I used atoms (rj, alice, bob) and assumed that alice and bob are both on rj&#8217;s friends list. It&#8217;s nice that mnesia (and ets/dets) doesn&#8217;t care what values you use &#8211; any Erlang term is valid. This means it&#8217;s a simple upgrade to support multiple types of resource. You could start using <code>{user, 123}</code> or <code>{photo, 789}</code> to represent different things people might subscribe to, without changing anything in the subsmanager module.</p>
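<p>As a rough illustration of that last point, here is a small sketch that mixes atoms and tagged tuples as ids. To keep it self-contained it uses a plain ets bag instead of the mnesia table (that swap is my assumption, as are the module name <code>tagged_subs_demo</code> and the sample ids); the pattern match is the same shape as the one in <code>get_subscribers</code>.</p>

```erlang
-module(tagged_subs_demo).
-export([run/0]).

%% Same record shape as in subsmanager; field values can be any Erlang term.
-record(subscription, {subscriber, subscribee}).

run() ->
    %% An ets bag stands in for the mnesia table, keyed on the subscriber field.
    T = ets:new(subs, [bag, {keypos, #subscription.subscriber}]),
    ets:insert(T, [#subscription{subscriber = {user, 123}, subscribee = {user, 456}},
                   #subscription{subscriber = {user, 123}, subscribee = {photo, 789}},
                   #subscription{subscriber = alice,       subscribee = {photo, 789}}]),
    %% Who is subscribed to photo 789?
    Pattern = #subscription{subscriber = '_', subscribee = {photo, 789}},
    Subs = ets:match_object(T, Pattern),
    %% Sort so the result does not depend on table ordering.
    Who = lists:sort([S || #subscription{subscriber = S} <- Subs]),
    ets:delete(T),
    Who.
```

<p>Calling <code>tagged_subs_demo:run()</code> returns <code>[alice,{user,123}]</code>: an atom and a tagged tuple sit side by side as subscribers, and nothing in the lookup had to know which kind of thing it was dealing with.</p>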
<h2>Modifying the router to use subscriptions</h2>
<p>Instead of addressing messages to specific users, i.e. <code>router:send(123, "Hello user 123")</code>, we&#8217;ll mark messages with a subject &#8211; that is, the person who generated the message (who played the song, who uploaded the photo etc) &#8211; and have the router deliver the message to every user who has subscribed to the subject user. In other words, the API will work like this: <code>router:send(123, "Hello everyone subscribed to user 123")</code></p>
<p>Updated router.erl:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>router<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">behaviour</span><span class="br0">&#40;</span>gen_server<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span><span class="kw3">start_link</span>/<span class="nu0">0</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>init/<span class="nu0">1</span>, handle_call/<span class="nu0">3</span>, handle_cast/<span class="nu0">2</span>, handle_info/<span class="nu0">2</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;terminate/<span class="nu0">2</span>, code_change/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>send/<span class="nu0">2</span>, login/<span class="nu0">2</span>, logout/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">-define<span class="br0">&#40;</span><span class="re0">SERVER</span>, global:<span class="me2">whereis_name</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% will hold bidirectional mapping between id &lt;--&gt; pid</span></div>
</li>
<li class="li1">
<div class="de1">-record<span class="br0">&#40;</span>state, <span class="br0">&#123;</span>pid2id, id2pid<span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#123;</span>global, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, ?<span class="re0">MODULE</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% sends Msg to anyone subscribed to Id</span></div>
</li>
<li class="li1">
<div class="de1">send<span class="br0">&#40;</span><span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>send, <span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">login<span class="br0">&#40;</span><span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>login, <span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">logout<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%%</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">init<span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% set this so we can catch death of logged in pids:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; process_flag<span class="br0">&#40;</span>trap_exit, true<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% use ets for routing tables</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, #state<span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; pid2id = ets:<span class="me2">new</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, <span class="br0">&#91;</span>bag<span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; id2pid = ets:<span class="me2">new</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, <span class="br0">&#91;</span>bag<span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>login, <span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">ets</span>:<span class="me2">insert</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="br0">&#123;</span><span class="re0">Pid</span>, <span class="re0">Id</span><span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ets:<span class="me2">insert</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="br0">&#123;</span><span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; link<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span>, <span class="co1">% tell us if they exit, so we can log them out</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">%io:format(&quot;~w logged in as ~w\n&quot;,[Pid, Id]),</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">unlink</span><span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">PidRows</span> = ets:<span class="me2">lookup</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="re0">Pid</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">PidRows</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#93;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">ok</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">IdRows</span> = <span class="br0">&#91;</span> <span class="br0">&#123;</span><span class="re0">I</span>,<span class="re0">P</span><span class="br0">&#125;</span> || <span class="br0">&#123;</span><span class="re0">P</span>,<span class="re0">I</span><span class="br0">&#125;</span> &lt;- <span class="re0">PidRows</span> <span class="br0">&#93;</span>, <span class="co1">% invert tuples</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ets:<span class="me2">delete</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="re0">Pid</span><span class="br0">&#41;</span>, &nbsp; <span class="co1">% delete all pid-&gt;id entries</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> ets:<span class="me2">delete_object</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="re0">Obj</span><span class="br0">&#41;</span> || <span class="re0">Obj</span> &lt;- <span class="re0">IdRows</span> <span class="br0">&#93;</span> <span class="co1">% and all id-&gt;pid</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">%io:format(&quot;pid ~w logged out\n&quot;,[Pid]),</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>send, <span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#125;</span>, <span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">F</span> = fun<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% get users who are subscribed to Id:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Users</span> = subsmanager:<span class="me2">get_subscribers</span><span class="br0">&#40;</span><span class="re0">Id</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Subscribers of ~w = ~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Id</span>, <span class="re0">Users</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% get pids of anyone logged in from Users list:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Pids0</span> = lists:<span class="me2">map</span><span class="br0">&#40;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; fun<span class="br0">&#40;</span><span class="re0">U</span><span class="br0">&#41;</span>-&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> <span class="re0">P</span> || <span class="br0">&#123;</span> _<span class="re0">I</span>, <span class="re0">P</span> <span class="br0">&#125;</span> &lt;- ets:<span class="me2">lookup</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="re0">U</span><span class="br0">&#41;</span> <span class="br0">&#93;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> <span class="re0">Id</span> | <span class="re0">Users</span> <span class="br0">&#93;</span> <span class="co1">% we are always subscribed to ourselves</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Pids</span> = lists:<span class="me2">flatten</span><span class="br0">&#40;</span><span class="re0">Pids0</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Pids: ~w<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span><span class="re0">Pids</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% send Msg to them all</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">M</span> = <span class="br0">&#123;</span>router_msg, <span class="re0">Msg</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> <span class="re0">Pid</span> ! <span class="re0">M</span> || <span class="re0">Pid</span> &lt;- <span class="re0">Pids</span> <span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% respond with how many users saw the message</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; gen_server:<span class="me2">reply</span><span class="br0">&#40;</span><span class="re0">From</span>, <span class="br0">&#123;</span>ok, length<span class="br0">&#40;</span><span class="re0">Pids</span><span class="br0">&#41;</span><span class="br0">&#125;</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; spawn<span class="br0">&#40;</span><span class="re0">F</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% handle death and cleanup of logged in processes</span></div>
</li>
<li class="li2">
<div class="de2">handle_info<span class="br0">&#40;</span><span class="re0">Info</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Info</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&#39;EXIT&#39;</span>, <span class="re0">Pid</span>, _<span class="re0">Why</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">handle_call</span><span class="br0">&#40;</span><span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span>, blah, <span class="re0">State</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Wtf</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Caught unhandled message: ~w<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span><span class="re0">Wtf</span><span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_cast<span class="br0">&#40;</span>_<span class="re0">Msg</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">terminate<span class="br0">&#40;</span>_<span class="re0">Reason</span>, _<span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">ok</span>.</div>
</li>
<li class="li1">
<div class="de1">code_change<span class="br0">&#40;</span>_<span class="re0">OldVsn</span>, <span class="re0">State</span>, _<span class="re0">Extra</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
</ol>
</div>
<p><br/></p>
<p>And here&#8217;s a quick test that doesn&#8217;t require mochiweb &#8211; I&#8217;ve used atoms instead of user ids, and omitted some output for clarity:</p>
<p><code>(subsman@localhost)1&gt; c(subsmanager), c(router), rr("subsmanager.erl").<br />
(subsman@localhost)2&gt; subsmanager:start_link().<br />
(subsman@localhost)3&gt; router:start_link().<br />
(subsman@localhost)4&gt; Subs = [#subscription{subscriber=alice, subscribee=rj}, #subscription{subscriber=bob, subscribee=rj}].<br />
[#subscription{subscriber = alice,subscribee = rj},<br />
#subscription{subscriber = bob,subscribee = rj}]<br />
(subsman@localhost)5&gt; subsmanager:add_subscriptions(Subs).<br />
ok<br />
(subsman@localhost)6&gt; router:send(rj, "RJ did something").<br />
Subscribers of rj = [bob,alice]<br />
Pids: []<br />
{ok,0}<br />
(subsman@localhost)7&gt; router:login(alice, self()).<br />
ok<br />
(subsman@localhost)8&gt; router:send(rj, "RJ did something").<br />
Subscribers of rj = [bob,alice]<br />
Pids: [&lt;0.46.0&gt;]<br />
{ok,1}<br />
(subsman@localhost)9&gt; receive {router_msg, M} -&gt; io:format("~s\n",[M]) end.<br />
RJ did something<br />
ok</code></p>
<p>This shows how alice can receive a message when the subject is someone she is subscribed to (rj), even though the message wasn&#8217;t sent directly to alice. The output shows that the router identified possible targets as <code>[bob,alice]</code> but only delivered the message to one person, alice, because bob was not logged in.</p>
<h2>Generating a typical social-network friends dataset</h2>
<p>We could generate lots of friend relationships at random, but that&#8217;s not particularly realistic. Friend counts in social networks tend to follow a power law distribution: there are a few super-popular users (<a href="http://twitter.com/barackobama">some Twitter users</a> have over 100,000 followers) and plenty of people with just a handful of friends. The Last.fm friends data is typical &#8211; it fits a <a href="http://en.wikipedia.org/wiki/Barab%C3%A1si-Albert_model">Barabási–Albert graph model</a>, so that&#8217;s what I&#8217;ll use.</p>
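<p>For a feel of why this kind of model produces a few hubs and a long tail, here is a simplified sketch of preferential attachment. It is not how igraph implements it, and the module name <code>ba_sketch</code> and its <code>edges/2</code> function are invented for this example. Each new vertex picks existing vertices with probability proportional to their current degree, so the popular get more popular.</p>

```erlang
-module(ba_sketch).
-export([edges/2]).

%% edges(N, M): grow a graph to N vertices. Each new vertex attaches to up to
%% M distinct existing vertices, chosen in proportion to their degree.
%% Returns a list of {NewVertex, ExistingVertex} pairs.
edges(N, M) when is_integer(N), N >= 2, is_integer(M), M >= 1 ->
    %% Seed graph: one edge between vertices 1 and 2.
    %% Ends lists every edge endpoint, so a vertex of degree D appears D times
    %% and a uniform pick from Ends is a degree-proportional pick.
    grow(3, N, M, [{2, 1}], [1, 2]).

grow(V, N, _M, Edges, _Ends) when V > N ->
    lists:reverse(Edges);
grow(V, N, M, Edges, Ends) ->
    Targets = pick(M, Ends, []),
    NewEdges = [{V, T} || T <- Targets],
    NewEnds = lists:append([[V, T] || T <- Targets]),
    grow(V + 1, N, M, NewEdges ++ Edges, NewEnds ++ Ends).

%% Draw M endpoints at random; duplicates are dropped, so a vertex can end up
%% with fewer than M edges.
pick(0, _Ends, Acc) ->
    lists:usort(Acc);
pick(K, Ends, Acc) ->
    T = lists:nth(rand:uniform(length(Ends)), Ends),
    pick(K - 1, Ends, [T | Acc]).
```

<p>Something like <code>ba_sketch:edges(1000, 15)</code> gives a small edge list to play with. Sampling from a plain list makes this quadratic, so it is only meant for toy sizes, nowhere near a million users.</p>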
<p>To generate the dataset I&#8217;m using the Python module from the excellent <a href="http://cneurocvs.rmki.kfki.hu/igraph/">igraph library</a>:</p>
<p>fakefriends.py:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="kw1">import</span> igraph</div>
</li>
<li class="li1">
<div class="de1">g = igraph.<span class="me1">Graph</span>.<span class="me1">Barabasi</span><span class="br0">&#40;</span><span class="nu0">1000000</span>, <span class="nu0">15</span>, directed=<span class="kw2">False</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw1">print</span><span class="br0">&#40;</span><span class="st0">&quot;Edges: &quot;</span> + <span class="kw2">str</span><span class="br0">&#40;</span>g.<span class="me1">ecount</span><span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#41;</span> + <span class="st0">&quot; Vertices: &quot;</span> + <span class="kw2">str</span><span class="br0">&#40;</span>g.<span class="me1">vcount</span><span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#41;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">g.<span class="me1">write_edgelist</span><span class="br0">&#40;</span><span class="st0">&quot;fakefriends.txt&quot;</span><span class="br0">&#41;</span></div>
</li>
</ol>
</div>
<p><br/><br />
This will generate fakefriends.txt with 2 user ids per line, space separated. These are the friend relationships we&#8217;ll load into our subsmanager. User ids range from 1 to a million.</p>
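<p>If you want to sanity-check the file before loading it, something like the following sketch will do. The helper name <code>friend_counts</code> is hypothetical, and I&#8217;m assuming each line holds exactly two integer ids and that every edge counts as a friendship in both directions (the graph was generated with <code>directed=False</code>).</p>

```python
from collections import Counter

def friend_counts(lines):
    """Count friends per user id from 'a b' edge-list lines (undirected)."""
    counts = Counter()
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        a, b = int(parts[0]), int(parts[1])
        counts[a] += 1
        counts[b] += 1
    return counts

if __name__ == "__main__":
    # Assumes fakefriends.txt is in the current directory.
    with open("fakefriends.txt") as f:
        counts = friend_counts(f)
    print("users:", len(counts))
    print("most friends:", counts.most_common(3))
```

<p>A handful of users with very large counts and a long tail of users with around 15 friends is what you would expect from this model.</p>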
<h2>Bulk loading friends data into mnesia</h2>
<p>This small module will read the fakefriends.txt file and create a list of subscription records.</p>
<p>readfriends.erl &#8211; to read the fakefriends.txt and create subscription records:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>readfriends<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>load/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-record<span class="br0">&#40;</span>subscription, <span class="br0">&#123;</span>subscriber, subscribee<span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">load<span class="br0">&#40;</span><span class="re0">Filename</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">for_each_line_in_file</span><span class="br0">&#40;</span><span class="re0">Filename</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; fun<span class="br0">&#40;</span><span class="re0">Line</span>, <span class="re0">Acc</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="re0">As</span>, <span class="re0">Bs</span><span class="br0">&#93;</span> = string:<span class="me2">tokens</span><span class="br0">&#40;</span>string:<span class="me2">strip</span><span class="br0">&#40;</span><span class="re0">Line</span>, right, $\n<span class="br0">&#41;</span>, <span class="st0">&quot; &quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">A</span>, _<span class="br0">&#125;</span> = string:<span class="me2">to_integer</span><span class="br0">&#40;</span><span class="re0">As</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">B</span>, _<span class="br0">&#125;</span> = string:<span class="me2">to_integer</span><span class="br0">&#40;</span><span class="re0">Bs</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> #subscription<span class="br0">&#123;</span>subscriber=<span class="re0">A</span>, subscribee=<span class="re0">B</span><span class="br0">&#125;</span> | <span class="re0">Acc</span> <span class="br0">&#93;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>, <span class="br0">&#91;</span>read<span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% via: http://www.trapexit.org/Reading_Lines_from_a_File</span></div>
</li>
<li class="li2">
<div class="de2">for_each_line_in_file<span class="br0">&#40;</span><span class="re0">Name</span>, <span class="re0">Proc</span>, <span class="re0">Mode</span>, <span class="re0">Accum0</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">Device</span><span class="br0">&#125;</span> = file:<span class="me2">open</span><span class="br0">&#40;</span><span class="re0">Name</span>, <span class="re0">Mode</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">Accum0</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">Accum</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">case</span> io:<span class="me2">get_line</span><span class="br0">&#40;</span><span class="re0">Device</span>, <span class="st0">&quot;&quot;</span><span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; eof &nbsp;-&gt; <span class="me1">file</span>:<span class="kw3">close</span><span class="br0">&#40;</span><span class="re0">Device</span><span class="br0">&#41;</span>, <span class="re0">Accum</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Line</span> -&gt; <span class="re0">NewAccum</span> = <span class="re0">Proc</span><span class="br0">&#40;</span><span class="re0">Line</span>, <span class="re0">Accum</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">NewAccum</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
</ol>
</div>
<p><br/><br />
Now in the subsmanager shell, you can read from the text file and add the subscriptions:</p>
<p><code>$ erl -name router@minifeeds4.gs2 +K true +A 128 -setcookie secretcookie -mnesia dump_log_write_threshold 50000 -mnesia dc_dump_limit 40<br />
erl&gt; c(readfriends), c(subsmanager).<br />
erl&gt; subsmanager:first_run().<br />
erl&gt; subsmanager:start_link().<br />
erl&gt; subsmanager:add_subscriptions( readfriends:load("fakefriends.txt") ).</code></p>
<p>Note the additional mnesia parameters &#8211; these are to avoid the <strong>** WARNING ** Mnesia is overloaded</strong> messages you would (probably) otherwise see. Refer to my previous post: <a href="http://www.metabrew.com/article/on-bulk-loading-data-into-mnesia/">On bulk loading data into Mnesia</a> for alternative ways to load in lots of data. The best solution seems to be (as pointed out in the comments, thanks Jacob!) to set those options. The <a href="http://www.erlang.org/doc/apps/mnesia/">Mnesia reference manual</a> contains many other settings under Configuration Parameters, and is worth a look.</p>
<h2>Turning it up to 1 Million</h2>
<p>Creating a million tcp connections from one host is non-trivial. I&#8217;ve a feeling that people who do this regularly have small clusters dedicated to simulating lots of client connections, probably running a real tool like Tsung. Even with the tuning from <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a> to increase kernel tcp memory, increase the file descriptor ulimits and set the local port range to the maximum, we will still hit a hard limit on ephemeral ports. When making a tcp connection, the client end is allocated (or you can specify) a port from the range in <code>/proc/sys/net/ipv4/ip_local_port_range</code>. It doesn&#8217;t matter if you specify it manually, or use an ephemeral port, you&#8217;re still going to run out. In Part 1, we set the range to &#8220;1024 65535&#8221; &#8211; meaning there are 65535-1024 = 64511 unprivileged ports available. Some of them will be used by other processes, but we&#8217;ll never get over 64511 client connections, because we&#8217;ll run out of ports.</p>
<p>The local port range is assigned per-IP, so if we make our outgoing connections specifically from a range of different local IP addresses, we&#8217;ll be able to open more than 64511 outgoing connections in total.</p>
<p>So let&#8217;s bring up 17 new IP addresses, with the intention of making 62,000 connections from each &#8211; giving us a total of 1,054,000 connections. Safely over the 1 million mark:</p>
<p><code>$ for i in `seq 1 17`; do echo sudo ifconfig eth0:$i 10.0.0.$i up ; done</code></p>
<p>If you run <code>ifconfig</code> now you should see your virtual interfaces: eth0:1, eth0:2 &#8230; eth0:17, each with a different IP address. Obviously you should choose a sensible part of whatever address space you are using.</p>
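<p>The arithmetic behind those 17 addresses is easy to check. The sketch below recomputes it and builds one contiguous block of client ids per local IP; the variable names and the <code>id_ranges</code> helper are just for illustration, and the <code>10.0.0.</code> prefix is the same example address space used above.</p>

```python
import math

PORT_LOW, PORT_HIGH = 1024, 65535   # the ip_local_port_range set in Part 1
PER_IP = 62000                      # connections planned per local IP
TARGET = 1000000                    # total connections wanted

ports_per_ip = PORT_HIGH - PORT_LOW        # 64511, the per-IP ceiling
ips_needed = math.ceil(TARGET / PER_IP)    # smallest number of IPs that reaches the target

def id_ranges(num_ips, per_ip, prefix="10.0.0."):
    """Yield (ip, first_id, last_id): one block of client ids per local IP."""
    for i in range(1, num_ips + 1):
        yield (prefix + str(i), (i - 1) * per_ip + 1, i * per_ip)

config = list(id_ranges(ips_needed, PER_IP))
total = ips_needed * PER_IP

if __name__ == "__main__":
    print("ports per IP:", ports_per_ip)
    print("IPs needed:", ips_needed, "total connections:", total)
    print("first block:", config[0], "last block:", config[-1])
```

<p>Keeping 62,000 per address rather than the full 64511 leaves some headroom for ports already in use by other processes.</p>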
<p>All that remains now is to modify the <code>floodtest</code> tool from Part 1 to specify the local IP it should connect from&#8230; Unfortunately the <a href="http://www.erlang.org/doc/man/http.html">erlang http client</a> doesn&#8217;t let you specify the source IP. Neither does ibrowse, the alternative http client library. Damn.</p>
<p><i>&lt;crazy idea&gt;</i><br />
At this point I considered another option: bringing up 17 pairs of IPs &#8211; one on the server and one on the client &#8211; each pair in their own isolated /30 subnet. I think that if I then made the client connect to any given server IP, it would force the local address to be the other half of the pair on that subnet, because only one of the local IPs would actually be able to reach the server IP. In theory, this would mean declaring the local source IP on the client machine would not be necessary (although the range of server IPs would need to be specified). I don&#8217;t know if this would really work &#8211; it sounded plausible at the time. In the end I decided it was too perverted and didn&#8217;t try it.<br />
<i>&lt;/crazy idea&gt;</i><br/></p>
<p>I also poked around in OTP&#8217;s <code>http_transport</code> code and considered adding support for specifying the local IP. It&#8217;s not really a feature you usually need in an HTTP client though, and it would certainly have been more work.</p>
<p><code>gen_tcp</code> lets you specify the source address, so I ended up writing a rather crude client using <code>gen_tcp</code> specifically for this test:</p>
<p>floodtest2.erl</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>floodtest2<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-compile<span class="br0">&#40;</span>export_all<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-define<span class="br0">&#40;</span><span class="re0">SERVERADDR</span>, <span class="st0">&quot;10.1.2.3&quot;</span><span class="br0">&#41;</span>. <span class="co1">% where mochiweb is running</span></div>
</li>
<li class="li1">
<div class="de1">-define<span class="br0">&#40;</span><span class="re0">SERVERPORT</span>, <span class="nu0">8000</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% Generate the config in bash like so (choose some available address space):</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% EACH=62000; for i in `seq 1 17`; do echo &quot;{{10,0,0,$i}, $((($i-1)*$EACH+1)), $(($i*$EACH))}, &quot;; done</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">run<span class="br0">&#40;</span><span class="re0">Interval</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Config</span> = <span class="br0">&#91;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">1</span><span class="br0">&#125;</span>, <span class="nu0">1</span>, <span class="nu0">62000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">2</span><span class="br0">&#125;</span>, <span class="nu0">62001</span>, <span class="nu0">124000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">3</span><span class="br0">&#125;</span>, <span class="nu0">124001</span>, <span class="nu0">186000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">4</span><span class="br0">&#125;</span>, <span class="nu0">186001</span>, <span class="nu0">248000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">5</span><span class="br0">&#125;</span>, <span class="nu0">248001</span>, <span class="nu0">310000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">6</span><span class="br0">&#125;</span>, <span class="nu0">310001</span>, <span class="nu0">372000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">7</span><span class="br0">&#125;</span>, <span class="nu0">372001</span>, <span class="nu0">434000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">8</span><span class="br0">&#125;</span>, <span class="nu0">434001</span>, <span class="nu0">496000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">9</span><span class="br0">&#125;</span>, <span class="nu0">496001</span>, <span class="nu0">558000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">10</span><span class="br0">&#125;</span>, <span class="nu0">558001</span>, <span class="nu0">620000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">11</span><span class="br0">&#125;</span>, <span class="nu0">620001</span>, <span class="nu0">682000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">12</span><span class="br0">&#125;</span>, <span class="nu0">682001</span>, <span class="nu0">744000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">13</span><span class="br0">&#125;</span>, <span class="nu0">744001</span>, <span class="nu0">806000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">14</span><span class="br0">&#125;</span>, <span class="nu0">806001</span>, <span class="nu0">868000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li2">
<div class="de2"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">15</span><span class="br0">&#125;</span>, <span class="nu0">868001</span>, <span class="nu0">930000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">16</span><span class="br0">&#125;</span>, <span class="nu0">930001</span>, <span class="nu0">992000</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span><span class="br0">&#123;</span><span class="nu0">10</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">17</span><span class="br0">&#125;</span>, <span class="nu0">992001</span>, <span class="nu0">1054000</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; start<span class="br0">&#40;</span><span class="re0">Config</span>, <span class="re0">Interval</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">start<span class="br0">&#40;</span><span class="re0">Config</span>, <span class="re0">Interval</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Monitor</span> = monitor<span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">AdjustedInterval</span> = <span class="re0">Interval</span> div length<span class="br0">&#40;</span><span class="re0">Config</span><span class="br0">&#41;</span>, <span class="co1">% integer division: a receive..after timeout must be an integer</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> spawn<span class="br0">&#40;</span>?<span class="re0">MODULE</span>, start, <span class="br0">&#91;</span><span class="re0">Lower</span>, <span class="re0">Upper</span>, <span class="re0">Ip</span>, <span class="re0">AdjustedInterval</span>, <span class="re0">Monitor</span><span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; || <span class="br0">&#123;</span><span class="re0">Ip</span>, <span class="re0">Lower</span>, <span class="re0">Upper</span><span class="br0">&#125;</span> &nbsp;&lt;- <span class="re0">Config</span> <span class="br0">&#93;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">LowerID</span>, <span class="re0">UpperID</span>, _, _, _<span class="br0">&#41;</span> when <span class="re0">LowerID</span> &gt; <span class="re0">UpperID</span> -&gt; <span class="me1">done</span>;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">LowerID</span>, <span class="re0">UpperID</span>, <span class="re0">LocalIP</span>, <span class="re0">Interval</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">spawn</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, connect, <span class="br0">&#91;</span>?<span class="re0">SERVERADDR</span>, ?<span class="re0">SERVERPORT</span>, <span class="re0">LocalIP</span>, <span class="st0">&quot;/test/&quot;</span> ++ integer_to_list<span class="br0">&#40;</span><span class="re0">LowerID</span><span class="br0">&#41;</span>, <span class="re0">Monitor</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">receive</span> <span class="kw1">after</span> <span class="re0">Interval</span> -&gt; <span class="me1">start</span><span class="br0">&#40;</span><span class="re0">LowerID</span> + <span class="nu0">1</span>, <span class="re0">UpperID</span>, <span class="re0">LocalIP</span>, <span class="re0">Interval</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span> <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">connect<span class="br0">&#40;</span><span class="re0">ServerAddr</span>, <span class="re0">ServerPort</span>, <span class="re0">ClientIP</span>, <span class="re0">Path</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Opts</span> = <span class="br0">&#91;</span>binary, <span class="br0">&#123;</span>packet, <span class="nu0">0</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>ip, <span class="re0">ClientIP</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>reuseaddr, true<span class="br0">&#125;</span>, <span class="br0">&#123;</span>active, false<span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">Sock</span><span class="br0">&#125;</span> = gen_tcp:<span class="me2">connect</span><span class="br0">&#40;</span><span class="re0">ServerAddr</span>, <span class="re0">ServerPort</span>, <span class="re0">Opts</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Monitor</span> ! open,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">ReqL</span> = io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;GET ~s<span class="es0">\r</span><span class="es0">\n</span>Host: ~s<span class="es0">\r</span><span class="es0">\n</span><span class="es0">\r</span><span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span><span class="re0">Path</span>, <span class="re0">ServerAddr</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span> = list_to_binary<span class="br0">&#40;</span><span class="re0">ReqL</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; ok = gen_tcp:<span class="me2">send</span><span class="br0">&#40;</span><span class="re0">Sock</span>, <span class="br0">&#91;</span><span class="re0">Req</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; do_recv<span class="br0">&#40;</span><span class="re0">Sock</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#40;</span>catch gen_tcp:<span class="kw3">close</span><span class="br0">&#40;</span><span class="re0">Sock</span><span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; ok.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">do_recv<span class="br0">&#40;</span><span class="re0">Sock</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span>-&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> gen_tcp:<span class="me2">recv</span><span class="br0">&#40;</span><span class="re0">Sock</span>, <span class="nu0">0</span><span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">B</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Monitor</span> ! <span class="br0">&#123;</span>bytes, size<span class="br0">&#40;</span><span class="re0">B</span><span class="br0">&#41;</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Recvd ~s<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span> binary_to_list<span class="br0">&#40;</span><span class="re0">B</span><span class="br0">&#41;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Recvd ~w bytes<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span>size<span class="br0">&#40;</span><span class="re0">B</span><span class="br0">&#41;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; do_recv<span class="br0">&#40;</span><span class="re0">Sock</span>, <span class="re0">Monitor</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>error, closed<span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Monitor</span> ! closed,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; closed;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Other</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Monitor</span> ! closed,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Other:~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Other</span><span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% Monitor process receives stats and reports how much data we received etc:</span></div>
</li>
<li class="li1">
<div class="de1">monitor<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Pid</span> = spawn<span class="br0">&#40;</span>?<span class="re0">MODULE</span>, monitor0, <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">0</span><span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; timer:<span class="me2">send_interval</span><span class="br0">&#40;</span><span class="nu0">10000</span>, <span class="re0">Pid</span>, report<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Pid</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">monitor0<span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Open</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span>, <span class="re0">Bytes</span><span class="br0">&#125;</span>=<span class="re0">S</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; report &nbsp;-&gt; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;{Open, Closed, Chunks, Bytes} = ~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">S</span><span class="br0">&#93;</span><span class="br0">&#41;</span>, monitor0<span class="br0">&#40;</span><span class="re0">S</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; open &nbsp; &nbsp;-&gt; <span class="me1">monitor0</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Open</span> + <span class="nu0">1</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span>, <span class="re0">Bytes</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; closed &nbsp;-&gt; <span class="me1">monitor0</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Open</span>, <span class="re0">Closed</span> + <span class="nu0">1</span>, <span class="re0">Chunks</span>, <span class="re0">Bytes</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; chunk &nbsp; -&gt; <span class="me1">monitor0</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Open</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span> + <span class="nu0">1</span>, <span class="re0">Bytes</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>bytes, <span class="re0">B</span><span class="br0">&#125;</span> -&gt; <span class="me1">monitor0</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Open</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span>, <span class="re0">Bytes</span> + <span class="re0">B</span><span class="br0">&#125;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
</ol>
</div>
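<p>The important part of the client above is the <code>{ip, ClientIP}</code> option: the socket is bound to a specific local address before it connects. For comparison, here is a minimal Python sketch of the same idea using the standard <code>socket</code> module. The function name <code>connect_from</code> is hypothetical, and the addresses in the usage example are the placeholder ones from this post.</p>

```python
import socket

def connect_from(local_ip, server_addr, server_port):
    """Open a TCP connection whose source address is local_ip."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    # Roughly what {reuseaddr, true} asks for in the Erlang options.
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    # Port 0 lets the kernel pick a free ephemeral port on that address.
    sock.bind((local_ip, 0))
    sock.connect((server_addr, server_port))
    return sock

if __name__ == "__main__":
    # Assumes 10.0.0.1 is configured locally and mochiweb listens on 10.1.2.3:8000.
    s = connect_from("10.0.0.1", "10.1.2.3", 8000)
    print("connected from", s.getsockname())
    s.close()
```

<p>Because each local address gets its own ephemeral port space for a given destination, repeating this across the 17 addresses is what should lift the total past the single-IP port limit.</p>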
<p><br/><br />
As an initial test I was connecting to the mochiweb app from Part 1 &#8211; it simply sends one message to every client every 10 seconds.</p>
<p><code>erl&gt; c(floodtest2), floodtest2:run(20).</code></p>
<p><strong>This quickly ate all my memory.</strong></p>
<p>It turns out that opening lots of connections with gen_tcp like that eats a lot of RAM. I think it&#8217;d need ~36GB to make it work without any additional tuning. I&#8217;m not interested in trying to optimise my quick-hack Erlang HTTP client (in the real world, this would be 1M actual web browsers). The only machine I could get my hands on that has more than 32GB of RAM is one of our production databases, and I can&#8217;t find a good excuse to take Last.fm offline whilst I test this :) Additionally, it seems like it still only managed to open around 64,500 ports. Hmm.</p>
<p>At this point I decided to break out the trusty <a href="http://monkey.org/~provos/libevent/">libevent</a>, which I was pleased to discover has an HTTP API. Newer versions also have an <code>evhttp_connection_set_local_address</code> function in the HTTP API. This sounds promising.</p>
<p>Here&#8217;s the HTTP client in C using libevent:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/types.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/time.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/queue.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;stdlib.h&gt;</span></div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#include &lt;err.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;event.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;evhttp.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;unistd.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;stdio.h&gt;</span></div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#include &lt;sys/socket.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;netinet/in.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;time.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;pthread.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#define BUFSIZE 4096</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define NUMCONNS 62000</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define SERVERADDR &quot;10.103.1.43&quot;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define SERVERPORT 8000</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define SLEEP_MS 10</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">char</span> buf<span class="br0">&#91;</span>BUFSIZE<span class="br0">&#93;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">int</span> bytes_recvd = <span class="nu0">0</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">int</span> chunks_recvd = <span class="nu0">0</span>;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw4">int</span> closed = <span class="nu0">0</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">int</span> connected = <span class="nu0">0</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// called per chunk received</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">void</span> chunkcb<span class="br0">&#40;</span><span class="kw4">struct</span> evhttp_request * req, <span class="kw4">void</span> * arg<span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> s = evbuffer_remove<span class="br0">&#40;</span> req-&gt;input_buffer, &amp;buf, BUFSIZE <span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">//printf(&quot;Read %d bytes: %s\n&quot;, s, &amp;buf);</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; bytes_recvd += s;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; chunks_recvd++;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>connected &gt;= NUMCONNS &amp;&amp; chunks_recvd%<span class="nu0">10000</span>==<span class="nu0">0</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.opengroup.org/onlinepubs/009695399/functions/printf.html"><span class="kw3">printf</span></a><span class="br0">&#40;</span><span class="st0">&quot;&gt;Chunks: %d<span class="es0">\t</span>Bytes: %d<span class="es0">\t</span>Closed: %d<span class="es0">\n</span>&quot;</span>, chunks_recvd, bytes_recvd, closed<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// gets called when request completes</span></div>
</li>
<li class="li2">
<div class="de2"><span class="kw4">void</span> reqcb<span class="br0">&#40;</span><span class="kw4">struct</span> evhttp_request * req, <span class="kw4">void</span> * arg<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; closed++;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw4">int</span> main<span class="br0">&#40;</span><span class="kw4">int</span> argc, <span class="kw4">char</span> **argv<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; event_init<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">struct</span> evhttp_connection *evhttp_connection;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">struct</span> evhttp_request *evhttp_request;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw4">char</span> addr<span class="br0">&#91;</span><span class="nu0">16</span><span class="br0">&#93;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">char</span> path<span class="br0">&#91;</span><span class="nu0">32</span><span class="br0">&#93;</span>; <span class="co1">// eg: &quot;/test/123&quot;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> i,octet;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">for</span><span class="br0">&#40;</span>octet=<span class="nu0">1</span>; octet&lt;=<span class="nu0">17</span>; octet++<span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; sprintf<span class="br0">&#40;</span>addr, <span class="st0">&quot;10.224.0.%d&quot;</span>, octet<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">for</span><span class="br0">&#40;</span>i=<span class="nu0">1</span>;i&lt;=NUMCONNS;i++<span class="br0">&#41;</span> <span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_connection = evhttp_connection_new<span class="br0">&#40;</span>SERVERADDR, SERVERPORT<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_connection_set_local_address<span class="br0">&#40;</span>evhttp_connection, addr<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_connection_set_timeout<span class="br0">&#40;</span>evhttp_connection, <span class="nu0">864000</span><span class="br0">&#41;</span>; <span class="co1">// 10 day timeout</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_request = evhttp_request_new<span class="br0">&#40;</span>reqcb, <span class="kw2">NULL</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_request-&gt;chunk_cb = chunkcb;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; sprintf<span class="br0">&#40;</span>path, <span class="st0">&quot;/test/%d&quot;</span>, ++connected<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>i%<span class="nu0">100</span>==<span class="nu0">0</span><span class="br0">&#41;</span> &nbsp;<a href="http://www.opengroup.org/onlinepubs/009695399/functions/printf.html"><span class="kw3">printf</span></a><span class="br0">&#40;</span><span class="st0">&quot;Req: %s<span class="es0">\t</span>-&gt;<span class="es0">\t</span>%s<span class="es0">\n</span>&quot;</span>, addr, path<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_make_request<span class="br0">&#40;</span> evhttp_connection, evhttp_request, EVHTTP_REQ_GET, path <span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_connection_set_timeout<span class="br0">&#40;</span>evhttp_request-&gt;evcon, <span class="nu0">864000</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; event_loop<span class="br0">&#40;</span> EVLOOP_NONBLOCK <span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span> connected % <span class="nu0">200</span> == <span class="nu0">0</span> <span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <a href="http://www.opengroup.org/onlinepubs/009695399/functions/printf.html"><span class="kw3">printf</span></a><span class="br0">&#40;</span><span class="st0">&quot;<span class="es0">\n</span>Chunks: %d<span class="es0">\t</span>Bytes: %d<span class="es0">\t</span>Closed: %d<span class="es0">\n</span>&quot;</span>, chunks_recvd, bytes_recvd, closed<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; usleep<span class="br0">&#40;</span>SLEEP_MS*<span class="nu0">1000</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; event_dispatch<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">return</span> <span class="nu0">0</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
</ol>
</div>
<p><br/><br />
Most parameters are hardcoded as #define&#8217;s so you configure it by editing the source and recompiling.</p>
<p>Compile and run:<br />
<code>$ gcc -o httpclient httpclient.c -levent<br />
$ ./httpclient</code></p>
<p><strong>This still failed to open more than 64,500 ports</strong>, although it used less RAM doing so.</p>
<p>It turns out that although I was specifying the local addresses, the ephemeral port allocation somewhere in the kernel or TCP stack didn&#8217;t care, and still ran out after 2^16 ports. So in order to open more than 64,500 connections, you need to specify both the local address and the local port yourself, and manage them accordingly.</p>
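<p>For reference, pinning both the local address and the local port with plain BSD sockets comes down to calling <code>bind()</code> before <code>connect()</code>. The following is a minimal sketch of that idea, not the libevent code used in this test. The helper names <code>make_addr</code> and <code>connect_from</code> are made up for illustration.</p>

```c
#include <arpa/inet.h>
#include <assert.h>
#include <netinet/in.h>
#include <string.h>
#include <sys/socket.h>
#include <unistd.h>

/* Fill `sa` with an IPv4 address and port. Returns 0 on success, or -1 if
 * `ip` is not a valid dotted-quad string. (Hypothetical helper.) */
int make_addr(const char *ip, unsigned short port, struct sockaddr_in *sa)
{
    assert(ip != NULL && sa != NULL);
    memset(sa, 0, sizeof *sa);
    sa->sin_family = AF_INET;
    sa->sin_port = htons(port);
    return inet_pton(AF_INET, ip, &sa->sin_addr) == 1 ? 0 : -1;
}

/* Open a TCP connection whose source is pinned to local_ip:local_port.
 * Returns the connected fd, or -1 on any error. (Hypothetical helper.) */
int connect_from(const char *local_ip, unsigned short local_port,
                 const char *remote_ip, unsigned short remote_port)
{
    struct sockaddr_in local, remote;
    if (make_addr(local_ip, local_port, &local) != 0 ||
        make_addr(remote_ip, remote_port, &remote) != 0)
        return -1;

    int fd = socket(AF_INET, SOCK_STREAM, 0);
    if (fd < 0)
        return -1;

    /* bind() before connect() replaces the kernel's ephemeral port choice
     * with the address and port we picked ourselves. */
    if (bind(fd, (struct sockaddr *)&local, sizeof local) != 0 ||
        connect(fd, (struct sockaddr *)&remote, sizeof remote) != 0) {
        close(fd);
        return -1;
    }
    return fd;
}
```

<p>Something like <code>connect_from("10.224.0.1", 1025, "10.103.1.43", 8000)</code> would then open one connection from the first virtual IP, assuming that address is configured on the box.</p>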
<p>Unfortunately the libevent HTTP API doesn&#8217;t have an option to specify the local port. I <a href="http://monkeymail.org/archives/libevent-users/2008-November/001415.html">patched libevent</a> to add a suitable function:<br />
<code>void evhttp_connection_set_local_port(struct evhttp_connection *evcon, u_short port);</code>. </p>
<p>This was a surprisingly pleasant experience; libevent seems well written, and the documentation is pretty decent too.</p>
<p>With my modified libevent installed, I was able to add the following under the set_local_address line in the above code:<br />
<code>evhttp_connection_set_local_port(evhttp_connection, 1024+i);</code></p>
<p>With that in place, connections from different local addresses were able to use the same local port number, because each port is now tied to a specific local address. I recompiled the client and let it run for a bit to confirm it would break the 2^16 barrier.</p>
<p><span title="Pun intended">Netstat confirms it</span>:<br />
<code># netstat -n | awk '/^tcp/ {t[$NF]++}END{for(state in t){print state, t[state]}}'<br />
TIME_WAIT 8<br />
ESTABLISHED 118222</code></p>
<p>This shows how many ports are open in various states. We&#8217;re finally able to open more than 2^16 connections, phew.</p>
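<p>The addressing scheme used by the client (17 local addresses, with local ports <code>1024+i</code> for <code>i</code> from 1 to <code>NUMCONNS</code>) can be written down as a small mapping from connection number to source address and port. This is only a sketch of the arithmetic; <code>source_for</code> and <code>source_capacity</code> are illustrative names, not part of the client.</p>

```c
#include <assert.h>

#define NUM_ADDRS      17     /* 10.224.0.1 .. 10.224.0.17, as in the client */
#define PORTS_PER_ADDR 62000  /* NUMCONNS in the client */
#define FIRST_PORT     1025   /* 1024 + i, with i starting at 1 */

struct source { int octet; int port; };

/* Map a zero-based connection index to the (last octet, local port) pair
 * that the client's nested loops reach at that point. */
struct source source_for(long n)
{
    assert(n >= 0 && n < (long)NUM_ADDRS * PORTS_PER_ADDR);
    struct source s;
    s.octet = (int)(n / PORTS_PER_ADDR) + 1;
    s.port  = (int)(n % PORTS_PER_ADDR) + FIRST_PORT;
    return s;
}

/* Total number of distinct (address, port) sources available. */
long source_capacity(void)
{
    return (long)NUM_ADDRS * PORTS_PER_ADDR;
}
```

<p>With these constants the capacity works out to 1,054,000 sources, which is where the headroom for a million connections comes from.</p>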
<p>Now we have a tool capable of opening a million http connections from a single box. It seems to consume around 2KB per connection, plus whatever the kernel needs. It&#8217;s time to use it for the &#8220;million connected user&#8221; test against our mochiweb comet server.</p>
<h2>C1024K Test &#8211; 1 million comet connections</h2>
<p>For this test I used 4 different servers of varying specs. These specs may be overpowered for the experiment, but they were available and waiting to go into production, and this made a good burn-in test. All four servers are on the same gigabit LAN, with up to 3 switches and a router in the middle somewhere. </p>
<p>The 1 million test I ran is similar to the 10k test from parts 1 and 2. The main differences are the modified client, now written in C using libevent, and that I&#8217;m running in a proper distributed-Erlang setup with more than one machine.</p>
<p>On server 1 &#8211; Quad-core 2GHz CPU, 16GB of RAM</p>
<ul>
<li>Start subsmanager</li>
<li>Load in the friends data</li>
<li>Start the router</li>
</ul>
<p>On server 2 &#8211; Dual Quad-core 2.8GHz CPU, 32GB of RAM</p>
<ul>
<li>Start mochiweb app</li>
</ul>
<p>On server 3 &#8211; Quad-core 2GHz CPU, 16GB of RAM</p>
<ul>
<li>Create 17 virtual IPs as above</li>
<li>Install patched libevent</li>
<li>Run client: <code>./httpclient</code> to create 100 connections per second, up to 1M</li>
</ul>
<p>On server 4 &#8211; Dual-core 2GHz, 2GB RAM</p>
<ul>
<li>Run msggen program, to send lots of messages to the router</li>
</ul>
<p>I measured the memory usage of mochiweb during the ramp-up to a million connections, and for the rest of the day:</p>
<p><a href="http://www.metabrew.com/wp-content/uploads/2008/11/mochimem-c1000k.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/11/mochimem-c1000k.png" alt="" title="Mochiweb memory, 1M connections" width="500" height="300" class="aligncenter size-full wp-image-172" /></a></p>
<p>The httpclient has a built in delay of 10ms between connections, so it took nearly 3 hours to open a million connections. The resident memory used by the mochiweb process with 1M open connections was around 25GB. Here&#8217;s the server this was running on as seen by Ganglia, which measures CPU, network and memory usage and produces nice graphs:</p>
<p><a href="http://www.metabrew.com/wp-content/uploads/2008/11/server21.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/11/server21.png" alt="" title="Server running mochiweb, 1M connections" width="131" height="300" class="alignnone size-medium wp-image-173" /></a></p>
<p>You can see it needs around 38GB and has started to swap. I suspect the difference is mostly consumed by the kernel to keep those connections open. The uplift at the end is when I started sending messages.</p>
<p>Messages were generated using 1,000 processes, with an average time between messages of 60ms per process, giving around 16,666 messages per second overall:</p>
<p><code>erl&gt; [ spawn( fun()-&gt;msggen:start(1000000, 10+random:uniform(100), 1000000) end) || I &lt;- lists:seq(1,1000) ].</code></p>
<p>The machine (server-4) generating messages looked like this on Ganglia:</p>
<p><a href="http://www.metabrew.com/wp-content/uploads/2008/11/msggen.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/11/msggen.png" alt="" title="16,666 msgs/sec" width="131" height="300" class="alignnone size-medium wp-image-170" /></a></p>
<p>That&#8217;s 10 MB per second of messages it&#8217;s pumping out &#8211; 16,666 messages a second. Typically these messages would come from a message bus, app servers, or part of an existing infrastructure.</p>
<p>When I started sending messages, the load on server 1 (hosting subsmanager and router) stayed below 1, and CPU utilization increased from 0 to 5%. </p>
<p>CPU on server 2 (hosting mochiweb app, with 1M connections) increased more dramatically:</p>
<p><a href="http://www.metabrew.com/wp-content/uploads/2008/11/server2.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/11/server2.png" alt="" title="Mochiweb server" width="131" height="300" class="alignnone size-medium wp-image-171" /></a></p>
<p>Naturally as processes have to leave their hibernate state to handle messages, memory usage will increase slightly. Having all connections open with no messages is a best-case for memory usage &#8211; unsurprisingly, actually doing stuff requires more memory.</p>
<p>So where does this leave us? To be on the safe side, the mochiweb machine would need 40GB of RAM to hold open 1M active comet connections. Under load, up to 30GB of the memory would be used by the mochiweb app, and the remaining 10GB by the kernel. In other words, you need to allow 40KB per connection.</p>
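<p>The headline numbers in this test are simple enough to check by hand. Here is a rough back-of-envelope sketch in C, using only figures quoted above (10ms between connections, 1,000 senders at roughly 60ms each, 40GB for 1M connections). The function names are made up, and decimal units are assumed throughout.</p>

```c
#include <assert.h>

/* Seconds needed to open `conns` connections with a fixed delay between each. */
long ramp_up_seconds(long conns, long delay_ms)
{
    assert(conns >= 0 && delay_ms >= 0);
    return conns * delay_ms / 1000;
}

/* Aggregate messages per second from `procs` senders, each sending once
 * every `mean_interval_ms` on average (integer division, rounds down). */
long messages_per_second(long procs, long mean_interval_ms)
{
    assert(mean_interval_ms > 0);
    return procs * 1000 / mean_interval_ms;
}

/* Memory budget per connection in KB, using decimal units (1 GB = 10^6 KB). */
long kb_per_connection(long total_gb, long conns)
{
    assert(conns > 0);
    return total_gb * 1000000L / conns;
}
```

<p>Plugging in the values from the test gives 10,000 seconds of ramp-up (a little under 3 hours), about 16,666 messages per second, and 40KB per connection.</p>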
<p>During various tests with lots of connections, I ended up making some additional changes to my sysctl.conf. This was partly trial and error, as I don&#8217;t really know enough about the internals to make especially informed decisions about which values to change. My policy was to wait for things to break, check <code>/var/log/kern.log</code> and see what mysterious error was reported, then increase stuff that sounded sensible after a spot of googling. Here are the settings in place during the above test:</p>
<pre>net.core.rmem_max = 33554432
net.core.wmem_max = 33554432
net.ipv4.tcp_rmem = 4096 16384 33554432
net.ipv4.tcp_wmem = 4096 16384 33554432
net.ipv4.tcp_mem = 786432 1048576 26777216
net.ipv4.tcp_max_tw_buckets = 360000
net.core.netdev_max_backlog = 2500
vm.min_free_kbytes = 65536
vm.swappiness = 0
net.ipv4.ip_local_port_range = 1024 65535</pre>
<p>I would like to learn more about Linux TCP tuning so I can make a more informed decision about these settings. These are almost certainly not optimal, but at least they were enough to get to 1M connections. These changes, along with the fact that this is running on a 64-bit Erlang VM, and thus has a word size of 8 bytes instead of 4, might explain why the memory usage is much higher than I observed during the C10k test of part 2.</p>
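<p>One detail worth keeping in mind when reading the settings above: <code>net.ipv4.tcp_mem</code> is counted in memory pages, whereas <code>tcp_rmem</code> and <code>tcp_wmem</code> are in bytes. The sketch below converts the three <code>tcp_mem</code> thresholds to bytes. The helper names are illustrative, and the figures in the note after it assume a 4KiB page size, which may not match every machine.</p>

```c
#include <assert.h>
#include <stdio.h>
#include <unistd.h>

/* Convert a tcp_mem value (counted in pages) to bytes for a given page size. */
unsigned long long tcp_mem_bytes(unsigned long long pages, long page_size)
{
    assert(page_size > 0);
    return pages * (unsigned long long)page_size;
}

/* Print the three tcp_mem thresholds from the settings above in GiB,
 * using the page size reported by this machine. */
void print_tcp_mem_limits(void)
{
    const unsigned long long pages[3] = { 786432ULL, 1048576ULL, 26777216ULL };
    const char *names[3] = { "min", "pressure", "max" };
    long page_size = sysconf(_SC_PAGESIZE);
    for (int i = 0; i < 3; i++) {
        double gib = (double)tcp_mem_bytes(pages[i], page_size)
                     / (1024.0 * 1024.0 * 1024.0);
        printf("tcp_mem %-8s %10llu pages = %.1f GiB\n", names[i], pages[i], gib);
    }
}
```

<p>With 4KiB pages the first two thresholds come to 3GiB and 4GiB, and the third to roughly 102GiB, which is far more than the RAM in any of the test machines.</p>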
<h2>An Erlang C-Node using Libevent</h2>
<p>After dabbling with the HTTP API for libevent, it seemed entirely sensible to try the 1M connection test against a libevent HTTPd written in C so we have a basis for comparison.</p>
<p>I&#8217;m guessing that enabling kernel poll means the Erlang VM is able to use epoll (or similar), but even so there&#8217;s clearly some overhead involved which we might be able to mitigate by delegating the connection handling to a C program using libevent. I want to reuse most of the Erlang code so far, so let&#8217;s do the bare minimum in C &#8211; just the connection handling and HTTP stuff.</p>
<p>Libevent has an asynchronous HTTP API, which makes implementing http servers trivial &#8211; well, trivial for C, but still less trivial than mochiweb IMO ;) I&#8217;d also been looking for an excuse to try the Erlang C interface, so the following program combines the two. It&#8217;s a comet http server in C using libevent which identifies users using an integer Id (like our mochiweb app), and also acts as an Erlang C-Node. </p>
<p>It connects to a designated Erlang node, listens for messages like <code>{123, &lt;&lt;"Hello user 123"&gt;&gt;}</code> then dispatches &#8220;Hello user 123&#8221; to user 123, if connected. Messages for users that are not connected are discarded, just like previous examples.</p>
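<p>Stripped of libevent and the Erlang interface, the dispatch rule described above is just a table lookup by user id. The sketch below shows that rule in isolation. <code>struct client</code> and <code>dispatch</code> are hypothetical stand-ins, and the real server keeps libevent request pointers in its table instead.</p>

```c
#include <assert.h>
#include <stddef.h>

#define MAXUSERS (17 * 65536)

/* Hypothetical stand-in for a connected client. */
struct client { int chunks_sent; };

/* Indexed by uid; NULL means "not connected". */
struct client *clients[MAXUSERS + 1];

/* Deliver one message to `uid`. Returns 1 if a chunk was sent, 0 if the
 * message was discarded because the uid is out of range or offline. */
int dispatch(int uid, const char *msg)
{
    assert(msg != NULL);
    if (uid <= 0 || uid > MAXUSERS || clients[uid] == NULL)
        return 0;                 /* unknown or offline user: drop silently */
    clients[uid]->chunks_sent++;  /* a real server would write `msg` as an HTTP chunk */
    return 1;
}
```

<p>A flat array indexed by uid keeps the lookup constant-time, at the cost of reserving one pointer per possible user up front.</p>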
<p>httpdcnode.c</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/types.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/time.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/queue.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;stdlib.h&gt;</span></div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#include &lt;err.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;event.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;evhttp.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;stdio.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;string.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &lt;sys/socket.h&gt;</span></div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#include &lt;netinet/in.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &quot;erl_interface.h&quot;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#include &quot;ei.h&quot;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co2">#include &lt;pthread.h&gt;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define BUFSIZE 1024</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co2">#define MAXUSERS (17*65536) // C1024K</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">// List of current http requests by uid:</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">struct</span> evhttp_request * clients<span class="br0">&#91;</span>MAXUSERS<span class="nu0">+1</span><span class="br0">&#93;</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// Memory to store uids passed to the cleanup callback:</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">int</span> slots<span class="br0">&#91;</span>MAXUSERS<span class="nu0">+1</span><span class="br0">&#93;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">// called when user disconnects</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">void</span> cleanup<span class="br0">&#40;</span><span class="kw4">struct</span> evhttp_connection *evcon, <span class="kw4">void</span> *arg<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> *uidp = <span class="br0">&#40;</span><span class="kw4">int</span> *<span class="br0">&#41;</span> arg;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; fprintf<span class="br0">&#40;</span>stderr, <span class="st0">&quot;disconnected uid %d<span class="es0">\n</span>&quot;</span>, *uidp<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; clients<span class="br0">&#91;</span>*uidp<span class="br0">&#93;</span> = <span class="kw2">NULL</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// handles http connections, sets them up for chunked transfer,</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// extracts the user id and registers in the global connection table,</span></div>
</li>
<li class="li2">
<div class="de2"><span class="co1">// also sends a welcome chunk.</span></div>
</li>
<li class="li1">
<div class="de1"><span class="kw4">void</span> request_handler<span class="br0">&#40;</span><span class="kw4">struct</span> evhttp_request *req, <span class="kw4">void</span> *arg<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw4">struct</span> evbuffer *buf;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; buf = evbuffer_new<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span> <span class="br0">&#40;</span>buf == <span class="kw2">NULL</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; err<span class="br0">&#40;</span><span class="nu0">1</span>, <span class="st0">&quot;failed to create response buffer&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; evhttp_add_header<span class="br0">&#40;</span>req-&gt;output_headers, <span class="st0">&quot;Content-Type&quot;</span>, <span class="st0">&quot;text/html; charset=utf-8&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw4">int</span> uid = <span class="nu0">-1</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>strncmp<span class="br0">&#40;</span>evhttp_request_uri<span class="br0">&#40;</span>req<span class="br0">&#41;</span>, <span class="st0">&quot;/test/&quot;</span>, <span class="nu0">6</span><span class="br0">&#41;</span> == <span class="nu0">0</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; uid = atoi<span class="br0">&#40;</span> <span class="nu0">6</span>+evhttp_request_uri<span class="br0">&#40;</span>req<span class="br0">&#41;</span> <span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>uid &lt;= <span class="nu0">0</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_add_printf<span class="br0">&#40;</span>buf, <span class="st0">&quot;User id not found, try /test/123 instead&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_send_reply<span class="br0">&#40;</span>req, HTTP_NOTFOUND, <span class="st0">&quot;Not Found&quot;</span>, buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_free<span class="br0">&#40;</span>buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">return</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>uid &gt; MAXUSERS<span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_add_printf<span class="br0">&#40;</span>buf, <span class="st0">&quot;Max uid allowed is %d&quot;</span>, MAXUSERS<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_send_reply<span class="br0">&#40;</span>req, HTTP_SERVUNAVAIL, <span class="st0">&quot;We ran out of numbers&quot;</span>, buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_free<span class="br0">&#40;</span>buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">return</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; evhttp_send_reply_start<span class="br0">&#40;</span>req, HTTP_OK, <span class="st0">&quot;OK&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// Send welcome chunk:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; evbuffer_add_printf<span class="br0">&#40;</span>buf, <span class="st0">&quot;Welcome, Url: '%s' Id: %d<span class="es0">\n</span>&quot;</span>, evhttp_request_uri<span class="br0">&#40;</span>req<span class="br0">&#41;</span>, uid<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; evhttp_send_reply_chunk<span class="br0">&#40;</span>req, buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; evbuffer_free<span class="br0">&#40;</span>buf<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// put reference into global uid-&gt;connection table:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; clients<span class="br0">&#91;</span>uid<span class="br0">&#93;</span> = req;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// set close callback</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; evhttp_connection_set_closecb<span class="br0">&#40;</span> req-&gt;evcon, cleanup, &amp;slots<span class="br0">&#91;</span>uid<span class="br0">&#93;</span> <span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// runs in a thread &#8211; the erlang c-node stuff</span></div>
</li>
<li class="li1">
<div class="de1"><span class="co1">// expects msgs like {uid, msg} and sends a &#8216;msg&#8217; chunk to uid if connected</span></div>
</li>
<li class="li2">
<div class="de2"><span class="kw4">void</span> *cnode_run<span class="br0">&#40;</span><span class="kw4">void</span> *arg<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> fd; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="coMULTI">/* fd to Erlang node */</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> got; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="coMULTI">/* Result of receive */</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">unsigned</span> <span class="kw4">char</span> buf<span class="br0">&#91;</span>BUFSIZE<span class="br0">&#93;</span>; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="coMULTI">/* Buffer for incoming message */</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; ErlMessage emsg; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="coMULTI">/* Incoming message */</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ETERM *uid, *msg;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; erl_init<span class="br0">&#40;</span><span class="kw2">NULL</span>, <span class="nu0">0</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">if</span> <span class="br0">&#40;</span>erl_connect_init<span class="br0">&#40;</span><span class="nu0">1</span>, <span class="st0">&quot;secretcookie&quot;</span>, <span class="nu0">0</span><span class="br0">&#41;</span> == <span class="nu0">-1</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; erl_err_quit<span class="br0">&#40;</span><span class="st0">&quot;erl_connect_init&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">if</span> <span class="br0">&#40;</span><span class="br0">&#40;</span>fd = erl_connect<span class="br0">&#40;</span><span class="st0">&quot;httpdmaster@localhost&quot;</span><span class="br0">&#41;</span><span class="br0">&#41;</span> &lt; <span class="nu0">0</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; erl_err_quit<span class="br0">&#40;</span><span class="st0">&quot;erl_connect&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; fprintf<span class="br0">&#40;</span>stderr, <span class="st0">&quot;Connected to httpdmaster@localhost<span class="es0">\n</span><span class="es0">\r</span>&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">struct</span> evbuffer *evbuf;</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">while</span> <span class="br0">&#40;</span><span class="nu0">1</span><span class="br0">&#41;</span> <span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; got = erl_receive_msg<span class="br0">&#40;</span>fd, buf, BUFSIZE, &amp;emsg<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span> <span class="br0">&#40;</span>got == ERL_TICK<span class="br0">&#41;</span> <span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">continue</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span> <span class="kw1">else</span> <span class="kw1">if</span> <span class="br0">&#40;</span>got == ERL_ERROR<span class="br0">&#41;</span> <span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; fprintf<span class="br0">&#40;</span>stderr, <span class="st0">&quot;ERL_ERROR from erl_receive_msg.<span class="es0">\n</span>&quot;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw2">break</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span> <span class="kw1">else</span> <span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span> <span class="br0">&#40;</span>emsg.<span class="me1">type</span> == ERL_REG_SEND<span class="br0">&#41;</span> <span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// get uid and body data from eg: {123, &lt;&lt;&quot;Hello&quot;&gt;&gt;}</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; uid = erl_element<span class="br0">&#40;</span><span class="nu0">1</span>, emsg.<span class="me1">msg</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; msg = erl_element<span class="br0">&#40;</span><span class="nu0">2</span>, emsg.<span class="me1">msg</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw4">int</span> userid = ERL_INT_VALUE<span class="br0">&#40;</span>uid<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw4">char</span> *body = <span class="br0">&#40;</span><span class="kw4">char</span> *<span class="br0">&#41;</span> ERL_BIN_PTR<span class="br0">&#40;</span>msg<span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw4">int</span> body_len = ERL_BIN_SIZE<span class="br0">&#40;</span>msg<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// Is this userid connected?</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">if</span><span class="br0">&#40;</span>clients<span class="br0">&#91;</span>userid<span class="br0">&#93;</span><span class="br0">&#41;</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; fprintf<span class="br0">&#40;</span>stderr, <span class="st0">&quot;Sending %d bytes to uid %d<span class="es0">\n</span>&quot;</span>, body_len, userid<span class="br0">&#41;</span>; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuf = evbuffer_new<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_add<span class="br0">&#40;</span>evbuf, <span class="br0">&#40;</span><span class="kw4">const</span> <span class="kw4">void</span>*<span class="br0">&#41;</span>body, <span class="br0">&#40;</span>size_t<span class="br0">&#41;</span> body_len<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evhttp_send_reply_chunk<span class="br0">&#40;</span>clients<span class="br0">&#91;</span>userid<span class="br0">&#93;</span>, evbuf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; evbuffer_free<span class="br0">&#40;</span>evbuf<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span><span class="kw1">else</span><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; fprintf<span class="br0">&#40;</span>stderr, <span class="st0">&quot;Discarding %d bytes to uid %d &#8211; user not connected<span class="es0">\n</span>&quot;</span>, </div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; body_len, userid<span class="br0">&#41;</span>; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">// noop</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; erl_free_term<span class="br0">&#40;</span>emsg.<span class="me1">msg</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; erl_free_term<span class="br0">&#40;</span>uid<span class="br0">&#41;</span>; </div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; erl_free_term<span class="br0">&#40;</span>msg<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">// if we got here, erlang connection died.</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="co1">// this thread is supposed to run forever</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">// TODO &#8211; gracefully handle failure / reconnect / etc</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; pthread_exit<span class="br0">&#40;</span><span class="nu0">0</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw4">int</span> main<span class="br0">&#40;</span><span class="kw4">int</span> argc, <span class="kw4">char</span> **argv<span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#123;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">// Launch the thread that runs the cnode:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; pthread_attr_t tattr;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; pthread_t helper;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw4">int</span> status;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; pthread_create<span class="br0">&#40;</span>&amp;helper, <span class="kw2">NULL</span>, cnode_run, <span class="kw2">NULL</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">int</span> i;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">for</span><span class="br0">&#40;</span>i=<span class="nu0">0</span>;i&lt;=MAXUSERS;i++<span class="br0">&#41;</span> slots<span class="br0">&#91;</span>i<span class="br0">&#93;</span>=i;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="co1">// Launch libevent httpd:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw4">struct</span> evhttp *httpd;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; event_init<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; httpd = evhttp_start<span class="br0">&#40;</span><span class="st0">&quot;0.0.0.0&quot;</span>, <span class="nu0">8000</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; evhttp_set_gencb<span class="br0">&#40;</span>httpd, request_handler, <span class="kw2">NULL</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; event_dispatch<span class="br0">&#40;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">// Not reached, event_dispatch() shouldn&#8217;t return</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; evhttp_free<span class="br0">&#40;</span>httpd<span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">return</span> <span class="nu0">0</span>;</div>
</li>
<li class="li1">
<div class="de1"><span class="br0">&#125;</span></div>
</li>
</ol>
</div>
<p><br/></p>
<p>The maximum number of users is #defined, and similarly to the mochiweb server, it listens on port 8000 and expects users to connect with a path like so: <code>/test/&lt;userid&gt;</code>. Also hardcoded is the name of the erlang node it will connect to in order to receive messages, <code>httpdmaster@localhost</code>, and the erlang cookie, &#8220;secretcookie&#8221;. Change these accordingly.</p>
<p>Run the erlang node it will connect to first:<br />
<code>$ erl -setcookie secretcookie -sname httpdmaster@localhost</code></p>
<p>Compile and run like so:<br />
<code>$ gcc -o httpdcnode httpdcnode.c -lerl_interface -lei -levent -lpthread<br />
$ ./httpdcnode</code></p>
<p>In the erlang shell, check you can see the hidden c-node:<br />
<code>erl&gt; nodes(hidden).<br />
[c1@localhost]</code></p>
<p>Now connect in your browser to <code>http://localhost:8000/test/123</code>. You should see the welcome message.</p>
<p>Now back to the erlang shell &#8211; send a message to the C node:</p>
<p><code>erl&gt; {any, c1@localhost} ! {123, &lt;&lt;"Hello Libevent World"&gt;&gt;}.</code></p>
<p><em>Note that we don&#8217;t have a Pid to use, so we use the alternate representation of {procname, node}. We use &#8216;any&#8217; as the process name, which is ignored by the C-node.</em></p>
<p><b>Now you&#8217;re able to deliver comet messages via Erlang, but all the http connections are managed by a libevent C program which acts as an Erlang node.</b></p>
<p>After removing the debug print statements, I connected 1M clients to the httpdcnode server using the same client as above. The machine showed a total of just under 10GB of memory used, and the resident memory of the server process was stable at under 2GB:</p>
<p><a href="http://www.metabrew.com/wp-content/uploads/2008/11/mochimem-libevent.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/11/mochimem-libevent.png" alt="" title="Memory of libevent-based server process, 1M connections" width="500" height="300" class="aligncenter size-full wp-image-176" /></a></p>
<p>So big savings compared to mochiweb when handling lots of connections &#8211; the resident memory per connection for the server process with libevent is just under 2KB. With everything connected, the server machine claims:<br />
<code>Mem:  32968672k total,  9636488k used, 23332184k free,      180k buffers</code><br />
So the kernel/tcp stack is consuming an additional 8KB per connection, which seems a little high, but I have no basis for comparison. </p>
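<p>The per-connection figures above can be reproduced with a few lines of arithmetic. This is only a sketch: the connection count and the "used" value come from the numbers quoted in the post, while <code>SERVER_RSS_KB</code> is an assumption that rounds "under 2GB" to 2,000,000 KB, and the function names are made up for illustration. Note that "used" covers everything running on the machine, so the remainder is an upper bound on what the kernel/TCP stack itself consumes.</p>

```c
#include <stdio.h>

/* Figures quoted in the post. */
#define CONNECTIONS   1000000L
#define USED_KB       9636488L   /* "used" column of the Mem: line */

/* Assumption: "under 2GB" resident memory, rounded to 2,000,000 KB. */
#define SERVER_RSS_KB 2000000L

/* Total memory in use on the machine, divided across all connections. */
static double total_kb_per_conn(void)
{
    return (double)USED_KB / (double)CONNECTIONS;
}

/* Resident memory of the server process per connection. */
static double server_kb_per_conn(void)
{
    return (double)SERVER_RSS_KB / (double)CONNECTIONS;
}

/* Whatever is left over: kernel/TCP buffers plus everything else running. */
static double other_kb_per_conn(void)
{
    return total_kb_per_conn() - server_kb_per_conn();
}

/* Print the three figures in KB per connection. */
static void print_report(void)
{
    printf("total:  %.2f KB/conn\n", total_kb_per_conn());
    printf("server: %.2f KB/conn\n", server_kb_per_conn());
    printf("other:  %.2f KB/conn\n", other_kb_per_conn());
}
```

<p>Calling <code>print_report()</code> from a <code>main</code> shows the split: a little under 10KB in total, about 2KB in the server process, and the remainder (roughly 8KB) outside it.</p>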
<p>This libevent-cnode server needs a bit more work. It doesn&#8217;t sensibly handle multiple connections from the same user yet, and there&#8217;s no locking, so a race condition exists if you disconnect just as a message is about to be dispatched.</p>
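<p>One way to close that particular race is to guard the uid-to-connection table with a mutex, so that the lookup and the send happen as one step relative to the close callback. The sketch below is an assumption about how that might look, not the code from the post: <code>struct conn</code> is a hypothetical stand-in for libevent's request object, and <code>client_add</code>, <code>client_remove</code> and <code>client_send</code> are made-up helper names. It only protects the table. As far as I know, the evhttp calls themselves are not meant to be made from a second thread, so a fuller fix would probably hand messages over to the event-loop thread instead.</p>

```c
#include <pthread.h>
#include <stddef.h>

#define MAXUSERS 1024   /* hypothetical value; the post #defines its own limit */

/* Hypothetical stand-in for libevent's struct evhttp_request. */
struct conn {
    int    uid;
    size_t bytes_sent;
};

static struct conn *clients[MAXUSERS + 1];
static pthread_mutex_t clients_lock = PTHREAD_MUTEX_INITIALIZER;

static int valid_uid(int uid)
{
    return uid >= 0 && uid <= MAXUSERS;
}

/* HTTP side: record a newly connected user. Returns 0, or -1 for a bad uid. */
static int client_add(int uid, struct conn *c)
{
    if (!valid_uid(uid))
        return -1;
    pthread_mutex_lock(&clients_lock);
    clients[uid] = c;
    pthread_mutex_unlock(&clients_lock);
    return 0;
}

/* HTTP side: called from the close callback when a user disconnects. */
static void client_remove(int uid)
{
    if (!valid_uid(uid))
        return;
    pthread_mutex_lock(&clients_lock);
    clients[uid] = NULL;
    pthread_mutex_unlock(&clients_lock);
}

/* C-node side: look up and deliver while holding the lock, so the slot
 * cannot be cleared between the check and the send.
 * Returns 1 if the message was delivered, 0 if the user is not connected. */
static int client_send(int uid, size_t body_len)
{
    int delivered = 0;

    if (!valid_uid(uid))
        return 0;
    pthread_mutex_lock(&clients_lock);
    if (clients[uid] != NULL) {
        /* A real server would build an evbuffer and send the chunk here. */
        clients[uid]->bytes_sent += body_len;
        delivered = 1;
    }
    pthread_mutex_unlock(&clients_lock);
    return delivered;
}
```

<p>Holding the lock across the send keeps the example simple; with slow sends it would be worth measuring whether that becomes a bottleneck.</p>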
<p>Even so, I think this could be generalized in a way that would allow you to <strong>use Erlang for all the interesting stuff, and have a C+libevent process act as a dumb connection-pool</strong>. With a bit more wrapper code and callbacks into Erlang, you&#8217;d hardly need to know this was going on &#8211; the C program could be run as a driver or a C-node, and an Erlang wrapper could give you a decent API built on top of libevent (see <a href="http://www.metabrew.com/article/erlang-libketama-driver-consistent-hashing/">this post</a> for an example Erlang C driver). I would like to experiment further with this.</p>
<h2>Final Thoughts</h2>
<p>I have enough data now to judge how much hardware would be needed if we deploy a large scale comet system for Last.fm. Even the worst case of 40KB per connection is workable &#8211; memory is pretty cheap at the moment, and 40GB to support a million users is not unreasonable. 10GB is even better. I will finish up the app I&#8217;m building and deploy it somewhere people can try it out. Along the way I&#8217;ll tidy up and release the erlang memcached client I&#8217;m using (from jungerl, with modifications for consistent hashing and some bug fixes), along with some other things. Stay tuned :)</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3//feed</wfw:commentRss>
		<slash:comments>40</slash:comments>
		</item>
		<item>
		<title>A Million-user Comet Application with Mochiweb, Part 2</title>
		<link>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2/</link>
		<comments>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 15:06:08 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[comet]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[mochiweb]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=100</guid>
		<description><![CDATA[In Part 1, we built a (somewhat useless) mochiweb comet application that sent clients a message every 10 seconds. We tuned the Linux kernel, and built a tool to establish a lot of connections in order to test performance and memory usage. We found that it took around 45KB per connection. Part 2 is about [...]]]></description>
			<content:encoded><![CDATA[<p>In <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a>, we built a (somewhat useless) mochiweb comet application that sent clients a message every 10 seconds. We tuned the Linux kernel, and built a tool to establish a lot of connections in order to test performance and memory usage. We found that it took around 45KB per connection.</p>
<p>Part 2 is about turning the application into something useful, and saving memory:</p>
<ul>
<li>Implement a message router with a login/logout/send API</li>
<li>Update the mochiweb app to receive messages from the router</li>
<li>Set up a distributed Erlang system so we can run the router on a different node/host to mochiweb</li>
<li>Write a tool to spam the router with lots of messages</li>
<li>Graph memory usage over 24hrs, and optimise the mochiweb app to save memory.</li>
</ul>
<p>This means we are decoupling the message sending logic from the mochiweb app. In tandem with the floodtest tool from part 1, we can benchmark a setup closer to a production scenario.</p>
<h2>Implementing the message router</h2>
<p>The router API is just 3 functions:</p>
<ul>
<li><code>login(Id, Pid)</code> registers a process (of pid <code>Pid</code>) to receive messages for <code>Id</code></li>
<li><code>logout(Pid)</code> stops the process <code>Pid</code> receiving messages</li>
<li><code>send(Id, Msg)</code> sends the message <code>Msg</code> to any client logged in as <code>Id</code></li>
</ul>
<p>Note that, by design, it is possible for one process to login with multiple different <code>Id</code>s.</p>
<p>This example router module uses 2 <code>ets</code> tables to store bidirectional mappings between Pids and Ids. (<code>pid2id</code> and <code>id2pid</code> in the <code>#state</code> record below.)</p>
<p>router.erl:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>router<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">behaviour</span><span class="br0">&#40;</span>gen_server<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span><span class="kw3">start_link</span>/<span class="nu0">0</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>init/<span class="nu0">1</span>, handle_call/<span class="nu0">3</span>, handle_cast/<span class="nu0">2</span>, handle_info/<span class="nu0">2</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp;terminate/<span class="nu0">2</span>, code_change/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>send/<span class="nu0">2</span>, login/<span class="nu0">2</span>, logout/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">-define<span class="br0">&#40;</span><span class="re0">SERVER</span>, global:<span class="me2">whereis_name</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% will hold bidirectional mapping between id &lt;&#8211;&gt; pid</span></div>
</li>
<li class="li1">
<div class="de1">-record<span class="br0">&#40;</span>state, <span class="br0">&#123;</span>pid2id, id2pid<span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">start_link</span><span class="br0">&#40;</span><span class="br0">&#123;</span>global, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, ?<span class="re0">MODULE</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% sends Msg to anyone logged in as Id</span></div>
</li>
<li class="li1">
<div class="de1">send<span class="br0">&#40;</span><span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>send, <span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">login<span class="br0">&#40;</span><span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>login, <span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">logout<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">gen_server</span>:<span class="kw3">call</span><span class="br0">&#40;</span>?<span class="re0">SERVER</span>, <span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%%</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">init<span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% set this so we can catch death of logged in pids:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; process_flag<span class="br0">&#40;</span>trap_exit, true<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% use ets for routing tables</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, #state<span class="br0">&#123;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; pid2id = ets:<span class="me2">new</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, <span class="br0">&#91;</span>bag<span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; id2pid = ets:<span class="me2">new</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, <span class="br0">&#91;</span>bag<span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>login, <span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">ets</span>:<span class="me2">insert</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="br0">&#123;</span><span class="re0">Pid</span>, <span class="re0">Id</span><span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; ets:<span class="me2">insert</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="br0">&#123;</span><span class="re0">Id</span>, <span class="re0">Pid</span><span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; link<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span>, <span class="co1">% tell us if they exit, so we can log them out</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;~w logged in as ~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Pid</span>, <span class="re0">Id</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> when is_pid<span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">unlink</span><span class="br0">&#40;</span><span class="re0">Pid</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">PidRows</span> = ets:<span class="me2">lookup</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="re0">Pid</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">PidRows</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#93;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">ok</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">IdRows</span> = <span class="br0">&#91;</span> <span class="br0">&#123;</span><span class="re0">I</span>,<span class="re0">P</span><span class="br0">&#125;</span> || <span class="br0">&#123;</span><span class="re0">P</span>,<span class="re0">I</span><span class="br0">&#125;</span> &lt;- <span class="re0">PidRows</span> <span class="br0">&#93;</span>, <span class="co1">% invert tuples</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% delete all pid-&gt;id entries</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; ets:<span class="me2">delete</span><span class="br0">&#40;</span><span class="re0">State</span>#state.pid2id, <span class="re0">Pid</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% and all id-&gt;pid</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span> ets:<span class="me2">delete_object</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="re0">Obj</span><span class="br0">&#41;</span> || <span class="re0">Obj</span> &lt;- <span class="re0">IdRows</span> <span class="br0">&#93;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; io:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;pid ~w logged out<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Pid</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>send, <span class="re0">Id</span>, <span class="re0">Msg</span><span class="br0">&#125;</span>, _<span class="re0">From</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% get pids who are logged in as this Id</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="re0">Pids</span> = <span class="br0">&#91;</span> <span class="re0">P</span> || <span class="br0">&#123;</span> _<span class="re0">Id</span>, <span class="re0">P</span> <span class="br0">&#125;</span> &lt;- ets:<span class="me2">lookup</span><span class="br0">&#40;</span><span class="re0">State</span>#state.id2pid, <span class="re0">Id</span><span class="br0">&#41;</span> <span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% send Msg to them all</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">M</span> = <span class="br0">&#123;</span>router_msg, <span class="re0">Msg</span><span class="br0">&#125;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#91;</span> <span class="re0">Pid</span> ! <span class="re0">M</span> || <span class="re0">Pid</span> &lt;- <span class="re0">Pids</span> <span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>reply, ok, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% handle death and cleanup of logged in processes</span></div>
</li>
<li class="li1">
<div class="de1">handle_info<span class="br0">&#40;</span><span class="re0">Info</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Info</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="st0">&#8216;EXIT&#8217;</span>, <span class="re0">Pid</span>, _<span class="re0">Why</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% force logout:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; handle_call<span class="br0">&#40;</span><span class="br0">&#123;</span>logout, <span class="re0">Pid</span><span class="br0">&#125;</span>, blah, <span class="re0">State</span><span class="br0">&#41;</span>; </div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Wtf</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Caught unhandled message: ~w<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span><span class="re0">Wtf</span><span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">handle_cast<span class="br0">&#40;</span>_<span class="re0">Msg</span>, <span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>noreply, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
<li class="li1">
<div class="de1">terminate<span class="br0">&#40;</span>_<span class="re0">Reason</span>, _<span class="re0">State</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="me1">ok</span>.</div>
</li>
<li class="li1">
<div class="de1">code_change<span class="br0">&#40;</span>_<span class="re0">OldVsn</span>, <span class="re0">State</span>, _<span class="re0">Extra</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">State</span><span class="br0">&#125;</span>.</div>
</li>
</ol>
</div>
<p><br/></p>
<h2>Updating the mochiweb application</h2>
<p>Let&#8217;s assume a user is represented by an integer <code>Id</code> based on the URL they connect to mochiweb with, and use that id to register with the message router. Instead of blocking for 10 seconds then sending something, the mochiweb loop will block on receiving messages from the router, and send an HTTP chunk to the client for every message the router sends it:</p>
<ul>
<li>Client connects to mochiweb at http://localhost:8000/test/123</li>
<li>Mochiweb app registers the pid for that connection against the id <code>123</code> with the message router</li>
<li>If you send a message to the router addressed to id <code>123</code>, it will be relayed to the correct mochiweb process and appear in the browser for that user</li>
</ul>
<p>Here&#8217;s the updated version of mochiconntest_web.erl:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>mochiconntest_web<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">1</span>, stop/<span class="nu0">0</span>, loop/<span class="nu0">2</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">%% External API</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">DocRoot</span>, <span class="re0">Options1</span><span class="br0">&#125;</span> = get_option<span class="br0">&#40;</span>docroot, <span class="re0">Options</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Loop</span> = fun <span class="br0">&#40;</span><span class="re0">Req</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;?<span class="re0">MODULE</span>:<span class="me2">loop</span><span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% we&#8217;ll set our maximum to 1 million connections. (default: 2048)</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; mochiweb_http:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#123;</span>max, <span class="nu0">1000000</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>name, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>loop, <span class="re0">Loop</span><span class="br0">&#125;</span> | <span class="re0">Options1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">stop<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">mochiweb_http</span>:<span class="me2">stop</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">loop<span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="st0">&quot;/&quot;</span> ++ <span class="re0">Path</span> = <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>path<span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>method<span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Method</span> when <span class="re0">Method</span> =:= <span class="st0">&#8216;GET&#8217;</span>; <span class="re0">Method</span> =:= <span class="st0">&#8216;HEAD&#8217;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&quot;test/&quot;</span> ++ <span class="re0">Id</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span> = <span class="re0">Req</span>:<span class="me2">ok</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="st0">&quot;text/html; charset=utf-8&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;Server&quot;</span>,<span class="st0">&quot;Mochiweb-Test&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; chunked<span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% login using an integer rather than a string</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">IdInt</span>, _<span class="br0">&#125;</span> = string:<span class="me2">to_integer</span><span class="br0">&#40;</span><span class="re0">Id</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; router:<span class="me2">login</span><span class="br0">&#40;</span><span class="re0">IdInt</span>, self<span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">IdInt</span>, <span class="nu0">1</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&#8216;POST&#8217;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">respond</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="nu0">501</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="re0">N</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span>router_msg, <span class="re0">Msg</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Html</span> = io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Recvd msg #~w: &#8216;~s&#8217;&quot;</span>, <span class="br0">&#91;</span><span class="re0">N</span>, <span class="re0">Msg</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span>:<span class="me2">write_chunk</span><span class="br0">&#40;</span><span class="re0">Html</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="re0">N</span><span class="nu0">+1</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%% Internal API</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">get_option<span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>proplists:<span class="me2">get_value</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span>, proplists:<span class="me2">delete</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span><span class="br0">&#125;</span>.</div>
</li>
</ol>
</div>
<p><br/></p>
<h2>It&#8217;s Alive!</h2>
<p>Now let&#8217;s bring it to life &#8211; we&#8217;ll use 2 erlang shells, one for mochiweb and one for the router. Edit <code>start-dev.sh</code>, used to start mochiweb,  and add the following additional parameters to <code>erl</code>:</p>
<ul>
<li><code>-sname n1</code> to name the erlang node &#8216;n1&#8217;</li>
<li><code>+K true</code> to enable kernel-poll. Seems daft not to when dealing with lots of connections</li>
<li><code>+P 134217727</code> to raise the process limit. The default maximum number of processes you can spawn is 32768. Considering we need one process per connection (and I don&#8217;t know of any good reason not to), I suggest just setting this to the maximum possible value: 134,217,727 is the max according to &#8220;man erl&#8221;.</li>
</ul>
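<p>To confirm that the <code>+P</code> setting took effect, a small check along these lines can be run inside the node. This is a minimal sketch: the module name <code>vm_limits</code> is made up for illustration, and it only relies on the standard <code>erlang:system_info/1</code> call.</p>

```erlang
-module(vm_limits).
-export([report/0]).

%% Hypothetical helper, not part of the mochiweb app.
%% Returns the configured process limit and the number of
%% processes currently alive on this node.
report() ->
    [{process_limit, erlang:system_info(process_limit)},
     {process_count, erlang:system_info(process_count)}].
```

<p>Calling <code>vm_limits:report().</code> from the mochiweb shell shows how much headroom is left between the live process count and the limit.</p>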
<p>Now run <code>make &amp;&amp; ./start-dev.sh</code> and you should see a prompt like this: <code>(n1@localhost)1&gt;</code> &#8211; your mochiweb app is now running and the erlang node has a name.</p>
<p>Now run another erlang shell like so:<br />
<code>erl -sname n2</code><br />
Currently those two erlang instances don&#8217;t know about each other, so let&#8217;s fix that:<br />
<code>(n2@localhost)1&gt; nodes().<br />
[]<br />
(n2@localhost)2&gt; net_adm:ping(n1@localhost).<br />
pong<br />
(n2@localhost)3&gt; nodes().<br />
[n1@localhost]</code></p>
<p>Now compile and start the router from this shell:<br />
<code>(n2@localhost)4&gt; c(router).<br />
{ok,router}<br />
(n2@localhost)5&gt; router:start_link().<br />
{ok,&lt;0.38.0&gt;}</code></p>
<p>Now for the fun bit: go to <code>http://localhost:8000/test/123</code> in your browser (or use <code>lynx --source "http://localhost:8000/test/123"</code> from the console). Check the shell you launched the router in; you should see that it logged in one user.</p>
<p>You can now send messages to the router and watch them appear in your browser. Only send strings for now: the <code>feed</code> function formats each message with <code>~s</code> via <code>io_lib:format</code>, so non-string terms such as integers or tuples will crash it.</p>
<p>Just borrow the shell you used to launch the router:</p>
<p><code>(n2@localhost)6&gt; router:send(123, "Hello World").<br />
(n2@localhost)7&gt; router:send(123, "Why not open another browser window too?").<br />
(n2@localhost)8&gt; router:send(456, "This message will go into the void unless you are connected as /test/456 too").</code></p>
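<p>If you would rather accept any term, one option is to fall back to <code>~p</code> when the message is not a printable string. The following is a minimal sketch under that assumption; the module <code>feed_fmt</code> and the function <code>render/2</code> are hypothetical names and are not part of mochiconntest_web.erl.</p>

```erlang
-module(feed_fmt).
-export([render/2]).

%% Hypothetical helper: build the chunk text for message number N.
%% Printable strings keep the '~s' formatting used by feed/3;
%% anything else (integers, tuples, ...) is pretty-printed with ~p
%% instead of crashing the connection process.
render(N, Msg) ->
    Fmt = case io_lib:printable_list(Msg) of
              true  -> "Recvd msg #~w: '~s'";
              false -> "Recvd msg #~w: ~p"
          end,
    lists:flatten(io_lib:format(Fmt, [N, Msg])).
```

<p>With this in place, <code>feed_fmt:render(1, "Hello")</code> produces the same text as before, while <code>feed_fmt:render(2, {a,1})</code> renders the tuple as text rather than raising an error.</p>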
<p>Check your browser, you&#8217;ve got comet :)</p>
<h2>Running in a distributed erlang system</h2>
<p>It makes sense to run the router and mochiweb front-end(s) on different machines. Assuming you have a couple of spare machines to test this on, you should start the erlang shells as distributed nodes, i.e. use <code>-name n1@host1.example.com</code> instead of <code>-sname n1</code> (and the same for n2). Make sure they can see each other by using <code>net_adm:ping(...)</code> as above.</p>
<p>Note that on line 16 of router.erl, the name of the router process (&#8216;router&#8217;) is registered globally, and that because we are using the following macro to identify/locate the router in calls to gen_server, it will already work fine in a distributed system:</p>
<p><code>-define(SERVER, global:whereis_name(?MODULE)).</code></p>
<p>A global name registry for processes in a distributed system is just one of the things you get for free with Erlang. </p>
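<p>The global registry can be tried in isolation. This is a minimal sketch, assuming a throwaway name <code>demo_router</code> and a dummy process standing in for the real router; it uses only the standard <code>global:register_name/2</code> and <code>global:whereis_name/1</code> calls.</p>

```erlang
-module(global_demo).
-export([run/0]).

%% Hypothetical demo: register a dummy process under a global name,
%% look it up again, then shut it down.
run() ->
    Pid = spawn(fun() -> receive stop -> ok end end),
    %% register_name/2 returns 'yes' on success, 'no' if the name is taken
    yes = global:register_name(demo_router, Pid),
    %% any connected node can resolve the name back to the pid
    Pid = global:whereis_name(demo_router),
    Pid ! stop,
    ok.
```

<p>On a cluster of connected nodes, the lookup in the second step works from any node, which is what lets the mochiweb node find the router without knowing where it runs.</p>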
<h2>Generating lots of messages</h2>
<p>In a real environment we might see a long-tail-like usage pattern, with some very active users and many infrequent users. However, for this test we&#8217;ll just indiscriminately spam random users with fake messages.</p>
<p>msggen.erl:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>msggen<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="nu0">0</span>, _, _<span class="br0">&#41;</span> -&gt; <span class="me1">ok</span>;</div>
</li>
<li class="li2">
<div class="de2">start<span class="br0">&#40;</span><span class="re0">Num</span>, <span class="re0">Interval</span>, <span class="re0">Max</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Id</span> = random:<span class="me2">uniform</span><span class="br0">&#40;</span><span class="re0">Max</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; router:<span class="me2">send</span><span class="br0">&#40;</span><span class="re0">Id</span>, <span class="st0">&quot;Fake message Num = &quot;</span> ++ <span class="re0">Num</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">receive</span> <span class="kw1">after</span> <span class="re0">Interval</span> -&gt; <span class="me1">start</span><span class="br0">&#40;</span><span class="re0">Num</span> <span class="nu0">-1</span>, <span class="re0">Interval</span>, <span class="re0">Max</span><span class="br0">&#41;</span> <span class="kw1">end</span>.</div>
</li>
</ol>
</div>
<p><br/><br />
This will send <code>Num</code> messages to random user Ids between 1 and <code>Max</code>, waiting <code>Interval</code> ms between each send. </p>
<p>You can see this in action if you run the router and the mochiweb app, connect with your browser to <code>http://localhost:8000/test/3</code> then run:</p>
<pre>erl -sname test
(test@localhost)1> net_adm:ping(n1@localhost).
pong
(test@localhost)2> c(msggen).
{ok,msggen}
(test@localhost)3> msggen:start(20, 10, 5).
ok</pre>
<p>This will send 20 messages to random Ids between 1-5, with a 10ms wait between messages. Chances are Id 3 will receive a message or four.</p>
<p>We can even run a few of these in parallel to simulate multiple sources for messages. Here&#8217;s an example of spawning 10 processes that each send 20 messages to ids 1-5 with a 100ms delay between each message:</p>
<p><code>[ spawn(fun() -&gt; msggen:start(20, 100, 5), io:format("~w finished.\n", [self()]) end) || _ &lt;- lists:seq(1,10) ].<br />
[&lt;0.97.0&gt;,&lt;0.98.0&gt;,&lt;0.99.0&gt;,&lt;0.100.0&gt;,&lt;0.101.0&gt;,&lt;0.102.0&gt;,<br />
 &lt;0.103.0&gt;,&lt;0.104.0&gt;,&lt;0.105.0&gt;,&lt;0.106.0&gt;]<br />
&lt;0.101.0&gt; finished.<br />
&lt;0.105.0&gt; finished.<br />
&lt;0.106.0&gt; finished.<br />
&lt;0.104.0&gt; finished.<br />
&lt;0.102.0&gt; finished.<br />
&lt;0.98.0&gt; finished.<br />
&lt;0.99.0&gt; finished.<br />
&lt;0.100.0&gt; finished.<br />
&lt;0.103.0&gt; finished.<br />
&lt;0.97.0&gt; finished.<br />
</code></p>
<h2>C10K again, with feeling</h2>
<p>We now have the pieces we need to run another larger-scale test: clients connect to our mochiweb app, which registers them with the message router. We can generate a high volume of fake messages to fire at the router, which will send them to any registered clients. Let&#8217;s run the 10,000 concurrent-user test again from <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a>, but this time we&#8217;ll leave all the clients connected for a while as we blast lots of messages through the system.</p>
<p>Assuming you followed the instructions in <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a> to tune your kernel and increase your max files ulimit etc, this should be easy. You already have the mochiweb app and router running, so let&#8217;s dump more traffic on it.</p>
<p>Without any clients connected, the mochiweb beam process uses around 40MB (resident):</p>
<p><code>$ ps -o rss= -p `pgrep -f 'sname n1'`<br />
40156</code></p>
<p><i>This greps for the process ID of the command with &#8216;sname n1&#8217; in it, which is our mochiweb erlang process, then uses some formatting options to <code>ps</code> to print the RSS value &#8211; the resident memory size (KB).</i></p>
<p>I concocted this hideous one-liner to print the timestamp (human readable and a unixtime in case we need it later), current memory usage of mochiweb (resident KB), and the number of currently established connections every 60 seconds &#8211; leave this running on the mochiweb machine in a spare terminal:</p>
<p><code>$ MOCHIPID=`pgrep -f 'name n1'`; while [ 1 ] ; do NUMCON=`netstat -n | awk '/ESTABLISHED/ &#038;&#038; $4=="127.0.0.1:8000"' | wc -l`; MEM=`ps -o rss= -p $MOCHIPID`; echo -e "`date`\t`date +%s`\t$MEM\t$NUMCON"; sleep 60; done | tee -a mochimem.log</code></p>
<p><i>If anyone knows a better way to plot memory usage for a single process over time, please leave a comment.</i></p>
<p>Now launch the <code>floodtest</code> tool from Part 1 in a new erl shell:<br />
<code>erl> floodtest:start("/tmp/mochi-urls.txt", 10).</code></p>
<p>This will establish 100 new connections per second until all 10,000 clients are connected.<br />
You&#8217;ll see it quickly reaches 10k connections:<br />
<code>erl&gt; floodtest:start("/tmp/mochi-urls.txt", 10).<br />
Stats: {825,0,0}<br />
Stats: {1629,0,0}<br />
Stats: {2397,0,0}<br />
Stats: {3218,0,0}<br />
Stats: {4057,0,0}<br />
Stats: {4837,0,0}<br />
Stats: {5565,0,0}<br />
Stats: {6295,0,0}<br />
Stats: {7022,0,0}<br />
Stats: {7727,0,0}<br />
Stats: {8415,0,0}<br />
Stats: {9116,0,0}<br />
Stats: {9792,0,0}<br />
Stats: {10000,0,0}<br />
...</code></p>
<p>Check the hideous memory usage one-liner output:<br />
<code>Mon Oct 20 16:57:24 BST 2008    1224518244      40388   1<br />
Mon Oct 20 16:58:25 BST 2008    1224518305      41120   263<br />
Mon Oct 20 16:59:27 BST 2008    1224518367      65252   5267<br />
Mon Oct 20 17:00:32 BST 2008    1224518432      89008   9836<br />
Mon Oct 20 17:01:37 BST 2008    1224518497      90748   10001<br />
Mon Oct 20 17:02:41 BST 2008    1224518561      90964   10001<br />
Mon Oct 20 17:03:46 BST 2008    1224518626      90964   10001<br />
Mon Oct 20 17:04:51 BST 2008    1224518691      90964   10001</code></p>
<p>It reached 10k concurrent connections (plus one I had open in firefox) and the resident memory size of mochiweb is around 90MB (90964KB).</p>
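<p>Those two readings give a rough per-connection cost. The sketch below just does the arithmetic on the RSS figures from the log above; the module name <code>mem_per_conn</code> is hypothetical, and the result is only an estimate because RSS includes more than the connection processes themselves.</p>

```erlang
-module(mem_per_conn).
-export([kb_per_conn/3]).

%% Hypothetical helper: average resident memory (KB) added per connection,
%% given the idle RSS, the RSS with all clients connected, and the
%% number of connections.
kb_per_conn(IdleKb, LoadedKb, Conns) when Conns > 0 ->
    (LoadedKb - IdleKb) / Conns.
```

<p>With the numbers logged here, <code>mem_per_conn:kb_per_conn(40388, 90964, 10000)</code> works out to roughly 5KB per idle connection.</p>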
<p>Now unleash some messages:</p>
<p><code>erl&gt; [ spawn(fun() -&gt; msggen:start(1000000, 100, 10000) end) || _ &lt;- lists:seq(1,100) ].<br />
[&lt;0.65.0&gt;,&lt;0.66.0&gt;,&lt;0.67.0&gt;,&lt;0.68.0&gt;,&lt;0.69.0&gt;,&lt;0.70.0&gt;,<br />
 &lt;0.71.0&gt;,&lt;0.72.0&gt;,&lt;0.73.0&gt;,&lt;0.74.0&gt;,&lt;0.75.0&gt;,&lt;0.76.0&gt;,<br />
 &lt;0.77.0&gt;,&lt;0.78.0&gt;,&lt;0.79.0&gt;,&lt;0.80.0&gt;,&lt;0.81.0&gt;,&lt;0.82.0&gt;,<br />
 &lt;0.83.0&gt;,&lt;0.84.0&gt;,&lt;0.85.0&gt;,&lt;0.86.0&gt;,&lt;0.87.0&gt;,&lt;0.88.0&gt;,<br />
 &lt;0.89.0&gt;,&lt;0.90.0&gt;,&lt;0.91.0&gt;,&lt;0.92.0&gt;,&lt;0.93.0&gt;|...]</code></p>
<p>That&#8217;s 100 processes each sending a million messages at a rate of 10 messages a second to random Ids from 1 to 10,000. That means the router is seeing 1000 messages per second, and on average each of our 10k clients will get one message every 10 seconds.</p>
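<p>The arithmetic behind those figures can be written down as a small function. This is a sketch for checking the numbers only; <code>rate_calc</code> and <code>summary/4</code> are hypothetical names, and the calculation ignores the time spent inside <code>router:send</code> itself.</p>

```erlang
-module(rate_calc).
-export([summary/4]).

%% Hypothetical helper. Arguments mirror the msggen test setup:
%%   Procs      - number of msggen processes
%%   IntervalMs - wait between sends in each process (ms)
%%   NumPerProc - messages each process will send
%%   Clients    - number of connected clients (Ids 1..Clients)
%% Returns {RouterMsgsPerSec, SecsBetweenMsgsPerClient, DurationHours}.
summary(Procs, IntervalMs, NumPerProc, Clients) ->
    PerProcRate = 1000 / IntervalMs,          % msgs/sec from one process
    RouterRate = Procs * PerProcRate,         % msgs/sec seen by the router
    SecsPerClient = Clients / RouterRate,     % average gap for one client
    DurationHours = NumPerProc / PerProcRate / 3600,
    {RouterRate, SecsPerClient, DurationHours}.
```

<p>For this test, <code>rate_calc:summary(100, 100, 1000000, 10000)</code> gives 1000 messages per second at the router, one message every 10 seconds per client on average, and a total run time of about 27.8 hours.</p>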
<p>Check the output in the <code>floodtest</code> shell, and you&#8217;ll see clients are receiving http chunks (remember it was {NumConnected, NumClosed, NumChunksRecvd}):<br />
<code>...<br />
Stats: {10000,0,5912}<br />
Stats: {10000,0,15496}<br />
Stats: {10000,0,25145}<br />
Stats: {10000,0,34755}<br />
Stats: {10000,0,44342}<br />
...</code></p>
<p>A million messages at a rate of 10 per second per process will take nearly 28 hours to complete. Here&#8217;s how the memory usage looks after just 10 mins:<br />
<code>Mon Oct 20 16:57:24 BST 2008    1224518244      40388   1<br />
Mon Oct 20 16:58:25 BST 2008    1224518305      41120   263<br />
Mon Oct 20 16:59:27 BST 2008    1224518367      65252   5267<br />
Mon Oct 20 17:00:32 BST 2008    1224518432      89008   9836<br />
Mon Oct 20 17:01:37 BST 2008    1224518497      90748   10001<br />
Mon Oct 20 17:02:41 BST 2008    1224518561      90964   10001<br />
Mon Oct 20 17:03:46 BST 2008    1224518626      90964   10001<br />
Mon Oct 20 17:04:51 BST 2008    1224518691      90964   10001<br />
Mon Oct 20 17:05:55 BST 2008    1224518755      90980   10001<br />
Mon Oct 20 17:07:00 BST 2008    1224518820      91120   10001<br />
Mon Oct 20 17:08:05 BST 2008    1224518885      98664   10001<br />
Mon Oct 20 17:09:10 BST 2008    1224518950      106752  10001<br />
Mon Oct 20 17:10:15 BST 2008    1224519015      114044  10001<br />
Mon Oct 20 17:11:20 BST 2008    1224519080      119468  10001<br />
Mon Oct 20 17:12:25 BST 2008    1224519145      125360  10001</code></p>
<p>You can see the size already crept up from 40MB to 90MB when all 10k clients were connected, and to 125MB after running a bit longer.</p>
<p>It&#8217;s worth pointing out that the floodtest shell is almost CPU-bound, while the msggen shell is using 2% CPU and the router and mochiweb less than 1%. In other words, only simulating lots of clients uses much CPU &#8211; the server app itself is very light on the CPU. It helps to have multiple machines, or a multicore CPU, for testing.</p>
<h2>Results after running for 24 hours</h2>
<p>I ran this for 24 hours, whilst logging memory usage of the mochiweb process to mochimem.log. This is with 10,000 connected clients, and 1000 messages per second being sent to random clients.</p>
<p>The following bit of bash/awk was used to trick gnuplot into turning the mochimem.log file into a graph:</p>
<p><code>(echo -e "set terminal png size 500,300\nset xlabel \"Minutes Elapsed\"\nset ylabel \"Mem (KB)\"\nset title \"Mem usage with 10k active connections, 1000 msg/sec\"\nplot \"-\" using 1:2 with lines notitle" ; awk 'BEGIN{FS="\t";} NR%10==0 {if(!t){t=$2} mins=($2-t)/60; printf("%d %d\n",mins,$3)}' mochimem.log ; echo -e "end" ) | gnuplot > mochimem.png</code></p>
<div id="attachment_136" class="wp-caption aligncenter" style="width: 510px"><a href="http://www.metabrew.com/wp-content/uploads/2008/10/mochimem.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/10/mochimem.png" alt="Graph of memory usage with c10k, 1000msg/sec, 24hrs" title="mochimem" width="500" height="300" class="size-full wp-image-136" /></a><p class="wp-caption-text">Memory usage with c10k, 1000msg/sec, 24hrs</p></div>
<p>This graph shows the memory usage (with 10k active connections and 1000 msgs/sec) levels off at around 250MB over a 24 hour period. The two big drops, once near the start and once at the end of the test, are when I ran this in the mochiweb erlang process, just out of curiosity:</p>
<p><code>erl&gt; [erlang:garbage_collect(P) || P &lt;- erlang:processes()].</code></p>
<p>This forces all processes to garbage collect, and reclaimed around 100MB of memory &#8211; next up we investigate ways to save memory without resorting to manually forcing garbage collection.</p>
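<p>To see how much such a forced collection frees, the same call can be wrapped with a before-and-after reading. This is a minimal sketch: <code>gc_probe</code> is a hypothetical module name, and <code>erlang:memory(total)</code> reports the VM's own accounting in bytes, which will not match the RSS figure from <code>ps</code> exactly.</p>

```erlang
-module(gc_probe).
-export([collect_all/0]).

%% Hypothetical helper: force a garbage collection of every process
%% on this node and return {BytesBefore, BytesAfter} as reported by
%% erlang:memory(total).
collect_all() ->
    Before = erlang:memory(total),
    lists:foreach(fun(P) -> erlang:garbage_collect(P) end,
                  erlang:processes()),
    After = erlang:memory(total),
    {Before, After}.
```

<p>Running <code>gc_probe:collect_all().</code> in the mochiweb shell returns both readings, so the difference can be compared against the drops visible in the graph.</p>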
<h2>Reducing memory usage in mochiweb</h2>
<p>Seeing as the mochiweb app is just sending messages and then immediately forgetting them, the memory usage shouldn&#8217;t need to increase with the number of messages sent. </p>
<p>I&#8217;m a novice when it comes to Erlang memory management, but I&#8217;m going to assume that if I can force it to garbage collect more often, it will allow us to reclaim much of that memory, and ultimately let us serve more users with less overall system memory. We might burn a bit more CPU in the process, but that&#8217;s an acceptable trade-off.</p>
<p>Digging around in the <a href="http://erlang.org/doc/man/erlang.html">erlang docs</a> yields this option:</p>
<p><b><code>erlang:system_flag(fullsweep_after, Number)</code></b></p>
<blockquote><p>
    Number is a non-negative integer which indicates how many times generational garbage collections can be done without forcing a fullsweep collection. The value applies to new processes; processes already running are not affected.<br />
    In low-memory systems (especially without virtual memory), setting the value to 0 can help to conserve memory.<br />
    An alternative way to set this value is through the (operating system) environment variable ERL_FULLSWEEP_AFTER.</p></blockquote>
<p>Sounds intriguing, but it only applies to new processes and would affect all processes in the VM, not just our mochiweb processes.</p>
<p>Next up is this: </p>
<p><b><code>erlang:system_flag(min_heap_size, MinHeapSize)</code></b></p>
<blockquote><p>Sets the default minimum heap size for processes. The size is given in words. The new min_heap_size only affects processes spawned after the change of min_heap_size has been made. The min_heap_size can be set for individual processes by use of spawn_opt/N or process_flag/2. </p></blockquote>
<p>Could be useful, but I&#8217;m pretty sure our mochiweb processes need a bigger heap than the default value anyway. I&#8217;d like to avoid needing to patch the mochiweb source to add spawn options if possible. </p>
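<p>For reference, this is roughly what per-process GC options look like when you control the spawn yourself. It is only a sketch: the module name <code>spawn_opts_demo</code> and its trivial loop are made up for illustration, and mochiweb does not start its connection processes this way, which is why using it there would mean patching mochiweb:</p>
<pre>-module(spawn_opts_demo).
-export([start/0, loop/0]).

%% Spawn a process whose every collection is a fullsweep.
%% The option applies to this process only, not the whole VM.
start() -&gt;
    spawn_opt(?MODULE, loop, [], [{fullsweep_after, 0}]).

%% Hypothetical worker loop: ignore everything except stop.
loop() -&gt;
    receive
        stop -&gt; ok;
        _Other -&gt; loop()
    end.</pre>
<p>After <code>Pid = spawn_opts_demo:start().</code>, calling <code>process_info(Pid, garbage_collection).</code> lists the settings in effect for that one process.</p>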
<p>Next to catch my eye was this:</p>
<p><b><code>erlang:hibernate(Module, Function, Args)</code></b></p>
<blockquote><p>Puts the calling process into a wait state where its memory allocation has been reduced as much as possible, which is useful if the process does not expect to receive any messages in the near future. </p>
<p>The process will be awaken when a message is sent to it, and control will resume in Module:Function with the arguments given by Args with the call stack emptied, meaning that the process will terminate when that function returns. Thus erlang:hibernate/3 will never return to its caller.</p>
<p>If the process has any message in its message queue, the process will be awaken immediately in the same way as described above.</p>
<p>In more technical terms, what erlang:hibernate/3 does is the following. It discards the call stack for the process. Then it garbage collects the process. After the garbage collection, all live data is in one continuous heap. The heap is then shrunken to the exact same size as the live data which it holds (even if that size is less than the minimum heap size for the process).</p>
<p>If the size of the live data in the process is less than the minimum heap size, the first garbage collection occurring after the process has been awaken will ensure that the heap size is changed to a size not smaller than the minimum heap size.</p>
<p>Note that emptying the call stack means that any surrounding catch is removed and has to be re-inserted after hibernation. One effect of this is that processes started using proc_lib (also indirectly, such as gen_server processes), should use proc_lib:hibernate/3 instead to ensure that the exception handler continues to work when the process wakes up. </p></blockquote>
<p>This sounds reasonable &#8211; <b>let&#8217;s try hibernating after every message and see what happens</b>.</p>
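<p>Before touching mochiweb, the effect is easy to see in isolation. The following is a minimal sketch, not part of the test app: the module name <code>hib_demo</code> is invented, the 100ms sleep is a crude way of waiting for the process to hibernate, and it uses plain <code>erlang:hibernate/3</code> because the process is started with <code>spawn</code> rather than <code>proc_lib</code>. It reports a throwaway process's heap size (in words) before and after hibernation:</p>
<pre>-module(hib_demo).
-export([run/0, init/1, wait/0]).

%% Returns {HeapWordsBefore, HeapWordsAfter} for a throwaway process.
run() -&gt;
    Pid = spawn(?MODULE, init, [self()]),
    Before = receive {heap_before, W} -&gt; W end,
    timer:sleep(100),   % crude: give the process time to hibernate
    {total_heap_size, After} = erlang:process_info(Pid, total_heap_size),
    Pid ! stop,
    {Before, After}.

init(Parent) -&gt;
    _ = lists:seq(1, 100000),   % allocate, then drop, a large list
    {total_heap_size, W} = erlang:process_info(self(), total_heap_size),
    Parent ! {heap_before, W},
    erlang:hibernate(?MODULE, wait, []).

%% Entry point on wake-up; the call stack is empty at this point.
wait() -&gt;
    receive stop -&gt; ok end.</pre>
<p>Running <code>hib_demo:run().</code> should show the second number far below the first, because hibernation collects the dropped list and shrinks the heap to fit the live data.</p>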
<p>Edit <code>mochiconntest_web.erl</code> and change the following:</p>
<ul>
<li>Make the last line of the <code>feed(Response, Id, N)</code> function call hibernate instead of calling itself</li>
<li>Call hibernate immediately after logging into the router, rather than calling <code>feed</code> and blocking on receive</li>
<li>Remember to export <code>feed/3</code> so hibernate can call back into the function on wake-up</li>
</ul>
<p>Updated <code>mochiconntest_web.erl</code> with hibernation between messages:</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>mochiconntest_web<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">1</span>, stop/<span class="nu0">0</span>, loop/<span class="nu0">2</span>, feed/<span class="nu0">3</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">%% External API</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">DocRoot</span>, <span class="re0">Options1</span><span class="br0">&#125;</span> = get_option<span class="br0">&#40;</span>docroot, <span class="re0">Options</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Loop</span> = fun <span class="br0">&#40;</span><span class="re0">Req</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;?<span class="re0">MODULE</span>:<span class="me2">loop</span><span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% we&#8217;ll set our maximum to 1 million connections. (default: 2048)</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; mochiweb_http:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#123;</span>max, <span class="nu0">1000000</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>name, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>loop, <span class="re0">Loop</span><span class="br0">&#125;</span> | <span class="re0">Options1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">stop<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">mochiweb_http</span>:<span class="me2">stop</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">loop<span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="st0">&quot;/&quot;</span> ++ <span class="re0">Path</span> = <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>path<span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>method<span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Method</span> when <span class="re0">Method</span> =:= <span class="st0">&#39;GET&#39;</span>; <span class="re0">Method</span> =:= <span class="st0">&#39;HEAD&#39;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&quot;test/&quot;</span> ++ <span class="re0">IdStr</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span> = <span class="re0">Req</span>:<span class="me2">ok</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="st0">&quot;text/html; charset=utf-8&quot;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;Server&quot;</span>,<span class="st0">&quot;Mochiweb-Test&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; chunked<span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">Id</span>, _<span class="br0">&#125;</span> = string:<span class="me2">to_integer</span><span class="br0">&#40;</span><span class="re0">IdStr</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; router:<span class="me2">login</span><span class="br0">&#40;</span><span class="re0">Id</span>, self<span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% Hibernate this process until it receives a message:</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; proc_lib:<span class="me2">hibernate</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, feed, <span class="br0">&#91;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&#39;POST&#39;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">respond</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="nu0">501</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="re0">N</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>router_msg, <span class="re0">Msg</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Html</span> = io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Recvd msg #~w: &#39;~w&#39;&lt;br/&gt;&quot;</span>, <span class="br0">&#91;</span><span class="re0">N</span>, <span class="re0">Msg</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span>:<span class="me2">write_chunk</span><span class="br0">&#40;</span><span class="re0">Html</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% Hibernate this process until it receives a message:</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; proc_lib:<span class="me2">hibernate</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span>, feed, <span class="br0">&#91;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="re0">N</span><span class="nu0">+1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">%% Internal API</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">get_option<span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>proplists:<span class="me2">get_value</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span>, proplists:<span class="me2">delete</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span><span class="br0">&#125;</span>.</div>
</li>
</ol>
</div>
<p><br/></p>
<p>I made these changes, ran make to rebuild mochiweb, then redid the same c10k test (1000msgs/sec for 24hrs).</p>
<h2>Results after running for 24 hours w/ proc_lib:hibernate()</h2>
<div id="attachment_145" class="wp-caption alignnone" style="width: 510px"><a href="http://www.metabrew.com/wp-content/uploads/2008/10/mochimem5.png"><img src="http://www.metabrew.com/wp-content/uploads/2008/10/mochimem5.png" alt="Memory usage with c10k, 1000msg/sec, 24hrs, using hibernate()" title="mochimem5" width="500" height="300" class="size-full wp-image-145" /></a><p class="wp-caption-text">Memory usage with c10k, 1000msg/sec, 24hrs, using hibernate()</p></div>
<p>Judicious use of <code>hibernate</code> means the mochiweb application memory levels out at 78MB Resident with 10k connections, much better than the 450MB we saw in <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a>. There was no significant increase in CPU usage.</p>
<h2>Summary</h2>
<p>We made a comet application on Mochiweb that lets us push arbitrary messages to users identified by an integer ID. After pumping 1000 msgs/sec through it for 24 hours, <b>with 10,000 connected users, we observed it using 80MB, or 8KB per user</b>. We even made pretty graphs. </p>
<p>This is quite an improvement on the 45KB per user we saw in <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/">Part 1</a>. The savings come from making the application behave in a more realistic way, and from using <code>hibernate</code> for mochiweb processes between messages.</p>
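<p>The per-user figures are simple division, and the same arithmetic gives a first guess at what a million users might need. This is a rough sketch, assuming memory scales linearly with connection count (something the tests so far do not establish) and using decimal units to match the rounding above. The module and function names are made up for illustration:</p>
<pre>-module(mem_estimate).
-export([per_user_kb/2, projected_gb/2]).

%% Total memory in MB divided across Users, in KB (1MB taken as 1000KB).
per_user_kb(TotalMB, Users) -&gt;
    TotalMB * 1000 / Users.

%% Linear extrapolation: per-user KB times Users, in GB (1GB taken as 1000000KB).
projected_gb(PerUserKB, Users) -&gt;
    PerUserKB * Users / 1000000.</pre>
<p>With the numbers above, <code>mem_estimate:per_user_kb(80, 10000)</code> gives 8.0 and <code>mem_estimate:per_user_kb(450, 10000)</code> gives 45.0. On the linear assumption, <code>mem_estimate:projected_gb(8.0, 1000000)</code> gives 8.0, so a million connections would want something in the region of 8GB.</p>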
<h2>Next Steps</h2>
<p>In <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3/">Part 3</a>, I&#8217;ll <b>turn it up to 1 million</b> connected clients. I will be deploying the test app on a multi-CPU 64-bit server with plenty of RAM. This will show what difference, if any, running on a 64-bit VM makes. I&#8217;ll also detail some additional tricks and tuning needed in order to simulate 1 million client connections. </p>
<p>The application will evolve into a sort of pub-sub system, where subscriptions are associated with user IDs and stored by the app, rather than provided by clients when they connect. We&#8217;ll load in a typical social-network dataset: friends. This will allow a user to log in with their user ID and automatically receive any event generated by one of their friends. </p>
<p><strong>UPDATED:</strong> <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3/">Part 3</a> is now online.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2//feed</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>A Million-user Comet Application with Mochiweb, Part 1</title>
		<link>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/</link>
		<comments>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1/#comments</comments>
		<pubDate>Wed, 15 Oct 2008 17:39:07 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[programming]]></category>
		<category><![CDATA[comet]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[http]]></category>
		<category><![CDATA[kernel]]></category>
		<category><![CDATA[mochiweb]]></category>
		<category><![CDATA[networking]]></category>
		<category><![CDATA[tcp]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=63</guid>
		<description><![CDATA[In this series I will detail what I found out empirically about how mochiweb performs with lots of open connections, and show how to build a comet application using mochiweb, where each mochiweb connection is registered with a router which dispatches messages to various users. We end up with a working application that can cope [...]]]></description>
			<content:encoded><![CDATA[<p>In this series I will detail what I found out empirically about how mochiweb performs with lots of open connections, and show how to build a comet application using mochiweb, where each mochiweb connection is registered with a router which dispatches messages to various users. We end up with a working application that can cope with a million concurrent connections and, crucially, we find out how much RAM we need to make it work. </p>
<p>In part one:</p>
<ul>
<li>Build a basic comet mochiweb app that sends clients a message every 10 seconds.</li>
<li>Tune the Linux kernel to handle lots of TCP connections.</li>
<li>Build a flood-testing tool to open lots of connections (ye olde C10k test).</li>
<li>Examine how much memory this requires per connection.</li>
</ul>
<p>Future posts in this series will cover how to build a real message routing system, additional tricks to reduce memory usage, and more testing with 100k and 1m concurrent connections. </p>
<p>I assume you know your way around the Linux command line, and know a bit of Erlang.</p>
<h2>Building a Mochiweb test application</h2>
<p>In brief:</p>
<ol>
<li>Install and build Mochiweb</li>
<li>Run: <code>/your-mochiweb-path/scripts/new_mochiweb.erl mochiconntest</code></li>
<li><code>cd mochiconntest</code> and edit <code>src/mochiconntest_web.erl</code></li>
</ol>
<p>This code (mochiconntest_web.erl) just accepts connections and uses chunked transfer to send an initial welcome message, and one message every 10 seconds to every client.</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>mochiconntest_web<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">1</span>, stop/<span class="nu0">0</span>, loop/<span class="nu0">2</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">%% External API</span></div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">DocRoot</span>, <span class="re0">Options1</span><span class="br0">&#125;</span> = get_option<span class="br0">&#40;</span>docroot, <span class="re0">Options</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">Loop</span> = fun <span class="br0">&#40;</span><span class="re0">Req</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;?<span class="re0">MODULE</span>:<span class="me2">loop</span><span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="co1">% we&#8217;ll set our maximum to 1 million connections. (default: 2048)</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; mochiweb_http:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#91;</span><span class="br0">&#123;</span>max, <span class="nu0">1000000</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>name, ?<span class="re0">MODULE</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>loop, <span class="re0">Loop</span><span class="br0">&#125;</span> | <span class="re0">Options1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">stop<span class="br0">&#40;</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">mochiweb_http</span>:<span class="me2">stop</span><span class="br0">&#40;</span>?<span class="re0">MODULE</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2">loop<span class="br0">&#40;</span><span class="re0">Req</span>, <span class="re0">DocRoot</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="st0">&quot;/&quot;</span> ++ <span class="re0">Path</span> = <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>path<span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Req</span>:<span class="me2">get</span><span class="br0">&#40;</span>method<span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Method</span> when <span class="re0">Method</span> =:= <span class="st0">&#39;GET&#39;</span>; <span class="re0">Method</span> =:= <span class="st0">&#39;HEAD&#39;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&quot;test/&quot;</span> ++ <span class="re0">Id</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span> = <span class="re0">Req</span>:<span class="me2">ok</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="st0">&quot;text/html; charset=utf-8&quot;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span><span class="br0">&#123;</span><span class="st0">&quot;Server&quot;</span>,<span class="st0">&quot;Mochiweb-Test&quot;</span><span class="br0">&#125;</span><span class="br0">&#93;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; chunked<span class="br0">&#125;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span>:<span class="me2">write_chunk</span><span class="br0">&#40;</span><span class="st0">&quot;Mochiconntest welcomes you! Your Id: &quot;</span> ++ <span class="re0">Id</span> ++ <span class="st0">&quot;<span class="es0">\n</span>&quot;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">%% router:login(list_to_atom(Id), self()),</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Id</span>, <span class="nu0">1</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="st0">&#39;POST&#39;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">case</span> <span class="re0">Path</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">not_found</span><span class="br0">&#40;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; _ -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Req</span>:<span class="me2">respond</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="nu0">501</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Path</span>, <span class="re0">N</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">%{router_msg, Msg} -&gt;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% &nbsp; &nbsp;Html = io_lib:format(&quot;Recvd msg #~w: &#39;~s&#39;&lt;br/&gt;&quot;, [N, Msg]),</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="co1">% &nbsp; &nbsp;Response:write_chunk(Html);</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">after</span> <span class="nu0">10000</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Msg</span> = io_lib:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Chunk ~w for id ~s<span class="es0">\n</span>&quot;</span>, <span class="br0">&#91;</span><span class="re0">N</span>, <span class="re0">Path</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Response</span>:<span class="me2">write_chunk</span><span class="br0">&#40;</span><span class="re0">Msg</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; feed<span class="br0">&#40;</span><span class="re0">Response</span>, <span class="re0">Path</span>, <span class="re0">N</span><span class="nu0">+1</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li2">
<div class="de2"><span class="co1">%% Internal API</span></div>
</li>
<li class="li1">
<div class="de1">get_option<span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>proplists:<span class="me2">get_value</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span>, proplists:<span class="me2">delete</span><span class="br0">&#40;</span><span class="re0">Option</span>, <span class="re0">Options</span><span class="br0">&#41;</span><span class="br0">&#125;</span>.</div>
</li>
</ol>
</div>
<p><br/></p>
<h2>Start your mochiweb app</h2>
<p><code>make &#038;&#038; ./start-dev.sh</code><br />
By default mochiweb listens on port 8000, on all interfaces. If you are doing this on the desktop, you can test with any web browser. Just navigate to <a href="http://localhost:8000/test/foo">http://localhost:8000/test/foo</a>.<br/><br />
Here&#8217;s the command-line test:</p>
<pre>$ lynx --source "http://localhost:8000/test/foo"
Mochiconntest welcomes you! Your Id: foo&lt;br/&gt;
Chunk 1 for id foo&lt;br/&gt;
Chunk 2 for id foo&lt;br/&gt;
Chunk 3 for id foo&lt;br/&gt;
^C</pre>
<p>Yep, it works. Now let&#8217;s make it suffer.</p>
<h2>Tuning the Linux Kernel for many tcp connections</h2>
<p>Save yourself some time and tune the kernel tcp settings before testing with lots of connections, or your test will fail and you&#8217;ll see lots of <code>Out of socket memory</code> messages (and if you are masquerading, <code>nf_conntrack: table full, dropping packet.</code>)</p>
<p>Here are the sysctl settings I ended up with &#8211; YMMV, but these will probably do:</p>
<pre># General gigabit tuning:
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.ipv4.tcp_rmem = 4096 87380 16777216
net.ipv4.tcp_wmem = 4096 65536 16777216
net.ipv4.tcp_syncookies = 1
# this gives the kernel more memory for tcp
# which you need with many (100k+) open socket connections
net.ipv4.tcp_mem = 50576   64768   98152
net.core.netdev_max_backlog = 2500
# I was also masquerading the port comet was on, you might not need this
net.ipv4.netfilter.ip_conntrack_max = 1048576</pre>
<p>Put these in <code>/etc/sysctl.conf</code> then run <code>sysctl -p</code> to apply them. No need to reboot; your kernel should now be able to handle a lot more open connections, yay.</p>
<h2>Creating a lot of connections</h2>
<p>There are many ways to do this. <a href="http://tsung.erlang-projects.org/">Tsung</a> is quite sexy, and there are plenty of other less-sexy ways to spam an httpd with lots of requests (ab, httperf, httpload etc). None of them are ideally suited for testing a comet application, and I&#8217;d been looking for an excuse to try the Erlang http client, so I wrote a basic test to make lots of connections.<br />
Just because you can, doesn&#8217;t mean you should.. one process per connection would definitely be a waste here. I&#8217;m using one process to load urls from a file, and another process to establish and receive messages from all http connections (and one process as a timer to print a report every 10 seconds). All data received from the server is discarded, but it does increment a counter so we can keep track of how many HTTP chunks were delivered.<br />
<br/><br />
floodtest.erl</p>
<div class="dean_ch" style="white-space: wrap;">
<ol>
<li class="li1">
<div class="de1">-<span class="kw2">module</span><span class="br0">&#40;</span>floodtest<span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">-<span class="kw2">export</span><span class="br0">&#40;</span><span class="br0">&#91;</span>start/<span class="nu0">2</span>, timer/<span class="nu0">2</span>, recv/<span class="nu0">1</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">start<span class="br0">&#40;</span><span class="re0">Filename</span>, <span class="re0">Wait</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="me1">inets</span>:<span class="me2">start</span><span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; spawn<span class="br0">&#40;</span>?<span class="re0">MODULE</span>, timer, <span class="br0">&#91;</span><span class="nu0">10000</span>, self<span class="br0">&#40;</span><span class="br0">&#41;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="re0">This</span> = self<span class="br0">&#40;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; spawn<span class="br0">&#40;</span>fun<span class="br0">&#40;</span><span class="br0">&#41;</span>-&gt; <span class="me1">loadurls</span><span class="br0">&#40;</span><span class="re0">Filename</span>, fun<span class="br0">&#40;</span><span class="re0">U</span><span class="br0">&#41;</span>-&gt; <span class="re0">This</span> ! <span class="br0">&#123;</span>loadurl, <span class="re0">U</span><span class="br0">&#125;</span> <span class="kw1">end</span>, <span class="re0">Wait</span><span class="br0">&#41;</span> <span class="kw1">end</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; recv<span class="br0">&#40;</span><span class="br0">&#123;</span><span class="nu0">0</span>,<span class="nu0">0</span>,<span class="nu0">0</span><span class="br0">&#125;</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">recv<span class="br0">&#40;</span><span class="re0">Stats</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span><span class="re0">Active</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span><span class="br0">&#125;</span> = <span class="re0">Stats</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>stats<span class="br0">&#125;</span> -&gt; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Stats: ~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Stats</span><span class="br0">&#93;</span><span class="br0">&#41;</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">after</span> <span class="nu0">0</span> -&gt; <span class="me1">noop</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>http,<span class="br0">&#123;</span>_<span class="re0">Ref</span>,stream_start,_<span class="re0">X</span><span class="br0">&#125;</span><span class="br0">&#125;</span> -&gt; &nbsp;<span class="me1">recv</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Active</span><span class="nu0">+1</span>,<span class="re0">Closed</span>,<span class="re0">Chunks</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>http,<span class="br0">&#123;</span>_<span class="re0">Ref</span>,stream,_<span class="re0">X</span><span class="br0">&#125;</span><span class="br0">&#125;</span> -&gt; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp;<span class="me1">recv</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Active</span>, <span class="re0">Closed</span>, <span class="re0">Chunks</span><span class="nu0">+1</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>http,<span class="br0">&#123;</span>_<span class="re0">Ref</span>,stream_end,_<span class="re0">X</span><span class="br0">&#125;</span><span class="br0">&#125;</span> -&gt; &nbsp;<span class="me1">recv</span><span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Active</span><span class="nu0">-1</span>, <span class="re0">Closed</span><span class="nu0">+1</span>, <span class="re0">Chunks</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>http,<span class="br0">&#123;</span>_<span class="re0">Ref</span>,<span class="br0">&#123;</span>error,<span class="re0">Why</span><span class="br0">&#125;</span><span class="br0">&#125;</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">io</span>:<span class="kw3">format</span><span class="br0">&#40;</span><span class="st0">&quot;Closed: ~w<span class="es0">\n</span>&quot;</span>,<span class="br0">&#91;</span><span class="re0">Why</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; recv<span class="br0">&#40;</span><span class="br0">&#123;</span><span class="re0">Active</span><span class="nu0">-1</span>, <span class="re0">Closed</span><span class="nu0">+1</span>, <span class="re0">Chunks</span><span class="br0">&#125;</span><span class="br0">&#41;</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#123;</span>loadurl, <span class="re0">Url</span><span class="br0">&#125;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">http</span>:<span class="me2">request</span><span class="br0">&#40;</span>get, <span class="br0">&#123;</span><span class="re0">Url</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#125;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#123;</span>sync, false<span class="br0">&#125;</span>, <span class="br0">&#123;</span>stream, self<span class="br0">&#125;</span>, <span class="br0">&#123;</span>version, <span class="nu0">1.1</span><span class="br0">&#125;</span>, <span class="br0">&#123;</span>body_format, binary<span class="br0">&#125;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; recv<span class="br0">&#40;</span><span class="re0">Stats</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">timer<span class="br0">&#40;</span><span class="re0">T</span>, <span class="re0">Who</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">after</span> <span class="re0">T</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Who</span> ! <span class="br0">&#123;</span>stats<span class="br0">&#125;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; timer<span class="br0">&#40;</span><span class="re0">T</span>, <span class="re0">Who</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1"><span class="co1">% Read lines from a file with a specified delay between lines:</span></div>
</li>
<li class="li1">
<div class="de1">for_each_line_in_file<span class="br0">&#40;</span><span class="re0">Name</span>, <span class="re0">Proc</span>, <span class="re0">Mode</span>, <span class="re0">Accum0</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="br0">&#123;</span>ok, <span class="re0">Device</span><span class="br0">&#125;</span> = file:<span class="me2">open</span><span class="br0">&#40;</span><span class="re0">Name</span>, <span class="re0">Mode</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">Accum0</span><span class="br0">&#41;</span>.</div>
</li>
<li class="li2">
<div class="de2">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">Accum</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">case</span> io:<span class="me2">get_line</span><span class="br0">&#40;</span><span class="re0">Device</span>, <span class="st0">&quot;&quot;</span><span class="br0">&#41;</span> <span class="kw1">of</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; eof &nbsp;-&gt; <span class="me1">file</span>:<span class="kw3">close</span><span class="br0">&#40;</span><span class="re0">Device</span><span class="br0">&#41;</span>, <span class="re0">Accum</span>;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Line</span> -&gt; <span class="re0">NewAccum</span> = <span class="re0">Proc</span><span class="br0">&#40;</span><span class="re0">Line</span>, <span class="re0">Accum</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; for_each_line<span class="br0">&#40;</span><span class="re0">Device</span>, <span class="re0">Proc</span>, <span class="re0">NewAccum</span><span class="br0">&#41;</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="kw1">end</span>.</div>
</li>
<li class="li1">
<div class="de1">&nbsp;</div>
</li>
<li class="li1">
<div class="de1">loadurls<span class="br0">&#40;</span><span class="re0">Filename</span>, <span class="re0">Callback</span>, <span class="re0">Wait</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; <span class="me1">for_each_line_in_file</span><span class="br0">&#40;</span><span class="re0">Filename</span>,</div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; fun<span class="br0">&#40;</span><span class="re0">Line</span>, <span class="re0">List</span><span class="br0">&#41;</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">Callback</span><span class="br0">&#40;</span>string:<span class="me2">strip</span><span class="br0">&#40;</span><span class="re0">Line</span>, right, $\n<span class="br0">&#41;</span><span class="br0">&#41;</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">receive</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">after</span> <span class="re0">Wait</span> -&gt;</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="me1">noop</span></div>
</li>
<li class="li2">
<div class="de2">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; <span class="re0">List</span></div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="kw1">end</span>,</div>
</li>
<li class="li1">
<div class="de1">&nbsp; &nbsp; &nbsp; &nbsp; <span class="br0">&#91;</span>read<span class="br0">&#93;</span>, <span class="br0">&#91;</span><span class="br0">&#93;</span><span class="br0">&#41;</span>.</div>
</li>
</ol>
</div>
<p><br/><br />
Each connection we make requires an ephemeral port, and thus a file descriptor, and by default a process is limited to 1024 open file descriptors. To avoid the <code>Too many open files</code> problem you&#8217;ll need to raise the ulimit for your shell. This can be changed in <code>/etc/security/limits.conf</code>, but requires a logout/login. For now you can just sudo and modify the current shell (su back to your non-priv&#8217;ed user after calling ulimit if you don&#8217;t want to run as root):</p>
<pre>$ sudo bash
# ulimit -n 999999
# erl</pre>
<p>You might as well increase the ephemeral port range to the maximum too:<br />
<code># echo "1024    65535" &gt; /proc/sys/net/ipv4/ip_local_port_range</code></p>
<p>Generate a file of URLs to feed to the floodtest program:<br />
<code>( for i in `seq 1 10000`; do echo "http://localhost:8000/test/$i" ; done ) &gt; /tmp/mochi-urls.txt </code></p>
<p>From the erlang prompt you can now compile and launch <code>floodtest.erl</code>:<br />
<code>erl&gt; c(floodtest).<br />
erl&gt; floodtest:start("/tmp/mochi-urls.txt", 100).</code></p>
<p>This will establish 10 new connections per second (ie, 1 connection every 100ms).</p>
<p>It will output stats in the form <code>{Active, Closed, Chunks}</code> where Active is the number of connections currently established, Closed is the number that were terminated for some reason, and Chunks is the number of chunks served by chunked transfer from mochiweb. Closed should stay at 0, and Chunks should be more than Active, because each active connection will receive multiple chunks (1 every 10 seconds).<br />
<br/><br />
 <b>The resident size of the mochiweb beam process with 10,000 active connections was 450MB &#8211; that&#8217;s 45KB per connection</b>. CPU utilization on the machine was practically nothing, as expected.<br />
<br/></p>
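<p>The <code>{Active, Closed, Chunks}</code> bookkeeping done inside <code>recv/1</code> can be pulled out into a pure function, which makes it easy to check by hand. A minimal sketch, assuming the same four events that floodtest matches on; the module name <code>flood_stats</code> is hypothetical.</p>

```erlang
-module(flood_stats).
-export([new/0, update/2]).

%% Stats are {Active, Closed, Chunks}, mirroring the tuple in floodtest:recv/1.
new() -> {0, 0, 0}.

%% One clause per message kind handled in recv/1.
update(stream_start,  {A, C, K}) -> {A + 1, C, K};      % connection established
update(stream,        {A, C, K}) -> {A, C, K + 1};      % one chunk received
update(stream_end,    {A, C, K}) -> {A - 1, C + 1, K};  % server closed the stream
update({error, _Why}, {A, C, K}) -> {A - 1, C + 1, K}.  % connection failed
```

<p>For example, <code>lists:foldl(fun flood_stats:update/2, flood_stats:new(), [stream_start, stream, stream, stream_end])</code> gives <code>{0,1,2}</code>. As in floodtest itself, an error that arrives before <code>stream_start</code> still decrements Active, so the count can dip below zero if connections fail early.</p>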
<h2>Assessment so far</h2>
<p>That was a reasonable first attempt. 45KB per-connection seems a bit high &#8211; I could probably cook something up in C using libevent that could do this with closer to 4.5KB per connection (just a guess, if anyone has experience please leave a comment). If you factor in the amount of code and time it took to do this in Erlang compared with C, I think the increased memory usage is more excusable.<br />
<br/><br />
In future posts I&#8217;ll cover building a message router (so we can uncomment lines 25 and 41-43 in <code>mochiconntest_web.erl</code>) and talk about some ways to reduce the overall memory usage. I&#8217;ll also share the results of testing with 100k and 1M connections.</p>
<p><b>UPDATED:</b> <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-2/">Part 2</a> and <a href="http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-3/">Part 3</a> are online now.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/a-million-user-comet-application-with-mochiweb-part-1//feed</wfw:commentRss>
		<slash:comments>46</slash:comments>
		</item>
		<item>
		<title>On bulk loading data into Mnesia</title>
		<link>http://www.metabrew.com/article/on-bulk-loading-data-into-mnesia/</link>
		<comments>http://www.metabrew.com/article/on-bulk-loading-data-into-mnesia/#comments</comments>
		<pubDate>Mon, 13 Oct 2008 20:08:49 +0000</pubDate>
		<dc:creator>RJ</dc:creator>
				<category><![CDATA[hacks]]></category>
		<category><![CDATA[programming]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[mnesia]]></category>

		<guid isPermaLink="false">http://www.metabrew.com/?p=82</guid>
		<description><![CDATA[Consider this a work-in-progress; I will update this post if I find a &#8216;better&#8217; way to do fast bulk loading The time has come to replace my ets-based storage backend with something non-volatile. I considered a dets/ets hybrid, but I really need this to be replicated to at least a second node for HA / [...]]]></description>
			<content:encoded><![CDATA[<p><i>Consider this a work-in-progress; I will update this post if I find a &#8216;better&#8217; way to do fast bulk loading</i></p>
<p>The time has come to replace my ets-based storage backend with something non-volatile. I considered a dets/ets hybrid, but I really need this to be replicated to at least a second node for HA / failover. Mnesia beckoned. </p>
<p>The problem:</p>
<ul>
<li>15 million [fairly simple] records</li>
<li>1 Mnesia table: bag, disc_copies, just 1 node, 1 additional index</li>
<li>Hardware is a quad-core 2GHz CPU, 16GB Ram, 8x 74Gig 15k rpm scsi disks in RAID-6</li>
<li>Takes ages* to load and spews a load of &#8220;Mnesia is overloaded&#8221; warnings</li>
</ul>
<p><b>* My definition of &#8216;takes ages&#8217;:</b> Much longer than PostgreSQL <code>\copy</code> or MySQL <code>LOAD DATA INFILE</code></p>
<p>At this point all I want is a quick way to bulk-load some data into a disc_copies table on a single node, so I can get on with running some tests.</p>
<p>Here is the table creation code:<br />
<code>    mnesia:create_table(subscription,<br />
            [<br />
                {disc_copies, [node()]},<br />
                {attributes, record_info(fields, subscription)},<br />
                {index, [subscribee]}, %index subscribee too<br />
                {type, bag}<br />
            ]<br />
            )</code><br />
The <code>subscription</code> record is fairly simple:<br />
<code>{subscription, subscriber={resource, user, 123}, subscribee={resource, artist, 456}}</code></p>
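<p>The record definition itself isn&#8217;t shown here; going by the field names it would presumably look like the sketch below. The module name <code>sub_records</code> and the helpers <code>make/2</code> and <code>sample/1</code> are hypothetical, added only to make some synthetic test data.</p>

```erlang
-module(sub_records).
-export([make/2, sample/1]).

%% Assumed definition: the field names come from the post, the -record line does not.
-record(subscription, {subscriber, subscribee}).

%% Build one subscription: user UserId subscribes to artist ArtistId.
make(UserId, ArtistId) ->
    #subscription{subscriber = {resource, user, UserId},
                  subscribee = {resource, artist, ArtistId}}.

%% N synthetic records (user I subscribes to artist I), handy for load tests.
sample(N) ->
    [make(I, I) || I <- lists:seq(1, N)].
```

<p><code>sub_records:make(123, 456)</code> yields the tuple <code>{subscription,{resource,user,123},{resource,artist,456}}</code>, which is what mnesia stores.</p>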
<p>I&#8217;m starting erlang like so:<br />
<code>erl +A 128 -mnesia dir '"/home/erlang/mnesia_dir"' -boot start_sasl</code></p>
<p>The interesting thing there is really the <code>+A 128</code> &#8211; this sets the size of the async thread pool to 128, which spreads the cpu load better between the 4 cores.</p>
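<p>To confirm the flag actually took effect, you can ask the running VM for its async thread pool size. A tiny sketch; the module name <code>async_check</code> is made up, and the same call works directly at the erl prompt.</p>

```erlang
-module(async_check).
-export([pool_size/0]).

%% Returns the number of async threads the VM was started with (the +A value).
pool_size() ->
    erlang:system_info(thread_pool_size).
```

<p>With the command line above, <code>async_check:pool_size()</code> should report 128.</p>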
<h3>Attempt 0) &#8216;by the book&#8217; one transaction to rule them all</h3>
<p>Something like this:<br />
<code>mnesia:transaction(fun()-> [ mnesia:write(S) || S <- Subs ] end)</code></p>
<p>Time taken: <b>Too long, I gave up after 12 hours</b><br />
Number of &#8220;Mnesia overloaded&#8221; warnings: <b>lots</b><br />
Conclusion: <b>Must be a better way</b><br />
TODO: actually run this test and time it.</p>
<h3>Attempt 1) dirty_write</h3>
<p>There isn&#8217;t really any need to do this in a transaction, so I tried dirty_write.<br />
<code>[ mnesia:dirty_write(S) || S <- Subs ]</code></p>
<p>And here&#8217;s the warning in full:<br />
<code>=ERROR REPORT==== 13-Oct-2008::16:53:57 ===<br />
Mnesia('mynode@myhost'): ** WARNING ** Mnesia is overloaded: {dump_log,<br />
                                                                       write_threshold}</code></p>
<p>Time taken: <b>890 secs</b><br />
Number of &#8220;Mnesia overloaded&#8221; warnings: <b>lots</b><br />
Conclusion: <b>Workable, but nothing to boast about. Those warnings are annoying</b></p>
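<p>How the load was timed isn&#8217;t spelled out here. One simple way to get comparable numbers is to wrap each attempt in <code>timer:tc/1</code>. The sketch below is an assumption about method, not the harness used for the figures in this post; <code>bulk_timer</code> is a hypothetical module name.</p>

```erlang
-module(bulk_timer).
-export([timed/1]).

%% Run a zero-arity fun and return {ElapsedSeconds, Result}.
timed(Fun) when is_function(Fun, 0) ->
    %% timer:tc/1 reports elapsed wall-clock time in microseconds.
    {Micros, Result} = timer:tc(Fun),
    {Micros / 1000000, Result}.
```

<p>Usage, with <code>Subs</code> bound to the list of records: <code>bulk_timer:timed(fun() -> lists:foreach(fun mnesia:dirty_write/1, Subs) end).</code> Using <code>lists:foreach/2</code> rather than a list comprehension avoids building a 15-million-element list of <code>ok</code> atoms as the return value.</p>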
<h3>Attempt 2) dirty_write, defer index creation</h3>
<p>A common trick with traditional RDBMS would be to bulk load the data into the table and add the indexes afterwards. In some scenarios you can avoid costly incremental index update operations. If you are doing this in one gigantic transaction it shouldn&#8217;t matter, and I&#8217;m not really sure how mnesia works under the hood (something I plan to rectify if I end up using it for real).<br />
I tried a similar approach by commenting out the <code>{index, [subscribee]}</code> line above, doing the load, then using <code>mnesia:add_table_index(subscription, subscribee)</code> afterwards to add the index once all the data was loaded. Note that mnesia was still building the primary index on the fly, but that can&#8217;t be helped.<br />
Time taken: <b>883 secs</b> (679s load + 204s index creation)<br />
Number of &#8220;Mnesia overloaded&#8221; warnings: <b>lots</b><br />
Conclusion: <b>Insignificant, meh</b></p>
<h3>Attempt 3) mnesia:ets() trickery</h3>
<p>This is slightly perverted, but I tried it because I was suspicious that incrementally updating the on-disk data wasn&#8217;t especially optimal. The idea is to make a ram_copies table and use the mnesia:ets() function to write directly to the ets table (doesn&#8217;t get much faster than ets). The table can then be converted to disc_copies. There are caveats &#8211; to quote The Fine Manual:</p>
<blockquote><p>Call the Fun in a raw context which is not protected by a transaction. The Mnesia function call is performed in the Fun are performed directly on the local ets tables on the assumption that the local storage type is ram_copies and the tables are not replicated to other nodes. Subscriptions are not triggered and checkpoints are not updated, but it is extremely fast.</p></blockquote>
<p>I can live with that. I don&#8217;t mind if replication takes a while to set up when I put this into production &#8211; I&#8217;ll gladly take any optimisations I can get at this stage (testing/development).</p>
<p>Loading a list of <code>subscriptions</code> looks like this:<br />
<code>mnesia:ets(fun()-> [mnesia:dirty_write(S) || S <- Subs] end).</code><br />
And to convert this into disc_copies once data is loaded in:<br />
<code>mnesia:change_table_copy_type(subscription, node(), disc_copies).</code></p>
<p>Time taken: <b>745 secs</b> (699s load + 46s convert to disc_copies)<br />
Number of &#8220;Mnesia overloaded&#8221; warnings: <b>none!</b><br />
Conclusion: <b>Fastest yet, bit hacky</b></p>
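<p>Put together, the whole sequence for this attempt might look like the sketch below. It is not the exact code that produced the timings above, and it rests on several assumptions. The module name <code>ets_load</code> and the record definition are hypothetical. It calls <code>mnesia:write/1</code> rather than <code>dirty_write</code>, on the assumption that only the context-aware functions are redirected to ets inside <code>mnesia:ets/1</code>. It also adds the secondary index after the load, on the assumption that raw ets inserts do not maintain it.</p>

```erlang
-module(ets_load).
-export([load/1]).

%% Assumed record definition (field names taken from the post).
-record(subscription, {subscriber, subscribee}).

%% Assumes mnesia is already running with a disc-based schema on this node,
%% i.e. mnesia:create_schema([node()]) was called before mnesia:start().
%% Without that, the final conversion to disc_copies will fail.
load(Subs) ->
    %% 1. Create the table in RAM only, without the secondary index.
    {atomic, ok} = mnesia:create_table(subscription,
        [{ram_copies, [node()]},
         {attributes, record_info(fields, subscription)},
         {type, bag}]),
    %% 2. Write every record in the raw ets context (no transaction, no log).
    ok = mnesia:ets(fun() -> lists:foreach(fun mnesia:write/1, Subs) end),
    %% 3. Build the subscribee index from the loaded data.
    {atomic, ok} = mnesia:add_table_index(subscription, subscribee),
    %% 4. Convert the table to disc_copies so it survives a restart.
    {atomic, ok} = mnesia:change_table_copy_type(subscription, node(), disc_copies),
    ok.
```

<p>Each step matches on <code>{atomic, ok}</code>, so a failure (for example a schema that only lives in RAM) crashes straight away instead of leaving a half-converted table.</p>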
<h2>Summary</h2>
<p>At least the ets() trick doesn&#8217;t spew a million warnings. I also need to examine the output of <code>mnesia:dump_to_textfile</code> and see if loading data from that format is any faster.</p>
<p>TODO:</p>
<ul>
<li>Examine / test using the dump_to_textfile method</li>
<li>Run full transactional load and time it</li>
<li>Try similar thing with PostgreSQL</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.metabrew.com/article/on-bulk-loading-data-into-mnesia//feed</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
