<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>nick@kavassalis.com&#039;s blog &#187; internet</title>
	<atom:link href="http://kavassalis.com/tag/internet/feed/" rel="self" type="application/rss+xml" />
	<link>http://kavassalis.com</link>
	<description>My rantings about code, carriers, cars, and cameras</description>
	<lastBuildDate>Thu, 09 Sep 2010 23:21:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Of Nick and domain spammers</title>
		<link>http://kavassalis.com/2010/05/of-nick-and-domain-spammers/</link>
		<comments>http://kavassalis.com/2010/05/of-nick-and-domain-spammers/#comments</comments>
		<pubDate>Fri, 28 May 2010 13:30:59 +0000</pubDate>
		<dc:creator>nick</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[email]]></category>
		<category><![CDATA[hosting]]></category>
		<category><![CDATA[marketing]]></category>
		<category><![CDATA[social media]]></category>
		<category><![CDATA[spam]]></category>

		<guid isPermaLink="false">http://kavassalis.com/?p=103</guid>
		<description><![CDATA[As far as I&#8217;m concerned, there is nothing worse than those who crawl through domain whois records for the purpose of emailing me. I get a lot of it. Funny thing is, it&#8217;s strictly forbidden in ICANN&#8217;s (the sole registrar accreditation body) rules. ICANN&#8217;s Registrar Accreditation Agreement section 3.3.6.3 states: &#8220;Registrar&#8217;s access agreement shall require the [...]]]></description>
			<content:encoded><![CDATA[<p>As far as I&#8217;m concerned, there is nothing worse than those who crawl through domain whois records for the purpose of emailing me. I get a lot of it. Funny thing is, it&#8217;s strictly forbidden in ICANN&#8217;s (the sole registrar accreditation body) rules. ICANN&#8217;s <a href="http://www.icann.org/en/registrars/ra-agreement-21may09-en.htm#3" target="_blank">Registrar Accreditation Agreement section 3.3.6.3</a> states:</p>
<blockquote><p>&#8220;Registrar&#8217;s access agreement shall require the third party to agree not to use the data to allow, enable, or otherwise support any marketing activities, regardless of the medium used. Such media include but are not limited to e-mail, telephone, facsimile, postal mail, SMS, and wireless alerts.&#8221;</p></blockquote>
<p><a href="http://www.icann.org/en/gnso/whois-tf/report-19feb03.htm#II" target="_blank">In fact this has been on the books since 2003.</a> Yet on a daily basis I receive whois-crawled spam. Now yes, you can definitely whois any domain and readily spam the owner. But these people are not just spamming one or two domains, they&#8217;re spamming thousands, if not tens of thousands at a time. (As with all spam, you&#8217;ve got to fish a lot to catch anything). Access to this requires what ICANN calls &#8216;bulk whois&#8217;, which pretty much any registrar with an API will provide you. However it is up to them to prevent people from using this as a means to spam people. While I realize it is not trivial to track the spam back to the registrar allowing the mass harvest, it&#8217;s not like there is an infinite number of registrars. Tracking down the people providing this information to spammers would not be impossible. Considering ICANN is pretty much useless for everything else (it took them how long to stop turning a blind eye to domain tasting? oh right, 10 years), they could at least enforce this policy, track down offending registrars and remove their accreditation. (I kid, ICANN will never do this, their rules are pretty much toothless, and this article is really just to ridicule spammers who take themselves seriously)</p>
<p>Now, whois-crawled spam is a bit different than your usual spam. No viagra, OEM software or luxury watches. No, whois spammers usual take themselves a lot more seriously, which makes it all the funnier because they&#8217;re just as pathetic as the guy selling Chinese V1ag4ra. Let&#8217;s look at a few of  the types of assholes who take part in this practice. (and some of my favourite examples)</p>
<p><em><strong>Web hosts offering quality web hosting at low low prices:<br />
</strong></em><a href="http://kavassalis.com/wp-content/uploads/2010/05/hosting-spam.jpg" target="_blank"><img class="alignnone size-medium wp-image-105" title="hosting-spam" src="http://kavassalis.com/wp-content/uploads/2010/05/hosting-spam-300x167.jpg" alt="" width="300" height="167" /><br />
</a>(Click to expand)</p>
<p>The idea behind this type of domain spammer is pretty simple. You own a domain name, everyone who owns a domain name has some sort of hosting for it, everyone likes cheaper hosting. Amusingly sales@ and abuse@ evul.net get *tons* of it, which is usually very poorly targeted (and ironic when it goes to abuse@). However, the spam pictured above was actually <em>well targeted</em>, because its offering local large scale hosting, the kind that evul.net might want. However there in lies the problem, why as a web site owner (or a web host like evul.net) would you ever want hosting from a <strong>*spammer*</strong>. It immediately calls into question the ethics, let alone the quality of such a provider. They all end up getting spamcop&#8217;d and in the case of this one, I wrote to the idiots to personally express my disgust, they are locally known and slimy. Hopefully enough spamcop reports will get these wannabe providers upstreams to slap them, as they&#8217;re almost always sent from North America.</p>
<p><em><strong>SEO/marketing experts asking for links:<br />
</strong></em><a href="http://kavassalis.com/wp-content/uploads/2010/05/SEO-spam.jpg" target="_blank"><img class="alignnone size-medium wp-image-104" title="SEO-spam" src="http://kavassalis.com/wp-content/uploads/2010/05/SEO-spam-300x90.jpg" alt="" width="300" height="90" /></a><br />
(Click to expand)</p>
<p>This one gets me. It&#8217;s pretty brainless. We get a ton of these at work for any blog that we have our email address on the whois for. The idea is that if you ask nicely enough for a link (they *always* come from female, likely fake, names) someone will give you that link, and your google pagerank will be increased! In reality you are a spammer. It must work enough of the time that it makes it worth while for these people to do it. I try and spamcop these guys, but it&#8217;s like pissing in the ocean: the email sources and spamvertised sites are almost always in South America, Asia or eastern Europe. (This particular one came from some ISP in Argentina) I have a feeling they are usually fronts for something else, (get a pseudo legit page pageranked up, use it to then push other pages up) as no legit page bulk emails out begging for links.</p>
<p><strong><em>People selling ads/ad network services:<br />
</em></strong><a href="http://kavassalis.com/wp-content/uploads/2010/05/ad-spam.jpg" target="_blank"><img class="alignnone size-medium wp-image-110" title="ad-spam" src="http://kavassalis.com/wp-content/uploads/2010/05/ad-spam-300x84.jpg" alt="" width="300" height="84" /></a><br />
(Click to expand)</p>
<p>I save the best for last. These are the most ironic and we get them from time to time at work. People spam our own sites essentially offering their own advertising services. It&#8217;s even better (ironic), like in the above example, when they are spamming an obvious campaign site, showing there is likely no human intervention in the spam. They&#8217;re just looking for well pageranked sites and spamming the owners. This again begs the question, who actually receives one of these and takes them up on their most reputable (lol) offers. Sadly some people must to make it worth while. Since these are all pseudo legit (wannabe) marketing companies, they&#8217;re almost always in the US, so spamcop for great justice.</p>
<p>We received a good one from a company selling a Twitter trending solution recently, again well targeted against an actual twitter-based campaign site, but it begs the question of how new are these people to the internet. How is it, that in 2010 there are people out there who still believe their business has any legitimacy as soon as they send out unsolicited-bulk email to the same people who have been fighting such email for nearly *20 years*. In short, stop buying services/products you see in spam, have sweet dreams of useless ICANN actually enforcing the bulk-whois-marketing rule instead of ignoring registrars who allow it willy-nilly, and <a href="http://spamcop.net" target="_blank">spamcop</a> everything for great justice.</p>
<p><img class="alignnone size-full wp-image-112" title="SpamCop.net - Welcome registered user" src="http://kavassalis.com/wp-content/uploads/2010/05/SpamCop.net-Welcome-registered-user.jpg" alt="" width="348" height="46" /></p>
<p>And to &#8220;Data Centers Canada Inc.&#8221;, &#8220;Comodus&#8221; and &#8220;Linkstar&#8221;, congrats you are spammers, any hopes you had of every being taken seriously as legit businesses went down the drain when you sent out unsolicited bulk email to domain owners, no matter how well targeted it was.</p>
]]></content:encoded>
			<wfw:commentRss>http://kavassalis.com/2010/05/of-nick-and-domain-spammers/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Of Nick and Google, again&#8230;</title>
		<link>http://kavassalis.com/2010/03/of-nick-and-google-again/</link>
		<comments>http://kavassalis.com/2010/03/of-nick-and-google-again/#comments</comments>
		<pubDate>Fri, 05 Mar 2010 19:59:09 +0000</pubDate>
		<dc:creator>nick</dc:creator>
				<category><![CDATA[internet]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[hosting]]></category>

		<guid isPermaLink="false">http://kavassalis.com/?p=77</guid>
		<description><![CDATA[Ugh! I totally forgot something important in my article about Google this morning. Though thats okay because it was a pretty long rant already, however at this point I am posting an abnormal number of times in a single day&#8230; Some Italian high school students uploaded a video of an autistic classmate to Google video. [...]]]></description>
			<content:encoded><![CDATA[<p>Ugh! I totally forgot something important in my <a href="http://kavassalis.com/2010/03/of-nick-and-google/" target="_blank">article about Google this morning</a>. Though thats okay because it was a pretty long rant already, however at this point I am posting an abnormal number of times in a single day&#8230;</p>
<p>Some Italian high school students uploaded a video of an autistic classmate to Google video. Let me start by saying this is terrible and cruel, and I cannot even imagine how cruel bullying must be in a post internet world. Google complied with the Italian law enforcement and handed over the details of the users who uploaded it. This is correct practice. If someone breaks the law, even if its over the internet, they are prosecutable, and content hosts are responsible to comply with law enforcement. I have done this many times and complied with Canadian law enforcement on some very interesting cases.</p>
<p>Apparently though this is not enough in Italy. Four Google Italy employees, including one who had left the company in 2008 were arrested, of which 3 were convicted of violating the videoed boy&#8217;s privacy rights. The courts demand that Google should have vetted the privacy of the content that was uploaded before sharing it with the world. <strong>REALLY?</strong> I mean are they serious? I&#8217;m sure they must realize how many videos are uploaded to Google/Youtube/etc every minute of every day? And to check to make sure EVERYONE in the video has signed a release as to their privacy? Heck its not feasible at a few videos per day, let alone <strong>the 20 hours of video uploaded to Youtube alone every minute of every day</strong>. This would be stupid coming from an armchair politician bitching on an internet forum, let alone the policy makers of an entire country. Seriously this worries me about the sanity of the law makers and future of the country.</p>
<p>It doesn&#8217;t stop at video, it could be pictures uploaded to Facebook, of which <strong>there are over a hundred million PER DAY. </strong>Should Facebook <strong>check EVERY photo to make sure nobodies privacy is being encroached</strong> upon? Get a clue!</p>
<p>My other complaint is why does every local government think they should be able to police the internet? I think Google should just pull its local offices out of Italy and then just let Italy decide if they want to block Google or not. Let all these insane countries play internet nanny for their citizens, maybe they can get a bulk deal on Cisco gear along with China and Australia&#8230; Hey Italy thanks for Ferrari but no thanks for your draconian attempts at internet policy!</p>
<p><a href="http://googleblog.blogspot.com/2010/02/serious-threat-to-web-in-italy.html" target="_blank">Read Google&#8217;s own blog post on the matter&#8230;</a></p>
]]></content:encoded>
			<wfw:commentRss>http://kavassalis.com/2010/03/of-nick-and-google-again/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Of Nick and hosting geo-diversity&#8230;</title>
		<link>http://kavassalis.com/2010/02/of-nick-and-hosting-geo-diversity/</link>
		<comments>http://kavassalis.com/2010/02/of-nick-and-hosting-geo-diversity/#comments</comments>
		<pubDate>Fri, 26 Feb 2010 15:55:27 +0000</pubDate>
		<dc:creator>nick</dc:creator>
				<category><![CDATA[Code]]></category>
		<category><![CDATA[C]]></category>
		<category><![CDATA[hosting]]></category>
		<category><![CDATA[internet]]></category>
		<category><![CDATA[Linux]]></category>
		<category><![CDATA[telecom]]></category>

		<guid isPermaLink="false">http://kavassalis.org/?p=29</guid>
		<description><![CDATA[If you look at the biggest websites and internet applications, you can pretty much divide them into two groups. Those that are geographically diverse and those that aren&#8217;t. It&#8217;s kinda shocking that in 2010, the majority of major internet properties still are located in a single (large) datacenter. Though to be fair there is a [...]]]></description>
			<content:encoded><![CDATA[<p>If you look at the biggest websites and internet applications, you can pretty much divide them into two groups. Those that are geographically diverse and those that aren&#8217;t. It&#8217;s kinda shocking that in 2010, the majority of major internet properties still are located in a single (large) datacenter. Though to be fair there is a good reason for that, geo-diversity has many challenges. Problems like directing traffic to the fastest/closest/cheapest/most available location are pretty easy to solve: most people go with BGP AnyCast, targeted DNS responses, or a combination of both. The real challenge though is making sure your actual served content is coherent among all the locations. It would be terrible for a user to upload a photo, sent the URL to their friends, only for the friends to see nothing or worse, the wrong image.</p>
<p>For static content, this is easy, even RSYNCs will be scalable to push out changes to your content amongst your farm. User uploaded content is quite a bit trickier. Within a single datacenter you can efficiently (though not always affordably) solve this using shared storage, iSCSI or NFS. Then applications pretty much can work as if they&#8217;re on a single server, session management can be tackled by using cookie or host persistence on the load balancers to make sure a user stays on the same server. What about servers in different locations though? NFS and iSCSI will not be terribly effective over transit.</p>
<p>You will have to push content between your locations then. If you are trying to geographically distribute your own application, you would just write functionality in to immediately push any user uploaded content out to other locations as its created.  Google/Youtube are great examples of this. When you hit content they&#8217;ve hosted, it isn&#8217;t even hosted on every server, and they direct you to the closest server that has said content. If that content isn&#8217;t available locally to you yet, or at all, they can stream it over their own fiber backhaul and out your closest Google POP.</p>
<p>But what if you are hosting a variety of 3rd party software. To my knowledge none of the popular blog packages, forum software, etc has any sort of geo-diversity designed into them. You could of course fork them and write your own, but then you end up supporting N different software packages for your N clients, not affordable or reasonable.  Rsync would do this task but unfortunately it is very intensive and doesn&#8217;t scale particularly well because it md5&#8242;s your entire tree constantly to see if things changed. As your content scales, the rsyncs would get slower and slower just seeing if changes occurred, eventually leading to massive delays on syncing out user created content.</p>
<p>In the end, its a cool problem, a problem that not too many people have tackled so far. We came up with our own solution, which I unfortunately probably shouldn&#8217;t disclose. I wrote the basis of the software last month, though it still needs some bug fixes, testing and more modules to be written for it. It is a difficult problem to tackle, but having worked in telecommunications, no facility is bullet proof, no power is bullet proof, no connectivity is bullet proof, no hardware is bullet proof: geo-diversity is a must going forward in this highly demanding world where everyone expects connectivity and content 24/7</p>
]]></content:encoded>
			<wfw:commentRss>http://kavassalis.com/2010/02/of-nick-and-hosting-geo-diversity/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
