Dark Light
Table of Contents Hide
  1. Platforms
  2. Agents
  3. Unrecogniseds
  4. Related

The problem with Open Anything is personalities, or the lack of them.

Syndication has a number of personalities, ranging from the “Do as I say” Dave Winer to the Aaron Swartz’s namespace-polluting stance. (Apparently my RSS is “funky”, because I actually /use/ the extension specification that Dave specified in RSS 2.0. I am patently an evil, evil man.

On the other hand, Mozilla appears to have the problem of a great deal of small gods, and no large ones. Firebird is doing well, partly – IMHO – because it has a person who sits at the top and manages it.

My single biggest pet hate about Mozilla is that it doesn’t render ­ properly. Nice problem to have, you might think, but it’s not the fact of non-support that annoys me, it’s the fact that the non-support of ­ is a bug that has been known about since “this time _four years_ ago”:http://bugzilla.mozilla.org/show_bug.cgi?id=9101 and yet has not been fixed purely because they are arguing about how it should – or shouldn’t – be displayed.

Meanwhile, the browser lacks this fairly important feature for in excess of four years (Because I really doubt it’s going to be fixed in the next 13 days), when all that needs specifing is a policy decision on whether the thing should be rendered or not.

Currently, though, the top two vendors appear to have decided not to fix bugs like this, which is annoying. Breakdown of agents over the last month for Aquarionics is interesting, though. Apparently 1% of my visitors are using NS4. This is good, because it means I can hate them indervidually.

Anyway, stats:

Platforms

Platforms are groups of Agents. Anything I don’t recognise is “Other”, anything I can’t find out any details about (is it a browser? a robot?) are “Unknown”

  • Internet Explorer – 103916 – 48.5%
  • Robots – 47209 – 22.03%
  • Mozilla – 40513 – 18.91%
  • Other – 6484 – 3.03%
  • Opera – 4634 – 2.16%
  • Netscape – 3854 – 1.8%
  • RSS Readers – 3057 – 1.43%
  • KHTML – 2459 – 1.15%
  • Wget – 1252 – 0.58%
  • Unknown – 897 – 0.42%

Agents

These are things that I could positivly ID via a series of regexes

  • IE 6 – 67026 – 31.28%
  • IE <5 – 36890 – 17.22%
  • Firebird – 16868 – 7.87%
  • Mozilla – 16848 – 7.86%
  • Inktomi – 9271 – 4.33%
  • WebCrawler – 8838 – 4.12%
  • Google – 7804 – 3.64%
  • Scooter – 6756 – 3.15%
  • Other – 6484 – 3.03%
  • Mozilla 1.4b – 3703 – 1.73%
  • Ask Jeeves – 3600 – 1.68%
  • Grub – 3159 – 1.47%
  • Phoenix – 3094 – 1.44%
  • LARBIN – 3023 – 1.41%
  • Opera 7 – 2305 – 1.08%
  • Opera <7 – 2279 – 1.06%
  • Netscape <4 – 2196 – 1.02%
  • KHTML – 1726 – 0.81%
  • Netscape 4 – 1658 – 0.77%
  • Archive.org – 1284 – 0.6%
  • wget – 1252 – 0.58%
  • PHP – 1226 – 0.57%
  • Python – 966 – 0.45%
  • WeblogMonitor – 772 – 0.36%
  • QuepasaCreep – 765 – 0.36%
  • Konqueror – 733 – 0.34%
  • SharpReader – 708 – 0.33%
  • NPBot – 598 – 0.28%
  • KNewsTicker – 500 – 0.23%
  • Zao – 386 – 0.18%
  • linkhype.com – 298 – 0.14%
  • NNTP://RSS – 257 – 0.12%
  • Radio Userland – 227 – 0.11%
  • FeedOnFeeds – 178 – 0.08%
  • Syndic8 – 167 – 0.08%
  • PostNuke – 134 – 0.06%
  • Mail Sweeper – 132 – 0.06%
  • rssSearch Harvester – 114 – 0.05%
  • Opera pretending – 50 – 0.02%
  • SiteCheck – 0 – 0%

Unrecogniseds

Anything that didn’t match a regex is here.

  • – – 1972 – 0.92%
  • libwww-perl/5.63 – 895 – 0.42%
  • NutchOrg/0.03-dev (Nutch; http://www.nutch.org/docs/bot.html; nutch-agent@lists.sourceforge.net) – 228 – 0.11%
  • http://www.almaden.ibm.com/cs/crawler [wf84] – 186 – 0.09%
  • sitecheck.internetseer.com (For more info see: http://sitecheck.internetseer.com) – 171 – 0.08%
  • RPT-HTTPClient/0.3-3 – 118 – 0.06%
  • linko/0.1 libwww-perl/5.65 – 102 – 0.05%
  • synerge asyncrawl/0.5 – 100 – 0.05%
  • dloader(NaverRobot)/1.0 – 99 – 0.05%
  • NetNewsWire/1.0.2 (Mac OS X; http://ranchero.com/netnewswire/) – 75 – 0.04%
  • HTTP agent – 74 – 0.03%
  • AmphetaDesk/0.93.1 (linux; http://www.disobey.com/amphetadesk/) – 72 – 0.03%
  • Java/1.4.1_02 – 71 – 0.03%
  • Java1.4.0_03 – 70 – 0.03%
  • Java/1.4.1 – 66 – 0.03%
  • htdig/3.1.5 (root@localhost) – 59 – 0.03%
  • lwp-trivial/1.35 – 55 – 0.03%
  • PolyBot 1.0 (http://cis.poly.edu/polybot/) – 53 – 0.02%
  • Java1.4.0_01 – 52 – 0.02%
  • TurnitinBot/1.5 http://www.turnitin.com/robot/crawlerinfo.html – 50 – 0.02%
  • Zeus 2.6 – 46 – 0.02%
  • Aggie 1.0 Release Candidate 5 – http://bitworking.org (Microsoft Windows 98 4.90.73010104.0; .NET CLR 1.0.3705.0) http://bitworking.org/AggieReferrers.html – 44 – 0.02%
  • User-Agent: NG/1.0 – 41 – 0.02%
  • MSProxy/2.0 – 38 – 0.02%
  • appie 1.1 (www.walhello.com) – 37 – 0.02%
  • rdflib-1.3.0 (http://rdflib.net/; eikeon@eikeon.com) – 36 – 0.02%
  • contype – 35 – 0.02%
  • Microsoft URL Control – 6.00.8862 – 34 – 0.02%
  • Feedster Harvester/1.0; Feedster, LLC. – 33 – 0.02%
  • vspider – 31 – 0.01%
  • NetNewsWire/1.0.3 (Mac OS X; Lite; http://ranchero.com/netnewswire/) – 30 – 0.01%
  • Lynx/2.8.4dev.16 libwww-FM/2.14 SSL-MM/1.4.1 OpenSSL/0.9.6 – 29 – 0.01%
  • Popdexter/1.0 (http://www.popdex.com/) – 28 – 0.01%
  • WebCopier v3.4 – 27 – 0.01%
  • Mozilla/5.0 – 26 – 0.01%
  • Privoxy/3.0 (Anonymous) – 25 – 0.01%
  • EbiNess 0.1a – 22 – 0.01%
  • Jigsaw/2.2.0 W3C_CSS_Validator_JFouffa/2.0 – 20 – 0.01%
  • Frontier/9.0 (WinNT) – 19 – 0.01%
  • junkbuster – 18 – 0.01%
  • psbot/0.1 (+http://www.picsearch.com/bot.html) – 17 – 0.01%
  • MovableType/2.62 – 16 – 0.01%
  • MovableType/2.64 – 15 – 0.01%
  • Feedster Harvester/1.0; FS Consulting, Inc. – 14 – 0.01%
  • WebGather 3.0 – 13 – 0.01%
  • RealPlayer G2 – 12 – 0.01%
  • Under the Rainbow 2.2 – 11 – 0.01%
  • HenryTheMiragoRobot – 10 – 0%
  • unknown/1.0 – 9 – 0%
  • parabot paracite@ecs.soton.ac.uk – 8 – 0%
  • oBot – 7 – 0%
  • Links (2.1pre9; Linux 2.4.18-19.7.x i686; x) – 6 – 0%
  • NSPlayer/8.0.0.4477 – 5 – 0%
  • JPluck 2.0 pre2 – 4 – 0%
  • DoCoMo/1.0/N504i/c10/TB – 3 – 0%
  • amibot – 2 – 0%
  • Lynx/2.7.1ac-0.102+intl+csuite libwww-FM/2.14 – 1 – 0%

Code for this is online, search for ‘case: "agents":

Related Posts