<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Warm Coffee - Reflections of a Web Developer</title>
	<atom:link href="http://paulsmith.blogs.ilrt.org/feed/" rel="self" type="application/rss+xml" />
	<link>http://paulsmith.blogs.ilrt.org</link>
	<description>A damned site, better</description>
	<lastBuildDate>Fri, 23 May 2008 13:22:54 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Scrubbing your content?</title>
		<link>http://paulsmith.blogs.ilrt.org/2008/05/21/scrubbing-your-content/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2008/05/21/scrubbing-your-content/#comments</comments>
		<pubDate>Wed, 21 May 2008 12:16:15 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[content issues]]></category>
		<category><![CDATA[content publishing ownership online]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2008/05/21/scrubbing-your-content/</guid>
		<description><![CDATA[I&#8217;ve just read an interesting piece in the Guardian, concerning post-publication editing of articles, and how the Web has divided opinion on whether print versions of articles should match their online equivalents. Obviously, the nature of the Web allows for minor corrections (typo&#8217;s, spelling mistakes, etc) but should journalists have the power to change, or [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve just read an <a href="http://www.guardian.co.uk/commentisfree/2008/may/19/1">interesting piece in the Guardian</a>, concerning post-publication editing of articles, and how the Web has divided opinion on whether print versions of articles should match their online equivalents. Obviously, the nature of the Web allows for minor corrections (typo&#8217;s, spelling mistakes, etc) but should journalists have the power to change, or add to their articles once they have entered the public domain, and if so, should any &#8216;mending&#8217; be made obvious? Does this affect trust and transparency issues?</p>
<p>This piece is obviously angling towards journalism, but I can see parallels with community edited documents, or websites (such as Wikipedia) where savvy users can work out what has changed, but most won&#8217;t, and taking a quote from an online source which changes, or at worst disappears can result in issues of integrity in all fields of work (from school homework to journalism and politics). How does this work with the Freedom of Information act, when an article or page on the Web could be changed on a whim, and no record of the original kept? Is this tantamount to shredding inaccurate documents and replacing them? In this current era of the ephemeral and ubiquitous Web, who owns content, and what they can do with it is an interesting question.</p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2008/05/21/scrubbing-your-content/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>XTech 2008 &#8211; Day 3</title>
		<link>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-3/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-3/#comments</comments>
		<pubDate>Mon, 19 May 2008 16:38:12 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[conferences and workshops]]></category>
		<category><![CDATA[trip reports]]></category>
		<category><![CDATA[xtech2008 conference report]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2008/05/20/day-3/</guid>
		<description><![CDATA[Day 3
This morning&#8217;s hangover was brought to you by Smithwicks.
A short day today, with the conference finishing at around 1pm. Before the end, though, a couple of talks to go to before the final keynote.
Firstly, was a session on &#8216;Data portability with SIOC and FOAF&#8216; from Uldis Bojãrs of DERI, Galway. This talk described the [...]]]></description>
			<content:encoded><![CDATA[<h2>Day 3</h2>
<p>This morning&#8217;s hangover was brought to you by <a href="http://www.smithwicks.ie/">Smithwicks</a>.</p>
<p>A short day today, with the conference finishing at around 1pm. Before the end, though, a couple of talks to go to before the final keynote.</p>
<p>Firstly, was a session on &#8216;Data portability with <a href="http://sioc-project.org">SIOC</a> and <a href="http://www.foaf-project.org">FOAF</a>&#8216; from Uldis Bojãrs of DERI, Galway. This talk described the need and requirements for data portability, and continued with a demonstration of <a href="http://rdfs.org/sioc/applications/">SIOC applications</a> which allow SIOC and FOAF data to be produced from existing online community sites (blogs, forums, etc.). Uldis also described how this information can be ported to other sites with tools such as the <a href="http://wiki.sioc-project.org/w/SIOC_Import_Plugin">WordPress SIOC Import plugin</a>. He also demonstrated the <a href="http://sioc-project.org/firefox">Semantic Radar Firefox plugin</a> which detects the presence of SIOC, FOAF and <a href="http://usefulinc.com/doap">DOAP</a> data in a web page.</p>
<p>Next up, was Andrew Walkingshaw of the University of Cambridge, who was presenting his work on &#8216;Representing, indexing and mining scientific data using XML and RDF: Golem and CrystalEye&#8217;. I&#8217;d met Andrew over lunch the day before, and we had a good chat about many things, and I was really looking forward to this talk. The essence of the presentation was discussing the creation of tools to enable scientists to filter, mine and search both their own data and that produced by other researchers, aggregating supplementary data, and adding value to this data. Andrew described the <a href="http://wwmm.ch,cam.ac.uk/crystaleye/">CrystalEye</a> repository and his <a href="http://www.lexical.org.uk/science/golem/">Golem</a> ontology language and associated tools, which provides the ability to extract the data from the repository, and then  build new interfaces to it &#8211; making the data easier to find, analyse and reuse. A cracking talk, delivered in a very engaging way, and I will read his paper as I admit some of the talk did pass over my head!</p>
<p>Finally, after some much needed coffee, was the final keynote. Sean McGrath of Propylon gave a highly entertaining, and engaging talk on the nature of the web as we know it, taking us on a &#8220;Celtic-tinted safari of the Web featuring mythical creatures, tenuous analogies and curious interconnections.&#8221; No, really, he did. Via a lot of digression (and indeed, orangutans) Sean concluded that today&#8217;s Web is: &#8220;The Web is URIs + HTTP (and nothing else – no &#8216;pure&#8217; content)&#8221;. And I tend to agree. Definitely recommend scanning through the <a href="http://assets.expectnation.com/15/event/3/Orangutans,%20Oxen%20and%20Ogham%20stones_%20Mulling%20the%20movable%20Web%20Presentation.pdf">slides for this talk</a> &#8211; it might not make sense without the words, but there&#8217;s enough to go on to get a flavour of this great end to XTech 2008.</p>
<h2>Conclusions</h2>
<p>So we&#8217;d made it to day 3 of this excellent conference. As usual, much of the really interesting discussion was held in the restaurants and pubs around Dublin after the main day&#8217;s events were over, going on well in to the small hours in some cases.</p>
<p>My general conclusions and reflections, are:</p>
<ul>
<li>Much of the really cool and exciting stuff covered in this conference will never see the light of day in the projects I work on (with the possible exception of RDFa, but convincing clients they should do this will be hard, as will coming up with any kind of useful ontology). This may be a defeatist attitude, and perhaps there will be opportunities to embed Fire Eagle functionality into some sites in the future, or make more of FOAF and SIOC to make our sites more amenable to community building &#8211; who knows? But at least now I know these things exist, and we can start to look at OAuth for CMS integration with, say, Flickr or other such technical liaisons.</li>
<li>I don&#8217;t know anywhere enough about this stuff, both technically or indeed that it exists, and despite it not being of immediate use to me in my day-to-day work, I should spend more time finding out about it and playing &#8211; maybe in my own time if work commitments don&#8217;t allow the time.</li>
<li>What I really liked about this conference was the mix of attendees and presenters, both from academia, and the commercial world both large and small. It made it feel much more valid, and it really felt like everyone was there for the right reasons &#8211; not trying to sell anything, but out of a genuinely altruistic wish to make the web better.</li>
<li>I definitely would want to go to XTech again, and would encourage others to go too.</li>
<li>Dublin pubs sell some great beer.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-3/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>XTech 2008 &#8211; Day 2</title>
		<link>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-2/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-2/#comments</comments>
		<pubDate>Mon, 19 May 2008 16:36:54 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[conferences and workshops]]></category>
		<category><![CDATA[trip reports]]></category>
		<category><![CDATA[xtech2008 conference report]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-2/</guid>
		<description><![CDATA[Day 2
After learning the lesson from yesterday&#8217;s &#8216;worst Full Irish Breakfast I&#8217;ve ever had in Ireland&#8217;, I headed off to the conference via Starbucks &#8211; this time my route problems of the previous day were banished, and I actually arrived early. Not early enough for the 8am sponsor presentation, though, so I hope somebody made [...]]]></description>
			<content:encoded><![CDATA[<h2>Day 2</h2>
<p>After learning the lesson from yesterday&#8217;s &#8216;worst Full Irish Breakfast I&#8217;ve ever had in Ireland&#8217;, I headed off to the conference via Starbucks &#8211; this time my route problems of the previous day were banished, and I actually arrived early. Not early enough for the 8am sponsor presentation, though, so I hope somebody made it in for that.</p>
<p>The first two presentations of the day were by Stephen Dunn and colleagues of the Guardian, and Brendan Quinn from the BBC. Both presentations concentrated on their recent &#8216;updates&#8217;, both approaching this upscaling and development in different ways &#8211; the Guardian talk was more &#8216;what we did&#8217; whereas the BBC is more at the stage of &#8216;what we are doing&#8217;. The Guardian talk concentrated on: using the web for services to avoid internal complexity in their software; implementing web 2.0 ideas at an enterprise scale without the enterprise baggage; using tagging to connect content and services; How we prepared our archive and ongoing information architecture for the social web. There were several interesting concepts discussed including the use of third-parties to provide data and also cut down on cacheing issues, and also an automated tagging system based (in part) on folder structure. It was interesting to note that these large scale projects rely on many, many techies, editors and writers, and it was &#8216;fun&#8217; to contrast that with how we work in ILRT (albeit on sites several orders of magnitude smaller). The BBC talk was even closer to home, with resources being their prime issue &#8211; having to cojole and argue to many people the virtues of introducing a new web delivery platform if the BBC wanted their site to continue, whereas from the outside all appeared to be going smoothly &#8211; the swan metaphor was used, I&#8217;m sure. The mutterings of surprise around the room when it was announced that the BBC site still ran on SSIs and Perl scripts was oddly noticeable.</p>
<p>Coffee, chat, and then on to listen to Kellan Elliot-McCrea from Flickr talk about OAuth. This was interesting, as I knew nothing about it beforehand (which was probably a bad thing, as this talk relied on knowing what OAuth was!) but hey, I can pick these things up fairly quickly. Anyway, so OAuth turns out to be an emerging standard for authorisation, not authentication (ie OpenID). Much like it&#8217;s authentication equivalent, OAuth works by the exchange of tokens, but also adds the ability to customise attributes for additional privileges. I must admit that this started to go over my head a bit, so more reading is required, but this definitely does look something worth more investigation. One of the sites making use of OAuth is <a href="http://www.fireeagle,com/">Fire Eagle</a>, and the next talk was by Evan Henshaw-Plath of Yahoo! Brickhouse, on Fire Eagle. This site is interesting in that it&#8217;s been designed primarily as an API, and several demo&#8217;s of how it has already been integrated with other web sites and tools were given. I now have an account on Fire Eagle, so will be trying some interactions with it soon, and hopefully coming up with some use for it!</p>
<p>After lunch and some sun outside the hotel, it was back to the dark halls to listen to Jeni Tennison of The Stationery Office talk about SemWebbing the London Gazette. I found this really interesting, and was my first real introduction to RDFa. Jeni also covered the underlying problems, in that documents need proper URIs, and how they redefined the URI space for the London Gazette. This talk opened up many possibilities in terms of making data better available via RDFa, without relying on dodgy HTML scraping scripts. Defintely a paper worth reading, and food for thought.</p>
<p>Next up was Ralph Meijer, talking about getting Social Networks to talk to each other. He described how, using Jabber/XMPP technologies besides HTTP, you can enable two-way communication between third-party clients and services (XMPP as an API), and have services exchange (events on) social objects and people. In near real-time, with built-in authorization and authentication. Very cool stuff, and worth keeping an eye on &#8211; there seems to be quite a movement to free up information held on individual social networks to get out of the closed-silo mentality of recent times.</p>
<p>On to the Steven Pemberton experience. I always enjoy seeing Stephen present, and this was no exception. His talk examined how Web 2.0 partitions the Web into a number of topical sub-Webs, and locks you in, thereby reducing the value of the network as a whole (the value of the Web is that there is only one). He went on to explain that user contributed content is not (always) a Bad Thing but it is the method of delivery and storage that is wrong. The future lies in better aggregators. This (somewhat) controversial approach suggested that we should hold all our personal data (photos, personal profile, contacts list, etc) on our own personal web servers, which can then be uploaded or shared with third party sites (Facebook, Flickr) etc, so I can have my information on any social network site I like, but only need to update/manage it in one place &#8211; my personal site. Still undecided on this approach, but then I&#8217;ve still to find a need for Facebook, and I do host my own photo archive, but as most people were trying to say, I&#8217;ve yet to find a better photo site than Flickr.</p>
<p>Finally today, was Ian Davis from Talis, presenting a paper on how to manage your API to keep everyone happy. Ian covered the design of API URLs, versioning strategies and techniques  for preserving backwards compatibility &#8211; most of which seemed quite obvious, but how many of us have gone ahead and designed or updated APIs with little or no thought for who it might effect when we go from v2 to v3.</p>
<h2>Lightning Talks</h2>
<p>Later in the evening, everyone gathered once again for the highly anticipated <a href="http://2008.xtech.org/public/schedule/detail/654">Lightning Talks session</a>.  These were generally very good, and subjects covered included HTML5, Microsoft Popfly, and a break from the normal 20:20 format by Ian Forrester from the BBC, who presented a mammoth 72 slides in 6 minutes&#8230; it wasn&#8217;t wholly successful in getting across anything tangible, but it was funny! Another talk focussed on work for the new virtual, personalised tour of Amsterdam&#8217;s <a href="http://www.rijksmuseum.nl">Rijkmuseum</a>, which was a great talk, and very well presented. I really like lightning talks &#8211; you can learn so much from them, and in some cases I feel they are far superior to the usual 45 minute presentation.</p>
<h3>Talks I wish I&#8217;d been able to go to:</h3>
<p>Most of them, to be honest &#8211; it was a really good day, with some excellent talks.</p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>XTech 2008 &#8211; Day 1</title>
		<link>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-1/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-1/#comments</comments>
		<pubDate>Mon, 19 May 2008 14:19:06 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[conferences and workshops]]></category>
		<category><![CDATA[trip reports]]></category>
		<category><![CDATA[xtech2008 conference report]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-1/</guid>
		<description><![CDATA[Intro
This is a brief write up of Day 1 of the 2008 XTech conference held in Dublin, Ireland from 6-9 May 2008. Days 2 and 3 to follow. The proceedings and presentations from the conference are available for further reading.
The theme for this year&#8217;s conference was &#8220;The Web on the Move&#8221;. Not being able to [...]]]></description>
			<content:encoded><![CDATA[<h2>Intro</h2>
<p>This is a brief write up of Day 1 of the<a href="http://2008,xtech.org/"> 2008 XTech conference</a> held in Dublin, Ireland from 6-9 May 2008. Days 2 and 3 to follow. The <a href="http://2008.xtech.org/public/schedule/proceedings">proceedings</a> and <a href="http://2008.xtech.org/public/schedule/presentations">presentations</a> from the conference are available for further reading.</p>
<p>The theme for this year&#8217;s conference was &#8220;The Web on the Move&#8221;. Not being able to put the intro any better, this is the intro from the parent website and the conference programme:</p>
<p>&#8220;For years we have been developing and promoting open data standards, enabling data portability. Recent developments have led to web-wide programming APIs and virtualization. It’s no longer just our data on the move, it’s our applications and even our servers too.</p>
<p>What impact will this era of unprecedented portability have on us? How should we change the way we build for the web? XTech 2008 will examine the technology, war stories and practical concerns of developing for today’s web.&#8221;</p>
<h2>Day 1</h2>
<p>The conference opening keynote was already started when I arrived (being slightly late, hot and flustered after taking the long way round to get to the venue due to a Google map print-out of restricted detail!) so I missed the introduction, but it turned out to be a presentation on the role of &#8220;open&#8221; in the process of innovation to commoditisation. I must admit to missing the point slightly, but this talk seemed to summarise (in many, many, slides) that the &#8216;open meme&#8217; will be a major driving force in behind the changes in the market in terms of getting from innovation to commoditisation, by changing the market and the services that will need to be provided. This talk was followed by David Recordon of Six-Apart, who presented a very interesting journey through the technologies behind Open Platforms (concentrating on social applications), and their common requirements, ie: ways to share abstract information, ways to communicate, ways to know who someone is, ways to know who someone knows, ways to know what someone is doing. This involved discussions of OAuth, XMPP, OpenID, XFN Microformats (XHTML Friends Network), FOAF, <a href="http://www.twitter.com/">Twitter</a>, and others. Sadly, many of these I was oblivious to, but no more &#8211; I have, as a result now got accounts on Twitter and <a href="http://www.fireeagle.com/">Fire Eagle</a>, and am experimenting further with Google Talk and its XMPP client.</p>
<p>After a welcome coffee break, next I sat through an enlightening talk from Douglas Crockford, currently of Yahoo! on &#8220;Javascript: The Good Parts&#8221; where he attempted to show that Javascript is &#8220;a beautiful, highly expressive language that is buried under a steaming pile of good intentions and blunders&#8221;. He also demonstrated <a href="http://www.jslint.com/">JSLint</a> a Javascript verifier, which should (will?) save you from all the pitfalls and nightmares of badly coded Javascript. Coincidentally, Douglas has a book being published by O&#8217;Reilly on this very subject coming out shortly.</p>
<p>Then it was on to a talk by Michael Smith of the W3C who gave a rapid look at some of the important changes in the browser landscape since XTech 2007, and at what those changes mean for developers. There was a lot of information on how <a href="http://www.google.co.uk/url?sa=t&amp;ct=res&amp;cd=1&amp;url=http%3A%2F%2Fwebkit.org%2F&amp;ei=XYsxSKOUD4200QT9g7XYAQ&amp;usg=AFQjCNEGJ7R9gMF3znWbtSquGFEELChj0g&amp;sig2=5Yt1Mw0O5wmQFV4jnB3utQ">Webkit</a> is becoming more prevalent, and how there has been much development in the mobile browser arena, with Opera still a major player in the absence of any significant challenge from Mozilla (so far &#8211; they are now working on a Gecko based mobile browser). Otherwise, it was some CSS3, better SVG and much surprise that the IE8 beta seems to be better than people were expecting.</p>
<p>I then went to a couple of presentations on AJAX, the first by Bob Buffone of Nexaweb Technologies who talked about ways to optimise AJAX applications. Areas covered were: <a href="http://www.mozilla.org/rhino/">Mozilla&#8217;s Rhino JavaScript engine</a> as a complete performance monitoring tool for Ajax code bases; Injecting monitoring code into every function of JavaScript within an application to create a complete performance picture; Locating performance issues through drilldowns of function call counts, total time spent, average time per call, and call stacks; and start time optimization using <a href="http://dojotoolkit.org/">Dojo</a>, Gzip, and Compression. Most of this was fairly obvious, but a useful summary, and was also introduced to YSlow, a Firefox plugin for measuring page download/rendering times, which I&#8217;d not come across before. The second talk was by Clinton Smullen of the University of Tennessee who presented a paper on how they tried to increase the performance of part of their University website by using AJAX. He described the tests they used to determine the best way of delivering the AJAX content (ie partial HTML, XML, <a href="http://www.json.org/">JSON</a>, and CSV). Interesting stuff, but I felt fairly limited to that particular application, but definitely worth bearing in mind that there may be better ways to serve your content than the obvious, and that you should look at other methods where possible.</p>
<p>After coffee, I went to listen to Arve Bervendsen of Opera give a talk on giving Web Applications and Widgets access to device and user data. Not knowing really what widgets were in this sense, I thought this would be an interesting learning session for me, but I have to say I&#8217;m still none the wiser, and so this probably wasn&#8217;t a great talk to go to as it did presuppose some knowledge on the subject. Still, it appears you can do what he said you could, and had some demos. Mostly I just caught up on email.</p>
<p>Finally today, I went to listen to Fabrice Desre of Orange Labs talk about <a href="http://www.open-mashups.org/">Open Mashups</a>. The Open Mashups editor is a Firefox extension, which aims to allow the creation of mashups which provide: clean separation of functional (what the application is doing) and non-functionnal (e.g look and feel) aspect; independence from devices and execution platforms: no vendor, device or platform locking; and user friendliness : no need to be a coder to use it. This looked very cool, and I do intend to have a play when time allows. Currently reliant on FF3 though.</p>
<h3>Talks I wished I&#8217;d been able to go to:</h3>
<p>Blaine Cook &#8211; Building the Real-Time Web<br />
Simon Willison &#8211; Unobtrusive JavaScript with jQuery</p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2008/05/19/xtech-2008-day-1/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Web Security 101 with SANS</title>
		<link>http://paulsmith.blogs.ilrt.org/2007/12/06/web-security-101-with-sans/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2007/12/06/web-security-101-with-sans/#comments</comments>
		<pubDate>Thu, 06 Dec 2007 15:52:47 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[conferences and workshops]]></category>
		<category><![CDATA[trip reports]]></category>
		<category><![CDATA[web security]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2007/12/06/web-security-101-with-sans/</guid>
		<description><![CDATA[I recently attended a Web Application Security workshop run by SANS in London.
Dominic Hiles has made extensive notes on this at:

Day 1
Day 2

]]></description>
			<content:encoded><![CDATA[<p>I recently attended a Web Application Security workshop run by SANS in London.</p>
<p>Dominic Hiles has made extensive notes on this at:</p>
<ul>
<li><a href="http://codetricks.blogs.ilrt.org/2007/12/06/lessons-learnt-from-the-sans-web-application-security-workshop-2007-part-1/">Day 1</a></li>
<li><a href="http://codetricks.blogs.ilrt.org/2007/12/06/lessons-learnt-from-the-sans-web-application-security-workshop-2007-part-2/">Day 2</a></li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2007/12/06/web-security-101-with-sans/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>New methods of web analysis?</title>
		<link>http://paulsmith.blogs.ilrt.org/2007/04/20/new-methods-of-web-analysis/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2007/04/20/new-methods-of-web-analysis/#comments</comments>
		<pubDate>Fri, 20 Apr 2007 10:19:38 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[web metrics]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2007/04/20/new-methods-of-web-analysis/</guid>
		<description><![CDATA[I received an email the other day from a colleague asking the following:
&#8220;I was surfing for info on Unique Users and came across this &#8211; I&#8217;d really appreciate your views on it if you&#8217;ve got time to have a look. Do you know anything about this company and would it be worth getting involved with [...]]]></description>
			<content:encoded><![CDATA[<p>I received an email the other day from a colleague asking the following:</p>
<p><em>&#8220;I was surfing for info on Unique Users and came across this &#8211; I&#8217;d really appreciate your views on it if you&#8217;ve got time to have a look. Do you know anything about this company and would it be worth getting involved with them?&#8221;</em></p>
<p><a href="http://www.thinkmetrics.com/New-methods-of-web-analysis.php">http://www.thinkmetrics.com/New-methods-of-web-analysis.ph</a></p>
<p>I read the article, and suggest you do to. Hmmm.. sounds like a miracle cure, doesn&#8217;t it?</p>
<p>After four years supporting the WebTrends web log analysis software in the 1990&#8217;s, I&#8217;m used to the limitations of log and cookie based analysis, and there are several assumptions in the method described in this article that I would take issue with&#8230;</p>
<p><em>&#8220;1. If the same cookie is present on multiple visits, it?s the same person.&#8221;</em></p>
<p>Not true &#8211; it&#8217;s the same browser / computer, but not necessarily the same person. Take a public library computer: hundreds of people could use that &#8211; it&#8217;s most likely that on a popular site with no authentication, that a cookie is set once, which would then be used by anyone using that computer.</p>
<p><em>&#8220;4. We know some IP addresses cannot be shared by one person. These are the ones that would require a person to move faster than possible. If we have one IP address in New York, then one in Tokyo 60 minutes later, we know it can?t be the same person because you can?t get from New York to Tokyo in one hour. &#8221; </em></p>
<p>Not true &#8211; IP addresses can be set by a remote data centre / proxy depending on your ISP. The user doesn&#8217;t physically have to be in those places, it just depends on how their traffic is routed. The &#8216;AOL problem&#8217; (many IP addresses designated during any one session) applies here too. Whoever wrote this, doesn&#8217;t seem to understand network topography, which I would suggest is fairly useful when talking about Internet user measurements.</p>
<p>Its also states that &#8220;30-50% of people delete cookies&#8221;&#8230;. and then proposes a whole system based on persistent cookies. It&#8217;s no good using a range if the range drops by 30-50% in the first place, even if you weight this. Again, a flawed concept.</p>
<p>But I think the most telling &#8216;wouldn&#8217;t touch with a bargepole&#8217; comment in the article would be:</p>
<p><em>&#8220;We&#8217;ve approached some household names in metrics, online advertising delivery, and major search engines. The response has been zilch. No one wants to know.&#8221; </em></p>
<p>If online ad agencies and other metrics companies aren&#8217;t interested, you must be flogging a dead horse.</p>
<p>To be honest, if you read this without focussing on the techie bits it&#8217;s a simple sales pitch for a dodgy scheme.</p>
<p>So the summary answer I gave to the emailed query was &#8220;I&#8217;ve not heard of this company, I&#8217;d not heard of this &#8216;research&#8217;, and I would suggest our time could possibly be better spent.&#8221;</p>
<p>On a similar note, an article on the BBC site &#8211;  					<a href="http://news.bbc.co.uk/1/hi/technology/6564671.stm">Web counting tools &#8216;need change&#8217;</a> &#8211; makes interesting reading, although again, offers no real solutions. I&#8217;d be very interested to know if anyone has come across any other methods of measuring users on Web sites.</p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2007/04/20/new-methods-of-web-analysis/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Firefox bookmarks and keywords</title>
		<link>http://paulsmith.blogs.ilrt.org/2007/04/17/firefox-bookmarks-and-keywords/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2007/04/17/firefox-bookmarks-and-keywords/#comments</comments>
		<pubDate>Tue, 17 Apr 2007 10:12:00 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[browser hacks]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2007/04/17/firefox-bookmarks-and-keywords/</guid>
		<description><![CDATA[This is a neat trick which allows you to create &#8216;active&#8217; bookmarks which provide functionality without having to initially visit the site in question first. By creating a bookmark with an associated keyword, you can run searches on target sites from the Firefox address bar.
Example: The Oxford English Dictionary
To get the relevant query string, go [...]]]></description>
			<content:encoded><![CDATA[<p>This is a neat trick which allows you to create &#8216;active&#8217; bookmarks which provide functionality without having to initially visit the site in question first. By creating a bookmark with an associated keyword, you can run searches on target sites from the Firefox address bar.</p>
<p><strong>Example: The Oxford English Dictionary</strong></p>
<p>To get the relevant query string, go to www.oed.com, and run a search. You will then be presented with the completed query string for running a search on this site, eg:</p>
<p><em>http://dictionary.oed.com/cgi/findword?query_type=word&amp;queryword=test&amp;find.x=16&amp;find.y=22&amp;find=Find+word</em></p>
<p>We want to make this dynamic, so need to replace the test string with a string variable. In this case, the relevant form element is &#8216;queryword&#8217;, so we can take this URL with its query string, and create a bookmark from it.</p>
<p>In Firefox, create a new bookmark &#8211; call it OED Search, or whatever you like. In the location box, add the following:</p>
<p><em>http://dictionary.oed.com/cgi/findword?query_type=word&amp;queryword=%s</em></p>
<p>You can strip the rest of the query string, this is just cruft.  Add a keyword, say &#8216;oed&#8217; (this should be unique, so make it fairly obvious yet distinct). Save your bookmark.</p>
<p>Now, in your Firefox address bar, you can type something like &#8216;oed floccinaucinihilipilification&#8217; and your browser should then integrate this into your bookmark location string, run the search, and return you a page of results as normal.</p>
<p>Obviously this can be done with any site which runs a search through a query string. Some sites obfuscate the search page in the results, so you will need to take the relevant form values from the initial form, rather than a completed query string, but the principle is the same.</p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2007/04/17/firefox-bookmarks-and-keywords/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>MPEG21 &#8211; DIDL Workshop</title>
		<link>http://paulsmith.blogs.ilrt.org/2007/01/31/mpeg21-didl-workshop/</link>
		<comments>http://paulsmith.blogs.ilrt.org/2007/01/31/mpeg21-didl-workshop/#comments</comments>
		<pubDate>Wed, 31 Jan 2007 17:21:10 +0000</pubDate>
		<dc:creator>Paul Smith</dc:creator>
				<category><![CDATA[conferences and workshops]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[respositories]]></category>

		<guid isPermaLink="false">http://paulsmith.blogs.ilrt.org/2007/01/31/mpeg21-didl-workshop/</guid>
		<description><![CDATA[I spent this afternoon attending an introductory workshop on MPEG-21 DIDL, at the Watershed in Bristol, organised by UKOLN. The presenter was Frances Knudson of the  Los Alamos National Laboratory Research Library. For the uninitiated, MPEG21 is a standard which &#8220;defines an open framework for describing multimedia applications&#8221; &#8211; although this can really mean [...]]]></description>
			<content:encoded><![CDATA[<p>I spent this afternoon attending an <a href="http://www.ukoln.ac.uk/events/mpeg-21-didl/programme/">introductory workshop on MPEG-21 DIDL</a>, at the Watershed in Bristol, organised by <a href="http://www.ukoln.ac.uk/">UKOLN</a>. The presenter was Frances Knudson of the  Los Alamos National Laboratory Research Library. For the uninitiated, <a href="http://www.chiariglione.org/mpeg/standards/mpeg-21/mpeg-21.htm">MPEG21</a> is a standard which &#8220;defines an open framework for describing multimedia applications&#8221; &#8211; although this can really mean anything!</p>
<p>The programme consisted of a whistle-stop introduction and overview of MPEG 21 DIDL, followed by some more hands-on theory where we were asked to create creating DIDs for various datatypes, based on what we had learned in the earlier session.</p>
<p>MPEG21 has 18 components, and this short workshop only really looked at parts 2 and 3, (Digital Item Declaration and Digital Item Identification respectively). Parts 5, 6 and 7 (Rights Expression Language, Rights Data Dictionary, Digital Item Adaptation) were mentioned in passing, but not explained in any depth.</p>
<p>I can see how this stuff would be useful &#8211; the model is deliberately abstract,  and so can be implemented in a number of ways  (XML, RDF, etc). The logic in the packaging of items, components and resources is fairly easy to grasp, but in the true way of metadata, you can pretty much do what you like, which is fine for individual repositories, but this means that true interoperability is still a way off (although the <a href="http://ieeeltsc.org/wg11CMI/ramlet/">RAMLET</a> project may assist with some of this).</p>
<p>MPEG21 currently has only one real competitor, which is <a href="http://www.loc.gov/standards/mets/">METS</a>, although most of the current major repositories which support one support plugins for the other, so you really do pay your money and take your choice.</p>
<p>A worthwhile session, which was well attended (and turned out to be part of a two-day workshop on content packaging which I now wish I was attending all of!).</p>
<p><em>NB. Notes and slides are being made available, and I will post these here when received. </em></p>
]]></content:encoded>
			<wfw:commentRss>http://paulsmith.blogs.ilrt.org/2007/01/31/mpeg21-didl-workshop/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->