<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Google Base and Great War Soldiers</title>
	<atom:link href="http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/</link>
	<description>Failing better at understanding the past</description>
	<pubDate>Tue, 06 Jan 2009 11:24:46 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Gavin Robinson</title>
		<link>http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/comment-page-1/#comment-12046</link>
		<dc:creator>Gavin Robinson</dc:creator>
		<pubDate>Fri, 28 Dec 2007 10:54:11 +0000</pubDate>
		<guid isPermaLink="false">http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/#comment-12046</guid>
		<description>Thanks for pointing that out. I'd somehow never heard of microformats but it looks like a brilliant idea - even easier than embedding XML. For people who have already published details of soldiers on their websites this is likely to be a better solution than trying to sync up with Google Base. On the other hand Google Base would be best for people who haven't already published their data on the web and don't know how to. It should be possible to define a Google Base type and a microformat which map to each other and contain all the necessary details of soldiers. Then someone could write a scraper which pulls it all together and allows searches across Google Base, microformatted websites, and possibly the big sites (CWGC, TNA) even if they don't have any metadata. I know that someone is already working on a Python script which scrapes the CWGC database and allows searches by regiment and service number, which the site's own search engine won't do.</description>
		<content:encoded><![CDATA[<p>Thanks for pointing that out. I&#8217;d somehow never heard of microformats but it looks like a brilliant idea - even easier than embedding XML. For people who have already published details of soldiers on their websites this is likely to be a better solution than trying to sync up with Google Base. On the other hand Google Base would be best for people who haven&#8217;t already published their data on the web and don&#8217;t know how to. It should be possible to define a Google Base type and a microformat which map to each other and contain all the necessary details of soldiers. Then someone could write a scraper which pulls it all together and allows searches across Google Base, microformatted websites, and possibly the big sites (CWGC, TNA) even if they don&#8217;t have any metadata. I know that someone is already working on a Python script which scrapes the CWGC database and allows searches by regiment and service number, which the site&#8217;s own search engine won&#8217;t do.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ben Brumfield</title>
		<link>http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/comment-page-1/#comment-12001</link>
		<dc:creator>Ben Brumfield</dc:creator>
		<pubDate>Fri, 28 Dec 2007 00:38:22 +0000</pubDate>
		<guid isPermaLink="false">http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/#comment-12001</guid>
		<description>&lt;i&gt;just embed Google Base metadata in a webpage&lt;/i&gt;

This sounds reminiscent of microformats.  Have you looked to see if any connectors between the two exist?</description>
		<content:encoded><![CDATA[<p><i>just embed Google Base metadata in a webpage</i></p>
<p>This sounds reminiscent of microformats.  Have you looked to see if any connectors between the two exist?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Gavin Robinson</title>
		<link>http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/comment-page-1/#comment-11987</link>
		<dc:creator>Gavin Robinson</dc:creator>
		<pubDate>Thu, 27 Dec 2007 19:44:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.investigations.4-lom.com/2007/12/27/google-base-and-great-war-soldiers/#comment-11987</guid>
		<description>Another thing I forgot to mention: I don't think you can nest attribute tags, which would be quite a useful thing to do. For example, the peculiarities of the British Army numbering system in this period mean that a service number needs to be associated with a regiment (or sometimes even a battalion) to make any sense.

Even if Base isn't quite up to pulling in complete datasets from big sites like CWGC and TNA, it could still be valuable for people who have compiled a database of men who served in a battalion and want to make it available to other people on the web but who don't have the technical skills or the money to host it themselves. I might try some small scale experiments and see where it goes (although I have lots of other things to finish first).

Another alternative is &lt;a href="http://www.freebase.com/" rel="nofollow"&gt;Freebase&lt;/a&gt;, but that seems to be the other extreme: absolutely anyone can edit your data.</description>
		<content:encoded><![CDATA[<p>Another thing I forgot to mention: I don&#8217;t think you can nest attribute tags, which would be quite a useful thing to do. For example, the peculiarities of the British Army numbering system in this period mean that a service number needs to be associated with a regiment (or sometimes even a battalion) to make any sense.</p>
<p>Even if Base isn&#8217;t quite up to pulling in complete datasets from big sites like CWGC and TNA, it could still be valuable for people who have compiled a database of men who served in a battalion and want to make it available to other people on the web but who don&#8217;t have the technical skills or the money to host it themselves. I might try some small scale experiments and see where it goes (although I have lots of other things to finish first).</p>
<p>Another alternative is <a href="http://www.freebase.com/" rel="nofollow">Freebase</a>, but that seems to be the other extreme: absolutely anyone can edit your data.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
