[Note: this is not about some great new way to get porn over the Internet.] [Erased cliched introduction before publishing] Services are appearing to take Web communication to the next level: syndication. Using one form or another of a standard called RSS (Rich Site Summary), websites–even those as simple as BillSaysThis–can make content available through distributors automatically with little more effort than a few minutes to sign up. Some content publishing tools automate this process (some tools promise it’s coming real soon), some sites use scripts to generate this dynamically, and others have fans/audience members who write something site-specific.
One of the prime movers behind this effort is Jeff Barr’s Syndic8.com. This site has available over 7200 individual feeds written in at least 18 languages and is now experimenting with web services through an XML-RPC API. This API allows other sites and applications to interact with the Syndic8 database, not only to discover or manage individual feeds but to look at aggregations–categories, locations, activity. For example, the weblog you are reading now is in there.
One of the current Syndic8 efforts, driven by Bill Kearney, is to add geographic information to the feeds through the API. In fact, he wrote a script that looks up the domain to get the IP address and the location of that address (which is in the Whois database) and then uses the API to feed it to Syndic8. Of course for the most part the location determined by this brute force methodology is incorrect, for example there is no correlation between this website and the city of Davie, Florida. But Kearney has taken the first step and now the creators of the feeds can see what is happening and put in the correct data. This would mean adding the location information as metadata in their RSS; my pages have it but due to the way the RSS and underlying weblog pages are generated, it is not yet in the RSS. There are complications caused by differing versions of RSS and the toolkits used to produce the files but what else is new in Webland?
Here is one result of the aggregated information:
I asked Bill for the details on how he did the work so far:
“I did most of the work using perl, PHP, Radio, MySQL and XML-RPC calls.
I extracted the feeds from the Syndic8 server database using GetFeedInfo calls. I then locally extracted the hostname and looked up it’s IP address. Once I had the IP address I used the Geo-IP and NetGeo database to cross-reference it. This info was then sent back up to the Syndic8 database using the SetLocationInfo call.
So the latitude/longitude pair info is based solely on the server IP address info as can be determined from thet NetGeo database. That’s largely based on the whois databases. This means it’s only as accurate as the info contained in that database.”
