andrewducker: (Default)
[personal profile] andrewducker
Can anyone see a way that a distributed Twitter could work?

Obviously you could use RSS to aggregate tweeters you like into one place - but that only covers very basic functionality.  For instance, I can't see a way to both decentralise it _and_ allow for hashtag filtering over the whole database to find the tweets you like.

Any thoughts?

Date: 2009-06-17 11:28 am (UTC)
From: [identity profile] red-phil.livejournal.com
Yes you can, you just have to search 500 sites.

The problem is finding what sites exist at any one time.
Especially if they are hosted on peoples home PCs and are liable to drop off the net at random.

Date: 2009-06-16 01:49 pm (UTC)
drplokta: (Default)
From: [personal profile] drplokta
It's a tricky one, because the data's not much larger than the metadata, so there's not much point in maintaining complete copies of the metadata at each site while sharding the data.

In theory, your searches could start on one server and then traverse a tree of other known servers, and the server where your search begins could consolidate the results and hand them to you, but it would make searching a bit slow and unreliable. It would also make them generate a lot of network traffic.

Alternatively, each post could go to multiple sites -- one for the user, one for each other user mentioned with an @user, and one for each hashtag used. Then it would be possible to search by user, @user or hashtag because you'd know which server to send the search to.

Date: 2009-06-16 04:00 pm (UTC)
nameandnature: Giles from Buffy (Default)
From: [personal profile] nameandnature
It's Usenet: flood fill between servers, hashtags are groups and "all tweets from Fred" is a group, so hashtagged posts from Fred are crossposts to Fred's group and the hashtag group.

This isn't completely distributed as there are still servers and clients.

Date: 2009-06-17 11:58 am (UTC)
nameandnature: Giles from Buffy (serious business)
From: [personal profile] nameandnature
Rather less than the real Usenet, I'd've thought. More thoughts:

You probably don't actually want NNTP and the Usenet message format, you just want to steal their ideas. Bonus points for making it use Google Wave.

You'd want to fix Usenet's current forgery and cancel problem with magical crypto fairy dust.

Anyone can run a leaf node, but you need a bunch of peers which flood to each other. Arranging peering is a manual step in NNTP. It'd be nice if this weren't the case in the distributed system, but sorting that out is a hard problem.

You don't want special clients for it, because people don't care enough to install one. Your local server's job is to present the messages it knows about as web pages, Atom feeds and so on.

Date: 2009-06-18 12:03 am (UTC)
nameandnature: Giles from Buffy (Default)
From: [personal profile] nameandnature
I'm guessing servers won't be run by individuals who trust each other but by service providers like Twitter (or ISPs if such a thing became popular, like email is and news used to be). Providers probably take money from users (or show them ads) but they don't trust them. If I can inject a tweet saying "andrewducker is wearing baggy pants today" at my local server and have it propagate everywhere, you might get a bit annoyed. Hence magical crypto fairy dust.

Date: 2009-06-16 05:17 pm (UTC)
From: [identity profile] call-waiting.livejournal.com
Sounds like you need a Distributed Hashtag Table.

Just kidding. Not really.</dodgeball>

Date: 2009-06-16 09:19 pm (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
Hashtags were created independently of Twitter and initially you had to go to places like hashtags.com to search for them anyway.

I already have a live bookmark with an RSS of incoming link referrals and mentions of my name that comes from blog pinging servers.

All you need is a microblogging ping server and a decent search setup for complete aggregation—search for @andrewducker or whatever and it'll show all entries on all sources that are pinging your engine of choice.

Automattic already run a free pinging service for blogs (that I keep meaning to link to), a similar service would mean your host has to ping one repository, it pings all the search setups, etc.

Easy.

Completely beyond my technical skills, but the technology is already there.

Date: 2009-06-17 11:41 am (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
I can't see that scaling well beyond a few services.

But it already does work for blogging. How many blogging platforms are there out there? I can track incoming links from all over the shop very easily, and search for specific terms effectively.

They pick up links, terms, tags, etc. You only actually need to ping the search engines anyway, if you're searching on just your site then you only get stuff from there, but if you search at, say, hashtags, you get referrals, tags, etc from everywhere that's pinging a service that pings hashtags.

Date: 2009-06-17 11:48 am (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
http://www.icerocket.com/search?tab=blog&fr=h&q=iran&x=0&y=0

It won't get everything, for the simple reason that sites like, say, LJ don't automatically ping stuff, but it'll get stuff that wants to be found.

And something like http://hashtags.org is already set up for this sort of thing.

Date: 2009-06-17 11:56 am (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
Well no, it can't find everything. But neither does every search on Twitter where they're only interrogating their own database.

For a post to be found, it has to want to be found, LJ is notoriously bad at helping people find stuff on the site, DW is at least looking at a pinging service for more than just the very crappy weblogs.com.

The advantage of a distributed model is competition, redundancy and a distributed system making things less prone to fall over.

The disadvantage is that not all the sites competing will talk to each other (without consumer pressure) and you still need to have some people, somewhere, tracking stuff.

If you want your search box to be built into whatever service you're using, then that adds extra weight to the spec, search requires a central repository of some sort. Or your server will need to track and remember everything tweeted on every service globally, just in case you want to search for that.

Which I suspect will actually happen at some point, but not for awhile.

Date: 2009-06-17 12:29 pm (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
But you don't need every site to ping every other site.

You need to have a couple sites that aggregate pings to other search engines and possibly feed portals. Your site would be grabbing feeds from your subscriptions and would pick up on everything in there, and everything else would come from a search engine—which release feeds of the results so you'd get your @replies anyway.

You choose your search engine(s), they sort out the pinging APIs between them.

Date: 2009-06-17 12:42 pm (UTC)
matgb: Artwork of 19th century upper class anarchist, text: MatGB (Default)
From: [personal profile] matgb
You need every producing site to ping every search site.

No, you need to have a couple sites providing a pinging service, like Auttommatic does with ping-o-matic. Then you, the user, chose which pinging service to use.

Which also means they all have to keep track of every search site out there.

No, the pinging service does that for you.

And most blog platforms already provide feeds for a huge number of search services, sort of—every RSS reader in the world is effectively acting as a search service to an extent.

Actually, given that Twitter already has a huge pile of published APIs, and a lot of people are used to using clients already, the impetus for a distributed model is pretty much already there, all it'd need is for Twitter to have an outage and people'll step in.

October 2025

S M T W T F S
    1 2 3 4
5 67891011
12131415161718
19202122232425
262728293031 

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated Oct. 7th, 2025 12:44 am
Powered by Dreamwidth Studios