[nycbug-talk] mapreduce, hadoop, and UNIX
Isaac Levy
ike at lesmuug.org
Sat Aug 25 11:50:59 EDT 2007
Awesome,
On Aug 25, 2007, at 11:34 AM, Alex Pilosov wrote:
> On Sat, 25 Aug 2007, Isaac Levy wrote:
>> Afterthought addition,
>> On Aug 25, 2007, at 10:48 AM, Isaac Levy wrote:
>>> From kernel to userland to network, I'm dying to find similar
>>> works,
>>> any help is much appreciated!
>>
>> E.G.:
>>
>> Distributed computing implementations:
>> - Plan 9?
>> - DragonflyBSD Clustering?
> We all are hoping today to have clusters similar to what VMS had 25
> years
> ago - fully transparent non-shared memory clustering aka "single
> system
> image". You don't know, and you don't care which node on the
> cluster the
> job is running on, and jobs can be migrated to and from nodes
> depending on
> the load.
>
> For proper clustering, you need a distributed filesystem,
> distributed lock
> manager, and job distribution engine.
>
> On linux front, closest thing would be MOSIX, which is *almost* that.
> Unfortunately, MOSIX is first and foremost a research project, with
> restrictive licensing and fragmented community (see, openmosix).
> Today,
> the project to have properly working clusters is openssi.org - I
> believe
> it is based on openmosix and opengfs.
/sigh
>
> Clustering is hard, comparing to writing an OS - even Linus can do
> that
> one.
/heh
>
>> Data implementations:
>> - Sun ZFS?
>> - AFS and the like?
>> - RH GFS and the like?
> If you are talking about proper distributed filesystems, they are
> few and
> far between.
>
> gfs/opengfs
> oracle ocfs
> intermezzo/lustre
> pvfs
> veritas dfs
> sgi cxfs (distributed xfs)
AWESOME list, thanks again Alex.
>
> Distributed filesystems are hard, compared to writing an OS.
Dude, if it wasn't a hard problem, we'd all have been using them for
many years now :)
Rocket-
.ike
More information about the talk
mailing list