Hi Brian, All,
This email has one theme: GEOM! :)
On Sep 24, 2005, at 10:10 AM, Brian Candler wrote:
> Hello,
>
> I was wondering if anyone would care to share their experiences in
> synchronising filesystems across a number of nodes in a cluster. I can
> think of a number of options, but before changing what I'm doing at the
> moment I'd like to see if anyone has good experiences with any of the
> others.
>
> The application: a clustered webserver. The users' CGIs run in a chroot
> environment, and these clearly need to be identical (otherwise a CGI
> running on one box would behave differently when running on a different
> box). Ultimately I'd like to synchronise the host OS on each server too.
>
> Note that this is a single-master, multiple-slave type of filesystem
> synchronisation I'm interested in.
I just wanted to throw out some quick thoughts on a totally different
approach, one nobody has really explored in this thread: solutions that
are production-level software. (Sorry if I'm repeating things or giving
out info y'all already know:)
--
Geom:
http://www.freebsd.org/doc/en_US.ISO8859-1/books/handbook/geom-intro.html
The core disk I/O framework for FreeBSD, as of 5.x, led by PHK:
http://www.bsdcan.org/2004/papers/geom.pdf
The framework itself is not as useful to you as the utilities built on
top of it.
--
Geom Gate:
http://kerneltrap.org/news/freebsd?from=20
A network, device-level client/server disk-mapping tool.
(VERY IMPORTANT COMPONENT: it's reportedly faster and more stable than
NFS has ever been, so people have immediately and happily deployed it
in production systems!)
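To make that a bit more concrete, here's a minimal sketch of mapping one
slice with geom_gate as I understand the tools; the IP addresses and
device names below are made up, and you need root on both boxes:

```shell
# --- On the server (the box that physically owns the disk) ---
# /etc/gg.exports lists which client may map which device, and how.
# (192.168.0.2 and /dev/da0s1d are placeholder values.)
echo "192.168.0.2 RW /dev/da0s1d" >> /etc/gg.exports

# Start the geom_gate daemon; it serves whatever gg.exports allows.
ggated

# --- On the client ---
# Map the remote device; this creates a local /dev/ggate0.
ggatec create -o rw 192.168.0.1 /dev/da0s1d

# From here /dev/ggate0 behaves like any local disk:
mount /dev/ggate0 /mnt
```

The nice part is that everything downstream (newfs, gmirror, vinum)
just sees an ordinary disk device.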
--
Gvinum and Gmirror:
Gmirror
http://people.freebsd.org/~rse/mirror/
http://www.ie.freebsd.org/doc/en_US.ISO8859-1/books/handbook/geom.html
(Sidenote: even Greg Lehey (the original author of Vinum) has stated
that it's better to use GEOM-based tools than Vinum for the foreseeable
future.)
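For reference, building a simple two-disk mirror with gmirror looks
something like the following (the disk names are placeholders, and
labeling writes metadata to the disks, so only run this on drives you
mean to use):

```shell
# Load the mirror class (or compile GEOM_MIRROR into the kernel).
kldload geom_mirror

# Label a new mirror 'gm0' over two disks; -b picks the read-balance
# algorithm (round-robin here), -v is verbose.
gmirror label -v -b round-robin gm0 /dev/ad0 /dev/ad1

# The mirror appears as /dev/mirror/gm0 and can be newfs'd and
# mounted like a plain disk:
newfs /dev/mirror/gm0
mount /dev/mirror/gm0 /mnt

# Check mirror health / rebuild progress:
gmirror status
```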
--
In a nutshell, to address your needs, let me toss out the following
example setup:
I know of one web shop in Canada that runs two machines for every
virtual cluster, in the following configuration:
- 2 servers
- 4 SATA drives per box
- a quad-port copper gigabit ethernet NIC in each box
- each drive is mirrored using gmirror, over one of the gigabit
  ethernet NICs
- each box runs vinum RAID-5 across the 4 mirrored drives
The drives are then sliced appropriately, and server resources are
distributed across the boxes, with various slices mounted on each box.
The folks I speak of simply have a suite of failover shell scripts
prepared, in the event of a machine experiencing total hardware failure.
Pretty tough stuff, very high-performance, and CHEAP.
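If I understand their layout right, the GEOM side of it would look
roughly like the sketch below. All device names, IPs, and the volume
name are invented, and the real setup obviously needs the ggatec
mappings in place before gmirror can see the remote halves:

```shell
# On box A: map box B's disks over the dedicated gigabit links
# (one ggatec per disk; IPs/devices are placeholders).
ggatec create -o rw 10.0.0.2 /dev/ad4   # becomes /dev/ggate0
ggatec create -o rw 10.0.1.2 /dev/ad6   # becomes /dev/ggate1
# ... and likewise for the other two drives.

# Mirror each local disk against its remote twin.
gmirror label -v gm0 /dev/ad4 /dev/ggate0
gmirror label -v gm1 /dev/ad6 /dev/ggate1
# ... gm2, gm3 likewise.

# Then a gvinum RAID-5 volume across the four mirrors, built from
# a config file fed to 'gvinum create':
cat > /tmp/raid5.conf <<'EOF'
drive d0 device /dev/mirror/gm0
drive d1 device /dev/mirror/gm1
drive d2 device /dev/mirror/gm2
drive d3 device /dev/mirror/gm3
volume web
  plex org raid5 512k
    sd length 0 drive d0
    sd length 0 drive d1
    sd length 0 drive d2
    sd length 0 drive d3
EOF
gvinum create /tmp/raid5.conf
```

(A length of 0 in a vinum config means "use the rest of the drive".)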
--
With that, I'm working towards similar setups, oriented around
redundant jailed systems, with the eventual goal of tying CARP (from
pf) into the mix to make for nearly-instantaneous jail-failover
redundancy (but it's going to be some time before I have what I want
worked out for production on my own).
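For what it's worth, the CARP half of that is already simple to
experiment with: a virtual IP shared between the two boxes looks
something like this (the vhid, password, and addresses are
placeholders, and you need a kernel with 'device carp'):

```shell
# On the primary box: create a carp interface carrying the shared IP.
# The box advertising with the lower advskew becomes master; if it
# dies, the peer takes over the address within a few seconds.
ifconfig carp0 create
ifconfig carp0 vhid 1 pass s3cret advskew 0 192.168.1.50/24

# On the standby box: the same, but with a higher advskew so it only
# wins the election when the primary stops advertising:
#   ifconfig carp0 create
#   ifconfig carp0 vhid 1 pass s3cret advskew 100 192.168.1.50/24
```

The failover scripts would then hang off the carp interface's
master/backup state transitions.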
Regardless, it's worth tapping into the GEOM dialogues, as there are
many new ways of working with disks coming into existence, and the GEOM
framework itself provides an EXTREMELY solid base for bringing 'exotic'
disk configurations up to production level quickly.
(Also noteworthy: there are a couple of encrypted-disk systems based on
GEOM emerging now too...)
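gbde(8) is the one already in the tree; attaching an encrypted slice
looks roughly like this, with the device and lock-file paths made up:

```shell
# One-time initialization of the encrypted device; -i opens an editor
# for the parameters, -L stores the lock file outside the device.
gbde init /dev/ad0s1d -i -L /etc/gbde/ad0s1d.lock

# Attach (prompts for the passphrase); the cleartext device appears
# as /dev/ad0s1d.bde and is newfs'd and mounted like any other disk.
gbde attach ad0s1d -l /etc/gbde/ad0s1d.lock
newfs /dev/ad0s1d.bde
mount /dev/ad0s1d.bde /private
```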