Brent A Nelson
19 years ago
Well, thanks to several people on the list, I got Lustre 1.4.5 running on
my test setup (with Ubuntu Breezy, no less!), and it seems stable (no
problems so far that I didn't cause myself ;-)).
However, I've noticed that some things are performing rather subpar in
some limited testing with a single client. Large sequential reads and
writes seem quick (although perhaps not nearly as quick as would be
theoretically possible with this setup, it might make up for that when
multiple clients are running), but "ls -lR >/dev/null" (even with just a
single client and no other activity) and a "cp -a /usr /lustre1/test1"
(~3.5 minutes for a <350MB /usr) both perform more slowly than to an older
Linux box running NFS on fast ethernet (my Lustre servers have
channel-bonded gigabit, and dual PIII 1GHz processors rather than the NFS
server's 450MHz processors).
I tried increasing the lru_size on everything, but that didn't seem to
have any effect at all in this scenario (maybe it only matters when there
are many more clients). I also added mballoc and extents to the mount
options for the OSTs (small effect, if any). Setting the debug level to
zero helped significantly, but it's still much slower than NFS. The cp
takes maybe 50% longer than NFS and the ls takes about 300% longer. The
numbers are fairly similar whether I have 2 OSS servers serving 3 drbd
mirrors each, or the same servers just serving out a logical volume from
the system drive on each (although the more complex scenario is actually a
little faster for these tests, but still much slower than NFS). I
originally had the MDS on one of the OSS servers and tried moving it to a
third server, but the speed stayed the same.
Any ideas?
Many thanks! I know I've been rather scant on details; just let me know
and I'll provide whatever info you need.
Brent Nelson
Director of Computing
Dept. of Physics
University of Florida
my test setup (with Ubuntu Breezy, no less!), and it seems stable (no
problems so far that I didn't cause myself ;-)).
However, I've noticed that some things are performing rather subpar in
some limited testing with a single client. Large sequential reads and
writes seem quick (although perhaps not nearly as quick as would be
theoretically possible with this setup, it might make up for that when
multiple clients are running), but "ls -lR >/dev/null" (even with just a
single client and no other activity) and a "cp -a /usr /lustre1/test1"
(~3.5 minutes for a <350MB /usr) both perform more slowly than to an older
Linux box running NFS on fast ethernet (my Lustre servers have
channel-bonded gigabit, and dual PIII 1GHz processors rather than the NFS
server's 450MHz processors).
I tried increasing the lru_size on everything, but that didn't seem to
have any effect at all in this scenario (maybe it only matters when there
are many more clients). I also added mballoc and extents to the mount
options for the OSTs (small effect, if any). Setting the debug level to
zero helped significantly, but it's still much slower than NFS. The cp
takes maybe 50% longer than NFS and the ls takes about 300% longer. The
numbers are fairly similar whether I have 2 OSS servers serving 3 drbd
mirrors each, or the same servers just serving out a logical volume from
the system drive on each (although the more complex scenario is actually a
little faster for these tests, but still much slower than NFS). I
originally had the MDS on one of the OSS servers and tried moving it to a
third server, but the speed stayed the same.
Any ideas?
Many thanks! I know I've been rather scant on details; just let me know
and I'll provide whatever info you need.
Brent Nelson
Director of Computing
Dept. of Physics
University of Florida