[TriLUG] server up for 262 days

Dana Smith Dana.Smith at altroutestudios.com
Fri Nov 30 10:36:16 EST 2001


I've got ~50 boxes that have been up for ~464 days. Anybody know if I'm looking at the same limit with Red Hat 6.0?

> Dana L. Smith
> Alternate Route Studios
> (919) 531-4116
> Dana.Smith at altroutestudios.com
> http://www.altroutestudios.com
> 


-----Original Message-----
From: scott jacobs [mailto:sjacobs at plurimus.com]
Sent: Friday, November 30, 2001 10:01 AM
To: trilug at trilug.org
Subject: Re: [TriLUG] server up for 262 days


On Thu, Nov 29, 2001 at 09:59:12PM -0500, Andrew C. Oliver wrote:
> Just my little linux stability testimonial. 

And here's mine... :)

We had two boxes make it to 497 days, 2 hours, and 17 or 26 minutes,
depending on the box; i.e., they both died within 9 minutes of each other
(within the +/- 5 minute range of our custom heartbeat monitoring).  Do
some math and you'll see that is about 2^32/100 seconds, the point at which
that particular kernel's jiffies counter (32 bits, 100 ticks per second)
rolls over.  The load on the boxes spiked dramatically and they went
down... down... down...  We don't know which particular program or kernel
piece couldn't handle it, and it would require a lot of patience to debug.
(Yes, we considered building a kernel with an uptime counter that doesn't
start at 0. ;) )  Instead, we'll just reboot before that from now on.
We've got 7 other boxes at 300+ days.  I think the boxes that died were
RH 5.2 boxes running 2.0.36.
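
For anyone who wants to check the math, here is a rough sketch in Python.
It assumes an unsigned 32-bit tick counter that starts at 0 and is bumped
100 times per second, which is what the 2^32/100 figure implies.  The
function name rollover_uptime is made up for illustration; it is not
anything from the kernel.

```python
SECONDS_PER_DAY = 86400

def rollover_uptime(counter_bits=32, hz=100):
    """Return (days, hours, minutes, seconds) until a tick counter wraps.

    Assumption: an unsigned counter of `counter_bits` bits that starts
    at 0 and is incremented `hz` times per second.
    """
    # Whole seconds elapsed before the counter wraps back to 0.
    total_seconds = (2 ** counter_bits) // hz
    days, rem = divmod(total_seconds, SECONDS_PER_DAY)
    hours, rem = divmod(rem, 3600)
    minutes, seconds = divmod(rem, 60)
    return days, hours, minutes, seconds

if __name__ == "__main__":
    d, h, m, s = rollover_uptime()
    print(f"{d} days, {h} hours, {m} minutes, {s} seconds")
```

Under those assumptions the wrap lands at 497 days, 2 hours, and a bit
under 28 minutes, in the same neighborhood as the uptimes reported above.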

> 
> So anyhow, once it rolls around to about a year I'll reboot it.
> 
> -Andy

You can push it longer than that. :)

scott


-- 
---------------------------------------------------------------------
scott jacobs                                     plurimus corporation
---------------------------------------------------------------------