Computer issues...
Nov. 7th, 2004 03:36 pmSomething is up with my computer.
Either it's the PSU, CPU or the graphics card. Whatever it is, it is NOT drivers, because I'm having the exact same issues in both Linux and Windows. Whenever my CPU usage jumps up to more than 30%, my monitor clicks and goes blank (like a mode change) for a fraction of a second then comes back on, and sometimes my computer will simply restart.
Just to say again, this is not a driver issue. Problems occur in both Windows and Linux. I've run Memtest on my memory just on the offchance it was that, but my memory is fine.
Now I've been monitoring my voltage coming from my PSU and there doesn't appear to by any major drops there. It could be the CPU, but I don't see how that would be drawing enough power to cut out the graphics card... Shit, it could be my motherboard. It seems to be intermittent, though. Because generally whatever OS I'm booting to will manage to get up and running without any issues, and occasionally I can work with 100% CPU with no trouble, and others it'll just die.
Heating issues.... Well, I've never had this problem before, and I've check that the fans are working without issues. CPU temp never seems to get over 50'C and is generally between 35'C-45'C when idle. Motherboard temp is less and doesn't rise above 40'C for sure. I don't think it's heating because I've had this machine compiling KDE for 16 hours straight (100% CPU usage) without any issues whatsoever quite recently.
I haven't added any new hardware for ages, either.
Specs:
2.53GHz Pentium 4
1024MB RAM
GeForce 4 Ti4600
Primary OS is Gentoo Linux, but I dual boot to WindowsXP too.
One 250G HDD, one 80G HDD.
450W PSU.
Any advice?
X-Posted.
Either it's the PSU, CPU or the graphics card. Whatever it is, it is NOT drivers, because I'm having the exact same issues in both Linux and Windows. Whenever my CPU usage jumps up to more than 30%, my monitor clicks and goes blank (like a mode change) for a fraction of a second then comes back on, and sometimes my computer will simply restart.
Just to say again, this is not a driver issue. Problems occur in both Windows and Linux. I've run Memtest on my memory just on the offchance it was that, but my memory is fine.
Now I've been monitoring my voltage coming from my PSU and there doesn't appear to by any major drops there. It could be the CPU, but I don't see how that would be drawing enough power to cut out the graphics card... Shit, it could be my motherboard. It seems to be intermittent, though. Because generally whatever OS I'm booting to will manage to get up and running without any issues, and occasionally I can work with 100% CPU with no trouble, and others it'll just die.
Heating issues.... Well, I've never had this problem before, and I've check that the fans are working without issues. CPU temp never seems to get over 50'C and is generally between 35'C-45'C when idle. Motherboard temp is less and doesn't rise above 40'C for sure. I don't think it's heating because I've had this machine compiling KDE for 16 hours straight (100% CPU usage) without any issues whatsoever quite recently.
I haven't added any new hardware for ages, either.
Specs:
2.53GHz Pentium 4
1024MB RAM
GeForce 4 Ti4600
Primary OS is Gentoo Linux, but I dual boot to WindowsXP too.
One 250G HDD, one 80G HDD.
450W PSU.
Any advice?
X-Posted.
no subject
Date: 2004-11-07 07:43 am (UTC)A couple of ideas would be to try pulling out, say, a CD-writer to see if it's power related, or underclock the CPU. If it's a timing issue in the BIOS then you may find it goes away if you *overclock* the CPU slightly.
Good luck - these sort of things are never fun to troubleshoot!
no subject
Date: 2004-11-07 07:45 am (UTC)Then again, I'm no PSU expert, so I'll try that just in case.
no subject
Date: 2004-11-07 07:49 am (UTC)I'm no PSU expert either, but I know a bit about what they supply and how they work.
no subject
Date: 2004-11-07 08:03 am (UTC)no subject
Date: 2004-11-07 08:08 am (UTC)Another possibility I forgot was some sort of hardware conflict. At my past school the main server was randomly rebooting (as in worked fine for a few days, then went straight to the POST from windows without passing go and without collecting £200). The culprit: apparently the unused but enabled onboard network controller. Disabling that cured it.
no subject
Date: 2004-11-07 08:10 am (UTC)no subject
Date: 2004-11-07 08:14 am (UTC)It does however sit on the BSoD *just* long enough for you to go "what the...shit!"
no subject
Date: 2004-11-07 08:16 am (UTC)I once left a machine at work rebooting as I left for the weekend. Got in on the monday, and it had been doing the bluescreen / reboot thing all weekend. Whoops.
no subject
Date: 2004-11-07 08:17 am (UTC)I've had computers do weird power related shit like that too due to motherboads or PSUs fucking out.
no subject
Date: 2004-11-07 08:22 am (UTC)*randomly friends you*
Popped caps?
Date: 2004-11-07 08:27 am (UTC)You have a newer system, so I doubt you'll find any, but a couple of years ago a ton of motherboards were made with bootleg capacitors that built up gas and popped.
Your problem definately sounds like a power-related issue. I'd agree with trying a different PSU.
no subject
Date: 2004-11-07 08:36 am (UTC)*randomly adds you back* :)
no subject
Date: 2004-11-07 08:47 am (UTC)no subject
Date: 2004-11-07 09:15 am (UTC)no subject
Date: 2004-11-07 09:17 am (UTC)no subject
Date: 2004-11-07 09:21 am (UTC)hmmmm.....
Date: 2004-11-07 12:57 pm (UTC)I've seen GPU (aka video cards) cause spurious freezes and reboots. Start there, especially with the spurious "shuts off video and comes back or resets" issue acting as the major pointer here. Try swapping it for a known good card.
There was discussion earlier about BSODs. Typically, if it's a hard ware re-set issue there won't be an entry in the system log, only one that says that the previous shutdown was unexpected. Linux will usually do an fsck after this, which will be reflected in the syslog after it comes back up. rarely, it won't come back easily from a fsck, which means either a single user mode startup or other entertaining issues. Usually, unless you pulled down the debugger for dump analysis, it's pretty hard to diagnose a BSOD issue based on the stop error alone some times (the descriptions can get quite vague, and at one point I had to resort to using the dump analysis tools to tell me that it was a peice of software behaving badly))
I don't think it's a CPU issue, because according to what you've stated, a 16 hour compile session runs ok.
I'd borrow (or get) a thrmometer, and start checking the temp of most of the logic chips on the video card and mainboard, and start from there. for the GPU, I'd also look at the support logic. Sometimes if it goes flakey, it can cause problems.
Re: hmmmm.....
Date: 2004-11-07 01:25 pm (UTC)Under windows you don't even get that - all you get is a "eventlog started" without a "eventlog stopped" just before it. (A BSoD will log a "Save dump" entry with the details of the bugcheck). And if you're running NTFS, then you don't even get the obligatory whining about lost clusters and crosslinked files. AFAIK, you only get any sense out of the BSoD if you save at least a kernel dump, or have a debugger talking to the computer (and then you probably need to run a 'checked' version or something fancy). Then again, you actually get something, which is more than you get with the hardware deciding to go back to square one.
Re: hmmmm.....
Date: 2004-11-07 02:22 pm (UTC)The only time you can make any real sense out of a BSOD is to run the debugging tools on the dump file. There are checked versions and retail versions of the symbols availible on the Microsoft web site (you *will* have to dig around for them a bit to find them, unfortuantely, one has to wonder why they don't want you looking at why it crashed? ;) The debugging tools are also there as well)
I know of very few techs that would even bother with the serial console and debugging. One would assume that it would be for serious development work (and I'm not sure about that even).
Between the BSOD and the debugger tools, I can usually figure out what program at least caused the show stopper, and take actions from there. That's how I ultimately found out that the Jetsuite software for my LaserJet 3100 does not like XP, even though HP says it ought to work. It also provided me with a clue that the A/V software on a few systems at work caused STOP errors.
As far as the reset issue goes, That's almost always caused by hardware faults. Very seldom is it software caused.
Re: hmmmm.....
Date: 2004-11-07 02:34 pm (UTC)You *will* also curse when you realise that the files run into 3-digit numbers of megabytes and you're on dialup.
I imagine you'd use a serial console when developing a device driver, especially one that loads early on in the boot process (say a SCSI driver). Otherwise you can't really step thru it or see what messages it throws up.
BTW, Win2k3 might have the shutdown message back, as it certainly asks you just why you wanted to shut the computer down in the first place.
Re: hmmmm.....
Date: 2004-11-07 02:40 pm (UTC)BTW, Win2k3 might have the shutdown message back, as it certainly asks you just why you wanted to shut the computer down in the first place.
"But I don't *want* to shut down! I'm a server, I can't shut do-" BEEP!
I find humorous irony (especially when I'm dealing with a machine that is behaving _really badly_) that the shut down option is located in the Start menu.
And I'm sure that server '03 had it's teething problems before it went gold. I know ours has been performing like a champ, but then, we don't have much running on it. Come to think about it, all our servers have been performing pretty decently, except the mail server... hmm...
no subject
Date: 2004-11-07 03:53 pm (UTC)no subject
Date: 2004-11-07 04:46 pm (UTC)I ended up doing a BIOS flash. Fixed it up wonderfully :)
no subject
Date: 2004-11-07 05:12 pm (UTC)no subject
Date: 2004-11-10 09:56 pm (UTC)I'd switch to ATI's cards, if I hadn't been using Nvidia's ever since 3DFX stopped making Voodoo cards.
Re: Popped caps?
Date: 2004-11-10 09:57 pm (UTC)Re: hmmmm.....
Date: 2004-11-10 10:00 pm (UTC)Then again, my dad's even more of a maniacal computer genius than I am, and I'm the one running modern games on a 733 celeron with pc100 ram and a PCI video card. Doom 3 anyone??? >=D
no subject
Date: 2004-11-11 05:41 am (UTC)