PDA

View Full Version : SLES 10 boot problem - urgent!



bkesting
22-Oct-2012, 19:06
I have a SLES 10 box that lost a drive in a RAID 5 array. I took out the drive and in the meantime I co-worker, for whatever reason, rebooted the server. The server recognizes the drive (now degraded) and brings up the SLES 10 splash screen. When I try to boot the OS, it is hanging on "Loading ata_piix".

How can I get past this or disable ata_piix from loading so I can get this system booted?

Thanks

malcolmlewis
22-Oct-2012, 19:29
I have a SLES 10 box that lost a drive in a RAID 5 array. I took out
the drive and in the meantime I co-worker, for whatever reason, rebooted
the server. The server recognizes the drive (now degraded) and brings
up the SLES 10 splash screen. When I try to boot the OS, it is hanging
on "Loading ata_piix".

How can I get past this or disable ata_piix from loading so I can get
this system booted?

Thanks



Hi
Add the grub boot option;


brokenmodules=ata_piix


--
Cheers Malcolm °¿° (Linux Counter #276890)
openSUSE 12.2 (x86_64) Kernel 3.4.11-2.16-desktop
up 15:12, 4 users, load average: 0.09, 0.08, 0.05
CPU Intel i5 CPU M520@2.40GHz | Intel Arrandale GPU

bkesting
22-Oct-2012, 19:49
I shut the system down.....reseated everything and I have now gotten past the ata_piix error. It is now sitting at "Replaying journal"..........I believe this is some type of file system check. Can I expect this to take a long time?

(I'll remember your suggestion about that grub boot option.....thanks!)

malcolmlewis
22-Oct-2012, 20:06
I shut the system down.....reseated everything and I have now gotten
past the ata_piix error. It is now sitting at "Replaying
journal"..........I believe this is some type of file system check. Can
I expect this to take a long time?

(I'll remember your suggestion about that grub boot option.....thanks!)



I imagine so, what filesystem are you running, reiserfs?

--
Cheers Malcolm °¿° (Linux Counter #276890)
openSUSE 12.2 (x86_64) Kernel 3.4.11-2.16-desktop
up 15:47, 4 users, load average: 0.01, 0.05, 0.10
CPU Intel i5 CPU M520@2.40GHz | Intel Arrandale GPU

bkesting
22-Oct-2012, 20:22
I imagine so, what filesystem are you running, reiserfs?

--
Cheers Malcolm °¿° (Linux Counter #276890)
openSUSE 12.2 (x86_64) Kernel 3.4.11-2.16-desktop
up 15:47, 4 users, load average: 0.01, 0.05, 0.10
CPU Intel i5 CPU M520@2.40GHz | Intel Arrandale GPU

Yes. It said filesystem was clean then started the check. It's been running for about 45 min to an hour now with the cursor flashing behind "Replaying journal". I just wonder if its still actually doing something or if it is hung up. The system had three 146 GB drives in a RAID 5 array, now the array is currently running degraded to two drives.

KBOYLE
22-Oct-2012, 20:32
bkesting wrote:

> The system had three 146 GB drives in a
> RAID 5 array, now the array is currently running degraded to two
> drives.

You have lost the data on one drive. The missing data is being
reconstructed from the parity information as it is needed. That is why
it is so slow.

It is not uncommon for a second drive failure to occur shortly after
the first one since all the drives in a RAID configuration are usually
subjected to the same environmental conditions. I recommend you replace
the defective drive and rebuild the array ASAP to avoid permanent data
loss.

--
Kevin Boyle - Knowledge Partner
If you find this post helpful and are using the web interface,
show your appreciation and click on the star below...

bkesting
22-Oct-2012, 20:37
bkesting wrote:

> The system had three 146 GB drives in a
> RAID 5 array, now the array is currently running degraded to two
> drives.

You have lost the data on one drive. The missing data is being
reconstructed from the parity information as it is needed. That is why
it is so slow.

It is not uncommon for a second drive failure to occur shortly after
the first one since all the drives in a RAID configuration are usually
subjected to the same environmental conditions. I recommend you replace
the defective drive and rebuild the array ASAP to avoid permanent data
loss.

--
Kevin Boyle - Knowledge Partner
If you find this post helpful and are using the web interface,
show your appreciation and click on the star below...

That is my plan......it just happens that the spare drive I had on hand for the array was bad as well. So I am waiting for a new drive(s) to get shipped here. In the meantime I need to get this system up. I will continue to let the "Replaying" to occur. So.....should I expect hours for 146 GB of data to be replaced in the journal replay?

bkesting
22-Oct-2012, 21:55
It's been "replaying" now for about 2 to 2.5 hours......should I let it continue or do you think it's hung?

malcolmlewis
22-Oct-2012, 22:15
It's been "replaying" now for about 2 to 2.5 hours......should I let it
continue or do you think it's hung?



Hi
You should be able to open a second VT (virtual terminal) by pressing
ctrl+alt+F2 (ctrl+alt+F1 to get back) and check the status via mdadm.

--
Cheers Malcolm °¿° (Linux Counter #276890)
openSUSE 12.2 (x86_64) Kernel 3.4.11-2.16-desktop
up 17:58, 3 users, load average: 0.00, 0.04, 0.09
CPU Intel i5 CPU M520@2.40GHz | Intel Arrandale GPU

bkesting
22-Oct-2012, 22:26
Ctrl+Alt+F2 does nothing......I think it's hung

bkesting
22-Oct-2012, 22:41
I have a USB keyboard and mouse hooked up to this system.....does that matter? I'm not sure either of them work anyway during the boot process

bkesting
22-Oct-2012, 22:57
I rebooted the system...........plugged in a USB keyboard, and now it hangs on ata_piix again. I tried the flag "brokenmodules=ata_piix" but it still hangs at ata_piix

bkesting
22-Oct-2012, 23:16
I meant PS2 keyboard