Opened 8 years ago

Last modified 4 years ago

#396 new defect

Mondo does not timeout or exit if the tape drive fails

Reported by: tastle73 Owned by: bruno
Priority: normal Milestone:
Component: mondo Version: 2.2.9.1
Severity: normal Keywords:
Cc:

Description (last modified by bruno)

I noticed on a few occasions that when running mondorescue in verify or backup mode my tape drive will throw some kind of SCSI error. Rather than failing when it cannot read the tape, mondo runs away and takes 100% of cpu and never exits.

The command is called from a script called from cron:

mondoarchive -Vt -0 -d /dev/nst0 -o -N -e

/var/log/mondoarchive.log:

buffer (reader): failed to read input: Input/output error
                        [Main] libmondo-stream.c->read_file_from_stream_FULL#1090: where_I_was_... = 11927552
        [Main] libmondo-stream.c->start_to_read_from_next_tape#1438: Next tape requested.
        [Main] libmondo-stream.c->insist_on_this_tape_number#465: Insisting on tape #2
When the tape drive goes quiet, please insert volume 2 in this series.
Waiting while the tape drive settles
Waiting while the tape drive settles
        [Main] libmondo-stream.c->insist_on_this_tape_number#482: I assume user has inserted it. They _say_ they have...
        [Main] libmondo-stream.c->insist_on_this_tape_number#486: OK, I've finished insisting. On with the revelry.
        [Main] libmondo-stream.c->start_to_read_from_next_tape#1440: Opening IN the next tape
root is mounted at /dev/mapper/vg-lv03

No, Schlomo, that doesn't mean /dev/mapper/vg-lv03 is the root partition. It's just a debugging message. Relax. It's part of am_I_in_disaster_recovery_mode()
[Main] libmondo-devices.c->am_I_in_disaster_recovery_mode#147: Is this a ramdisk? result = 0
running: dd if=/dev/zero bs=1024 count=16k 2> /dev/null | buffer -o /dev/null -s 32768 -m 32m > /home/mondo.tmp.WCWYnD/mondo-run-prog-thing.tmp 2> /home/mond
--------------------------------start of output-----------------------------
--------------------------------end of output------------------------------
...ran just fine. :-)
Negotiated max buffer of 32 MB 
        [Main] libmondo-fifo.c->open_device_via_buffer#121: Calling buffer --- command = 'buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchiv
        [Main] libmondo-fifo.c->open_device_via_buffer#125: Successfully opened ('r') tape device /dev/nst0
running: ps auxww | grep "buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchive.log" > /home/mondo.tmp.WCWYnD/mondo-run-prog-thing.tmp 2> /hom
--------------------------------start of output-----------------------------
root     25852  0.0  0.0   8700   968 ?        S    01:15   0:00 sh -c buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchive.log
root     25855  0.0  0.0   8700   956 ?        S    01:15   0:00 sh -c ps auxww | grep "buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchive.
root     25857  0.0  0.0   6020   532 ?        D    01:15   0:00 grep buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchive.log
--------------------------------end of output------------------------------
...ran just fine. :-)
running: ps auxww | grep buffer | grep -v grep > /home/mondo.tmp.WCWYnD/mondo-run-prog-thing.tmp 2> /home/mondo.tmp.WCWYnD/mondo-run-prog-thing.err
--------------------------------start of output-----------------------------
root     25852  0.0  0.0   8700   968 ?        S    01:15   0:00 sh -c buffer -m 32m -p20 -B -s32768 -i /dev/nst0 2>> /var/log/mondoarchive.log
root     25853  1.5  0.0  34284   344 ?        D    01:15   0:00 buffer -m 32m -p20 -B -s32768 -i /dev/nst0
root     25854  0.0  0.0  34284   116 ?        S    01:15   0:00 buffer -m 32m -p20 -B -s32768 -i /dev/nst0
--------------------------------end of output------------------------------
...ran just fine. :-)
Buffer successfully started.
buffer (reader): failed to read input: Input/output error

Change History (3)

comment:1 Changed 8 years ago by bruno

  • Description modified (diff)

comment:2 Changed 8 years ago by bruno

  • Milestone changed from 2.2.9.3 to 2.2.10

comment:3 Changed 4 years ago by bruno

  • Milestone 3.1.0 deleted

Milestone 3.1.0 deleted

Note: See TracTickets for help on using tickets.