Networker

Re: [Networker] tapes getting marked full before actual capacity usage, read errors on tape

2011-05-01 09:01:35
Subject: Re: [Networker] tapes getting marked full before actual capacity usage, read errors on tape
From: jee <jee AT ERESMAS DOT NET>
To: NETWORKER AT LISTSERV.TEMPLE DOT EDU
Date: Sun, 1 May 2011 13:59:39 +0100
Hi Subin,

is this a new installation or has the configurationf the drives changed 
recently ? (please double check as this may have hapened ).

The problem is not with the recover. The problem occurred during the backups.

The error "media warning: /dev/nst0 reading: Input/output error" means that 
you have some problem with the media and/or the device.

In this case I would say the problem is on the media.  It looks like a SCSI 
reset  occurred during the backups (the saveset is browsable but the tape is 
not readable). If a SCSI reset occurrs, the drive rewinds the tape but NW is 
not aware of it and continues sending data to the drive. the media db and 
indices are updated according to a successful backup. If a R/W error occurs 
when reading or writing after tht NW marks the tape as full for security 
reasons.


Try scanning the tape in dummy mode (using -n) and add some verbosity 
using -vvv. Try /dev/nst0 first, then /dev/nst1. 

Load the tape without mounting and run  
 scanner -n -i -vvv /dev/nst0   >  scanner.<VOLNAME>.nst0.txt 2>&1 
 scanner -n -i -vvv /dev/nst1   >  scanner.<VOLNAME>.nst1.txt 2>&1 

and check the output.


Also check 
grep -e nst0 -e nst1 daemon.log  messages > and check for errors related to 
the drives during the tie frame of the RMAN backups

Also check /var/log files for SCSI errors within that timeframe .


Other checks: block size

load the offending tape without mounting on /dev/nst0 and run:
# mt -f /dev/nst0 status
(same on /dev/nst1)

That should show the blocksize (check NW device configuration and scanner for 
the block size) 



jee


On Sunday 01 May 2011 10:49:06 Subin Shahul Hameed wrote:
> Hi,
>
> I get read errors and restore takes very very long or get aborted,
> There were no write errors during backup.  What could be reason?  How
> can I do tape
> drive cleaning using cleaning tape from Networker?
>
> --------
> 11627 04/23/2011 10:57:11 AM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle browsing
> 11627 04/23/2011 10:57:11 AM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle browsing
> 38742 04/23/2011 10:58:51 AM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_498_1_KAU11PUB (4/22/11) starting read from AAD028
> of 27 GB
> 38742 04/23/2011 10:58:51 AM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_499_1_KAU11PUB (4/22/11) starting read from AAD028
> of 27 GB
> 38758 04/23/2011 11:17:00 AM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
> 42506 04/23/2011 12:00:00 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> savegroup info: skipped starting KAU11PUB1_ORCL_LEVEL0
> 38758 04/23/2011 12:24:13 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
> 38758 04/23/2011 01:32:30 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
> 38758 04/23/2011 01:47:40 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
> 38758 04/23/2011 02:07:42 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
> 38732 04/23/2011 02:19:40 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_498_1_KAU11PUB (4/22/11) done reading 27 GB
> 38732 04/23/2011 02:19:42 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_499_1_KAU11PUB (4/22/11) done reading 27 GB
> 11625 04/23/2011 02:30:35 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle done browsing
> 11627 04/23/2011 02:30:43 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle browsing
> 38742 04/23/2011 02:30:44 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_500_1_KAU11PUB (4/22/11) starting read from AAD028
> of 33 GB
> 11625 04/23/2011 02:31:08 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle done browsing
> 11627 04/23/2011 02:31:20 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> DRDB1:oracle browsing
> 38742 04/23/2011 02:31:25 PM  0 0 2 2863594064 4212 0 backup_db nsrd
> db1:RMAN:_749131206_497_1_KAU11PUB (4/22/11) starting read from AAD028
> of 37 GB
> 38758 04/23/2011 03:21:30 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst0 reading: Input/output error
>
> -------
>
> Also I see that tapes are marked full much before it reaches capacity,
> what could be the reason?.  I am using LTO4 tapes:
>
> ------
> 38758 04/24/2011 05:27:43 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media warning: /dev/nst1 writing: Input/output error, at file 192
> record 13255
> 42506 04/24/2011 05:27:43 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media notice: LTO Ultrium-4 tape AAD032 on /dev/nst1 is full
> 42506 04/24/2011 05:27:43 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media notice: LTO Ultrium-4 tape AAD032 used 105 GB of 800 GB capacity
> 42506 04/24/2011 05:27:46 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media info: WORM capable for device /dev/nst1 has been set
> 42506 04/24/2011 05:28:02 PM  2 0 0 2863594064 4212 0 backup_db nsrd
> media info: WORM capable for device /dev/nst1 has been set
> ------
>
> I am using the following:
> Backup Software:
> EMC Networker 7.5.1 Build 269
> Networker Module for Oracle 4.5
>
> Backup Hardware:
> Tape Library: FibreCAT TX48 with 2 LTO4 Drives and 12 slots (1 slot
> dedicated for cleaning tape).  (firmware upgraded to latest).
>
> Backup Server:
> OS: Redhat Enterprise Linux 5.1
>
> I need your help on this at the earliest.  Thanks in advance.
>
> Regards,
> Subin Hameed
>
> To sign off this list, send email to listserv AT listserv.temple DOT edu and 
> type
> "signoff networker" in the body of the email. Please write to
> networker-request AT listserv.temple DOT edu if you have any problems with 
> this
> list. You can access the archives at
> http://listserv.temple.edu/archives/networker.html or via RSS at
> http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

To sign off this list, send email to listserv AT listserv.temple DOT edu and 
type "signoff networker" in the body of the email. Please write to 
networker-request AT listserv.temple DOT edu if you have any problems with this 
list. You can access the archives at 
http://listserv.temple.edu/archives/networker.html or
via RSS at http://listserv.temple.edu/cgi-bin/wa?RSS&L=NETWORKER

<Prev in Thread] Current Thread [Next in Thread>