  1. #1
    Member
    Join Date
    Jan 2008
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Failure Code 42 and 14 - Job Failure

    The server is on Linux and the client is on Windows. The client can ping the server, and the communication ports are set correctly (defaults). The job fails at about 8% every time.

    I enabled more detailed logging; the most notable parts are included below:
    "dtcp_read: TCP - failure: recv socket (372) (TCP 10058: Can't send after socket shutdown)
    10:03:38.816 AM: [1000.1564] <16> dtcp_read: TCP - failure: recv socket (372) (TCP 10058: Can't send after socket shutdown)
    10:03:39.831 AM: [1000.1564] <16> dtcp_read: TCP - failure: recv socket (372) (TCP 10058: Can't send after socket shutdown)
    10:03:40.847 AM: [1000.1564] <16> dtcp_read: TCP - failure: recv socket (372) (TCP 10058: Can't send after socket shutdown)"
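
As a rough aid for reading logs like the excerpt above, the sketch below pulls the numeric code out of each "(TCP nnnnn: ...)" fragment and counts how often each one appears. The meanings in `WINSOCK_ERRORS` are the standard Winsock definitions; the function name `count_tcp_codes` and the sample text are illustrative assumptions, not part of NetBackup.

```python
import re
from collections import Counter

# Standard Winsock error codes and their documented meanings.
WINSOCK_ERRORS = {
    10053: "WSAECONNABORTED: software caused connection abort",
    10054: "WSAECONNRESET: connection reset by peer",
    10058: "WSAESHUTDOWN: cannot send after socket shutdown",
    10060: "WSAETIMEDOUT: connection timed out",
}

# Matches the "(TCP 10058: ..." fragment; "(372)" is skipped because it lacks "TCP ".
TCP_CODE = re.compile(r"\(TCP (\d+):")


def count_tcp_codes(log_text):
    """Return a Counter mapping each Winsock code found in log_text to its count."""
    return Counter(int(m.group(1)) for m in TCP_CODE.finditer(log_text))


# Two lines copied from the log excerpt in this thread.
SAMPLE = (
    "10:03:38.816 AM: [1000.1564] <16> dtcp_read: TCP - failure: recv socket (372) "
    "(TCP 10058: Can't send after socket shutdown)\n"
    "10:03:39.831 AM: [1000.1564] <16> dtcp_read: TCP - failure: recv socket (372) "
    "(TCP 10058: Can't send after socket shutdown)\n"
)

if __name__ == "__main__":
    for code, count in count_tcp_codes(SAMPLE).items():
        print(code, count, WINSOCK_ERRORS.get(code, "unknown code"))
```

A long run of a single repeated code, as seen here, suggests the client keeps retrying on a socket that has already been closed rather than hitting several unrelated failures.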

    Has anyone run into this before? I have not been able to find anything helpful on the Symantec site regarding this.

  2. #2
    Moderator
    Join Date
    Sep 2002
    Location
    Indiana
    Posts
    2,560
    Thanks
    0
    Thanked 2 Times in 2 Posts

    Looks like one side has shut down the TCP connection without notifying the other side. You could sniff the network and look for the FIN packet to find out which side.

    Is this backup taking a long time? If so, is there a timeout value you can increase?
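
To make the FIN suggestion concrete, here is a minimal sketch of how the flag bits in a captured TCP header can be decoded to see which side sent a FIN (or RST). It assumes you already have the raw 20-byte TCP header from a capture; the function name `tcp_flags` and the port numbers in the example are hypothetical.

```python
import struct

# Flag bit positions in the TCP header flags byte (the 14th byte of the header).
FLAG_BITS = {"FIN": 0x01, "SYN": 0x02, "RST": 0x04, "PSH": 0x08, "ACK": 0x10, "URG": 0x20}


def tcp_flags(header):
    """Return (source_port, dest_port, [flag names]) for a raw TCP header."""
    if len(header) < 20:
        raise ValueError("a TCP header is at least 20 bytes long")
    # Ports, sequence, acknowledgement, data offset, flags, window, checksum, urgent pointer.
    src, dst, _seq, _ack, _offset, flags, _win, _csum, _urg = struct.unpack(
        "!HHIIBBHHH", header[:20]
    )
    names = [name for name, bit in FLAG_BITS.items() if flags & bit]
    return src, dst, names


# Hypothetical header: port 50000 sends FIN+ACK to port 50001.
EXAMPLE = struct.pack("!HHIIBBHHH", 50000, 50001, 1, 1, 5 << 4, 0x11, 8192, 0, 0)

if __name__ == "__main__":
    print(tcp_flags(EXAMPLE))
```

The source port on the first packet carrying FIN or RST identifies the side that closed the connection.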

    -Aaron
    Three things are certain:
    Death, taxes, and lost data.
    Guess which has occurred.

  3. #3
    Member
    Join Date
    Jan 2008
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts

    This backup fails quite quickly (usually after about 5 minutes) and ends up getting to about 5-8% completed before failing. All other servers are backing up correctly except for this one.

    Unfortunately I work for a very large company. I am on the Windows team, the server belongs to the Unix team, and the network team would have to be the ones to run the sniffer. Talk about red tape!

    Anyway, I found this thread, which is pretty informative and describes the same errors I am getting - https://forums.symantec.com/syment/board/message?board.id=21&thread.id=5862

    So I think what I am going to try is uninstalling the NetBackup agent completely, then reinstalling it and updating it to MP3 to see if that fixes it.


  4. #4
    Senior Member
    Join Date
    Apr 2005
    Location
    Michigan
    Posts
    1,359
    Thanks
    0
    Thanked 2 Times in 2 Posts

    Verify your NDD variables at the OS level. If you are failing after 5 minutes, you've run into a contention issue, or I would expect validation of files to be expiring.
    Run a check against your Comm and Idle timeout variables, and the number of retries as well.
    Give me the exact error and I'll go to the support site and dig into the knowledge base.
    In the meantime, check your media server and the media being requested. Perhaps you have run into a bad sector on the tape itself or dust on the tape drive head. Run some queries on the mount requests and see if there is a common fault.
    Good luck - let me know
    Steven Gabriel
    Principal -SGSolutions Inc.
    http://www.sgsolutionsinc.com

  5. #5
    Member
    Join Date
    Jan 2008
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts

    These are the error messages:
    02/06/2008 15:48:35 - Error bpdm (pid=24384) system call failed - Connection reset by peer (at child.c.1174)
    02/06/2008 15:48:35 - Critical bpbrm (pid=24149) from client %servername%: FTL - tar file write error (10054)
    02/06/2008 15:48:35 - Error bpdm (pid=24384) unable to perform read from client socket, connection may have been broken
    02/06/2008 15:48:36 - Error bpdm (pid=24347) media manager terminated by parent process
    02/06/2008 15:48:38 - Error bpbrm (pid=24149) could not send server status message
    02/06/2008 15:48:40 - Warning bpbrm (pid=24149) Could not set linger value on socket. Errno = 22: Invalid argument
    02/06/2008 15:48:44 - end writing; write time: 0:01:25
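
When comparing several failed runs, it can help to split job-detail lines like the ones above into fields. The sketch below is one way to do that; the regular expression is inferred only from the lines shown in this thread, the date is assumed to be month/day/year, and `parse_job_log` is a hypothetical helper name.

```python
import re
from datetime import datetime

# Pattern inferred from the lines in this thread:
#   <date> <time> - <Severity> <process> (pid=<n>) <message>
LINE = re.compile(
    r"^(?P<when>\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}) - "
    r"(?P<severity>\w+) (?P<process>\w+) \(pid=(?P<pid>\d+)\) (?P<message>.*)$"
)


def parse_job_log(text):
    """Return one dict per line that matches LINE; other lines are skipped."""
    records = []
    for raw in text.splitlines():
        match = LINE.match(raw.strip())
        if not match:
            continue  # e.g. the "end writing; write time" summary line
        records.append({
            # Assumption: dates are month/day/year.
            "when": datetime.strptime(match["when"], "%m/%d/%Y %H:%M:%S"),
            "severity": match["severity"],
            "process": match["process"],
            "pid": int(match["pid"]),
            "message": match["message"],
        })
    return records


# Three lines copied from the post above.
SAMPLE = """\
02/06/2008 15:48:35 - Error bpdm (pid=24384) system call failed - Connection reset by peer (at child.c.1174)
02/06/2008 15:48:35 - Critical bpbrm (pid=24149) from client %servername%: FTL - tar file write error (10054)
02/06/2008 15:48:44 - end writing; write time: 0:01:25
"""

if __name__ == "__main__":
    for record in parse_job_log(SAMPLE):
        print(record["when"], record["severity"], record["process"], record["message"])
```

Grouping the parsed records by process (bpdm versus bpbrm) makes it easier to see which process reported the failure first.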

  6. #6
    Member
    Join Date
    Jan 2008
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts

    This is a good troubleshooting guide for this specific problem, although I have done everything short of changing the NIC on the server.
    http://ftp.support.veritas.com/pub/s...ver/278568.pdf

    I will be changing the NIC on the server tomorrow evening and will post the results.

  7. #7
    Member
    Join Date
    Jan 2008
    Posts
    13
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Ok, so I ended up getting this backup to work without changing the NIC.

    The server in question was a Biscom Faxcom Server. Before this, I had thought it was a regular Windows Server 2003 machine with fax software installed on it. My company was already having problems with our disk staging running out of space: other servers were reporting a 129 disk full error, while this particular one was reporting the error that this thread is about.

    So for kicks, I changed the job to back up straight to tape and ran it manually for the one that always failed and, WTF, it worked.

    So I am concluding that this server is running into the disk full error but reporting it differently, since it is a non-standard machine.
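
For anyone wanting to rule out the same cause, a quick free-space check on the disk staging volume before the job runs is one option. This is only a sketch: the path and the required size are placeholders you would supply, `free_space_report` is a hypothetical helper, and it says nothing about how NetBackup itself decides that a disk storage unit is full.

```python
import shutil


def free_space_report(path, needed_bytes):
    """Report total and free bytes on the volume holding path, and whether needed_bytes fits."""
    usage = shutil.disk_usage(path)
    return {
        "total": usage.total,
        "free": usage.free,
        "enough": usage.free >= needed_bytes,
    }


if __name__ == "__main__":
    # Placeholder values: replace "." with the staging path and 10 GiB with the expected backup size.
    report = free_space_report(".", 10 * 1024 ** 3)
    print(report)
```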

  8. #8
    Member
    Join Date
    Sep 2008
    Posts
    11
    Thanks
    0
    Thanked 0 Times in 0 Posts

    Resolution?

    Was the resolution of this confirmed to be disk space?
