Retaining mid-pipeline exit status

I had a Bourne Shell (sh) script I needed to capture the exit status of, but it was being run through â€œteeâ€ to capture a log file, so â€œ$?â€ always returned the exit status of â€œteeâ€, not the script. In a nutshell, it went something like this:
#!/bin/sh DO_LOG=$1 LOGNAME="`hostname`.out" if [ "$DO_LOG" -eq "1" ]; then # Logging is turned on, so relaunch ourself with logging disabled, and tee the output to the logfile sh $0 0 | tee $LOGNAME exit $? fi #... Do lots of things in the script exit $ERRORCODE

Now, the important thing here is that the script sets very specific error codes (we have 16 defined) based on different error states, so that a tool like HP Opsware can give us different reports based on the exit status. When run with â€œ0â€³ for no logging, this works great, but it requires the controlling tool to capture logs, and not all do (especially cheap â€œforâ€ loops in a shell script.)

But when run with logging enabled, all of the fancy error code handling (45 lines of subroutinesâ€™ worth) gets lost, because â€œ$!â€ is equal to the status code of the â€œteeâ€ command. Bash scripters out there will say â€œbut what about $PIPESTATUS ?â€ If we could use bash, the code would be:
#!/bin/sh DO_LOG=$1 LOGNAME="`hostname`.out" if [ "$DO_LOG" -eq "1" ]; then # Logging is turned on, so relaunch ourself with logging disabled, and tee the output to the logfile sh $0 0 | tee $LOGNAME exit ${PIPESTATUS[0]} fi #... Do lots of things in the script exit $ERRORCODE

(Note the single line change in the conditional exit.)

But, I donâ€™t have the luxury of bash (thanks AIX and FreeBSD and Solaris 8), so we needed to get fancyâ€¦
#!/bin/sh DO_LOG=$1 LOGNAME="`hostname`.out" if [ "$DO_LOG" -eq "1" ]; then # Logging is turned on, so relaunch ourself with logging disabled, and tee the output to the logfile cp /dev/null $LOGNAME tail -f $LOGNAME & TAILPID=$! sh $0 0 >> $LOGNAME 2>&1 RETURNCODE=$? kill TAILPID exit $RETURNCODE fi #... Do lots of things in the script exit $ERRORCODE

In this last example, weâ€™re creating the empty logfile by copying /dev/null to the logname, then starting a backgrounded â€œtailâ€ command on the empty file. Because we havenâ€™t disconnected STDOUT in the backgrounding, we will still get the screen output we desire from â€œtailâ€. The script now only writes *its* output, with redirected STDOUT and STDERR, to the log file, which is already being tailed to the actual screen. At the end of the script, we capture the true exit code, clean up the tail ugliness, and exit with the desired status code.

This does have a serious downside that if the script encounters and error and exits, the â€œtailâ€ is left running indefinitely on Linux and Solaris, since the kernel there will simply scavenge the process to be owned by init. So, if you take this method, be very careful to capture all errors you may possibly encounter. Or, just use a better scripting tool. 🙂

totalnetsolutions.net