Alex Hudson
2008-10-13 11:10:16 UTC
Bugfinder general Jur found these messages in his queue log:
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
From looking at the code, this seems to be a path in the command loop
where it's handling commands it doesn't recognise. Because it's spinning
around in this big loop, I'm assuming that there is a bug - probably in
SMTP - where the agent and the queue are getting "out of sync" protocol
wise, and the queue is trying to interpret commands out of a message
body or something.
So, there are two things here:
a. I wonder if we can work out somehow what's actually going on here and
causing stuff to fail badly. Jur had run out of disk space, so
potentially something is ignoring an 'out of space' error condition and
sending the mail on anyway (this seems likely to me).
b. why are we trying to tolerate command protocol errors? It seems
pretty likely to me that one error can easily be compounded by another.
Why not just quit the connection? At worse, the agent will reconnect and
make the same mistake.
Cheers,
Alex.
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
Oct 12 11:31:38 y104 syslog[17710]: DEBUG queue - Handled command
from 127.0.0.1
From looking at the code, this seems to be a path in the command loop
where it's handling commands it doesn't recognise. Because it's spinning
around in this big loop, I'm assuming that there is a bug - probably in
SMTP - where the agent and the queue are getting "out of sync" protocol
wise, and the queue is trying to interpret commands out of a message
body or something.
So, there are two things here:
a. I wonder if we can work out somehow what's actually going on here and
causing stuff to fail badly. Jur had run out of disk space, so
potentially something is ignoring an 'out of space' error condition and
sending the mail on anyway (this seems likely to me).
b. why are we trying to tolerate command protocol errors? It seems
pretty likely to me that one error can easily be compounded by another.
Why not just quit the connection? At worse, the agent will reconnect and
make the same mistake.
Cheers,
Alex.