We recently upgraded to cognos 8.3. Last Friday our production environment crashed - it went into almost a zombie state where the content store was still communicating but users were getting a CAM-AAA-0071 error when trying to login. A recycle of the services resolved the issue but we crashed again this past Monday. The logs are filled the below error messages
xmlns:bus="
http://developer.cognos.com/schemas/bibus/3/">
<severity>error</severity>
<errorCode>cmHeaderFault</errorCode>
<bus:message>
<messageString>CM-REQ-4159 Content Manager returned an error in the response header.</messageString>
</bus:message>
</bus:exception>
</detail>
170.6.137.4:9300 2316 2008-10-03 14:40:45.148 -5 Thread-20 caf 692 2 Audit.dispatcher.caf Request Warning received SOAP fault during capability check: details => <detail>
<bus:exception xmlns:bus="
http://developer.cognos.com/schemas/bibus/3/">
<severity>error</severity>
<errorCode>cmHeaderFault</errorCode>
<bus:message>
<messageString>CM-REQ-4159 Content Manager returned an error in the response header.</messageString>
We pulled the box out that was the content manager and put in our 02 box. We are now starting to see the same errors again on the 02 box - not as frequent but we figure it took about 4 weeks for the 01 to start getting to that zombie state. Any insight would greatly be appreciated.