fst[ 64]:err_code:0xa0403034, clock:0x0d7c7bf2 2007/03/03 18:33:22
ofst[ 68]:err_code:0xa0404432, clock:0x0d7c7bf9 2007/03/03 18:33:29
Is frequently shown, which according to the syscon errlog logs.pdf means the following:
a0403034 - Step number 40 on powercycle, Category 3 Fatal booting error, BE Error (IC1001) - so at this point you are thinking, mmm must be the CELL processor is not happy?. Well its related to the next error
a0404432 - Step number 40 on powercycle, Category 4 Data error, BE or RSX Error (IC1001 or IC2001)
OK, so we have a possible faulty CELL or RSX. Lets try and bring the board up and see what the current status gives us.
>$ bringup
bringup
[SSM] state: 0000 -> 0101
Bringup Mode #0 (0xFF)
[SSM] ssmCb_OnStartingBePowOn() called.
[SSM] First Boot.
[SSM] Bringup mode : syspm_stat=00000000/00000000
[POWSEQ] PowerSeq_Setup called.
[SSM] state: 0101 -> 0201
[POWSEQ] AV Backend Setup
[SSM] state: 0201 -> 0102
>$
[SSM] state: 0102 -> 0202
[SSM] state: 0202 -> 0103
[SSM] state: 0103 -> 0203
[SSM] ssmCb_BeforeBeOn() called.
[SSM] state: 0203 -> 0104
Psbd_SbTransMode_Half:0x20e7
[POWERSEQ] Error : BitTraining RSX:RRAC:RX3:GLOBAL1:RX_STATUS
[SSM] state: 0104 -> 0304
[SSM] ssmCb_AfterBeOn2() called.
[SSM] PowSeq Fail : Detected !
[SSM] state: 0304 -> 0700
[POWSEQ] AV Backend Letup
[SSM] Shutdown mode : syspm_stat=00000000/00000000
[ERROR]: 0xa0404432
[ERROR]: 0xa0403034
[POWSEQ] PowerSeq_Letup called.
[SSM] state: 0700 -> 0600
(PowerOff State) (Fatal)
Mmmm, [POWERSEQ] Error : BitTraining RSX:RRAC:RX3:GLOBAL1:RX_STATUS this is interesting?
To me this is looking like an RSX issue, on a similiar board BE was showing instead of RSX. It had the CELL replaced and rejigged and worked again.
In this case it seems the RSX is having problems communicating to the CELL.
At the moment this is not a NEC tokin issue, because the power boot states are working fine as its getting to last boot state. Early code errors will be in the 1 or 2 category. BUT NEC tokins can manifest later on when in a running state under load. Then error codes 1 and 2 will come into play.
So based on what has been presented to me, I know from experience this will need a reflow or replace the RSX.
I can do some further multimeter testing in the RSX area and determine the resistance on the NEC tokins POSITIVE and GND.
The RSX is showing a reading of 1.4ohms and the CELL shows 4.5ohms. This is a measure of the core status (RSX = 1.4ohms, CELL = 4.5ohms) as a complete circuit. Removing the RSX or CELL will produce a high resistance in the millions because its no longer completing the circuit.
If the RSX and CELL are reading below 0.9ohms then they have died. If you are getting 0.6ohms below, you have a short.
Usually on the RSX the ram chips die and short out.
To me 1.4ohms on the RSX is kinda low, and the RSX will mostly fail sooner or later or the heat from reflow will kill it.
So for me I will replace the RSX..
Hope this run down helps?
https://imgur.com/a/YBJDsqa - showing how i connect the usb ttl cable and serial points on the SEM-001 motherboard.
Green = TX, Orange = RX, Yellow = Diag and Black = GND
The usb ttl lead is a 3.3v serial lead using a PL2303 chip. Its my permanent diag lead so have soldered leads with crocodile ends so i can just clip on the new awg 30 cables from the ps3 motherboard.