Try to change 'step()' to 'trace()'. This routine looks similar, but they has quite different implementation on the CPU level. 'trace()' uses capability of CPU which generate exception ( trace ) after every instruction. 'step()' use software breakpoints and for some situation ( as a thread stopping ) can 'lose' execution flow.
↧