Enabling debug mode...
Setting history weight to 0.3...
Setting reinforcement at the end of unsuccessful path to 1000... [needs -reward=lastaction]
Setting reinforcement along unsuccessful path to 1... [needs -reward=lastaction]
Setting reinforcement at the end of successful path to 10000... [needs -reward=lastaction]
Setting reinforcement along successful path to 10000... [needs -reward=lastaction]
Setting path reward to Last Action(10000,10000,0,1000)...
Setting policy update to complete count method...
Doing interval estimation instead of hypothesis testing...
Enabling last action identification for checkpoint formulae...
Starting threads... 0.078s
Learning...
 Block[#]{# satisfying traces, # falsifying traces}
 Block[0]{0, 2000} 123
 Block[1]{0, 2000} 123
 Block[2]{0, 2000} 123
 Block[3]{0, 2000} 123
 Block[4]{0, 2000} 123
 Block[5]{0, 2000} 123
 Block[6]{0, 2000} 123
 Block[7]{0, 2000} 123
 Block[8]{0, 2000} 123
 Block[9]{0, 2000} 123
 Block[10]{0, 2000} 123
 Block[11]{0, 2000} 123
 Block[12]{0, 2000} 123
 Block[13]{0, 2000} 123
 Block[14]{0, 2000} 123
 Block[15]{1, 1999} 123
 Block[16]{0, 2000} 123
 Block[17]{0, 2000} 123
 Block[18]{0, 2000} 123
 Block[19]{0, 2000} 123
 Block[20]{0, 2000} 123
 Block[21]{0, 2000} 123
 Block[22]{0, 2000} 123
 Block[23]{0, 2000} 123
 Block[24]{0, 2000} 123
 Block[25]{0, 2000} 123
 Block[26]{0, 2000} 123
 Block[27]{0, 2000} 123
 Block[28]{0, 2000} 123
 Block[29]{0, 2000} 123
 Block[30]{0, 2000} 123
 Block[31]{0, 2000} 123
 Block[32]{0, 2000} 123
 Block[33]{0, 2000} 123
 Block[34]{0, 2000} 123
 Block[35]{0, 2000} 123
 Block[36]{0, 2000} 123
 Block[37]{0, 2000} 123
 Block[38]{0, 2000} 123
 Block[39]{0, 2000} 123
 Block[40]{0, 2000} 123
 Block[41]{0, 2000} 123
 Block[42]{0, 2000} 123
 Block[43]{0, 2000} 123
 Block[44]{0, 2000} 123
 Block[45]{1, 1999} 123
 Block[46]{0, 2000} 123
