fdctam32.c uses the pmulhrw in place of pmulhw+por.  This improves the
accuracy of the fdct_col_4 macro (column-transform.)
pmulhrw used instead of pmulhw+por
---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -256, +H = 255, sign = 1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   2   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 2  (FAILS spec limit 1)

Mean square errors:
   0.0560   0.0512   0.0516   0.0587   0.0512   0.0526   0.0521   0.0557
   0.0530   0.0580   0.0533   0.0543   0.0507   0.0556   0.0519   0.0536
   0.0550   0.0555   0.0538   0.0481   0.0499   0.0518   0.0554   0.0532
   0.0531   0.0509   0.0529   0.0546   0.0583   0.0538   0.0527   0.0540
   0.0525   0.0542   0.0501   0.0561   0.0585   0.0556   0.0530   0.0570
   0.0494   0.0533   0.0522   0.0512   0.0501   0.0519   0.0545   0.0531
   0.0532   0.0547   0.0565   0.0576   0.0535   0.0525   0.0590   0.0495
   0.0581   0.0519   0.0538   0.0580   0.0595   0.0530   0.0526   0.0557
Worst pmse = 0.059500  (meets spec limit 0.06)
Overall mse = 0.053817  (FAILS spec limit 0.02)

Mean errors:
   0.0204   0.0006  -0.0006   0.0211   0.0172  -0.0004  -0.0013   0.0197
  -0.0026  -0.0024   0.0025   0.0019   0.0051   0.0060  -0.0039  -0.0036
  -0.0014  -0.0021  -0.0012   0.0033  -0.0039   0.0020  -0.0020  -0.0002
   0.0191   0.0001   0.0019   0.0214   0.0213   0.0006   0.0001   0.0174
   0.0183   0.0004   0.0019   0.0215   0.0191  -0.0030  -0.0016   0.0234
   0.0006   0.0007   0.0004  -0.0008   0.0011  -0.0027  -0.0001  -0.0031
   0.0002  -0.0009   0.0023   0.0046  -0.0069   0.0001  -0.0012   0.0037
   0.0245   0.0003  -0.0028   0.0244   0.0215   0.0034   0.0022   0.0205
Worst mean error = 0.024500  (FAILS spec limit 0.015)
Overall mean error = 0.005127  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero



---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -5, +H = 5, sign = 1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 1  (meets spec limit 1)

Mean square errors:
   0.0531   0.0485   0.0527   0.0563   0.0556   0.0468   0.0513   0.0533
   0.0548   0.0530   0.0520   0.0518   0.0528   0.0478   0.0514   0.0511
   0.0492   0.0558   0.0473   0.0517   0.0507   0.0504   0.0521   0.0505
   0.0540   0.0506   0.0508   0.0538   0.0559   0.0487   0.0520   0.0573
   0.0532   0.0509   0.0488   0.0576   0.0553   0.0492   0.0513   0.0552
   0.0551   0.0562   0.0482   0.0542   0.0514   0.0502   0.0484   0.0521
   0.0535   0.0503   0.0491   0.0499   0.0506   0.0499   0.0505   0.0511
   0.0579   0.0505   0.0512   0.0539   0.0535   0.0528   0.0510   0.0551
Worst pmse = 0.057900  (meets spec limit 0.06)
Overall mse = 0.052050  (FAILS spec limit 0.02)

Mean errors:
   0.0139   0.0023  -0.0009   0.0201   0.0202  -0.0034   0.0009   0.0181
  -0.0004  -0.0016   0.0008   0.0004   0.0004  -0.0030   0.0030   0.0027
  -0.0020  -0.0016   0.0017   0.0015   0.0011   0.0040   0.0003   0.0003
   0.0162   0.0028   0.0010   0.0190   0.0231  -0.0017  -0.0004   0.0187
   0.0202   0.0033  -0.0042   0.0202   0.0181   0.0000  -0.0013   0.0168
  -0.0001   0.0006  -0.0024  -0.0022   0.0002   0.0000   0.0010   0.0023
  -0.0005   0.0051  -0.0041  -0.0029   0.0022  -0.0015  -0.0015   0.0015
   0.0239  -0.0035   0.0000   0.0179   0.0215   0.0012   0.0004   0.0219
Worst mean error = 0.023900  (FAILS spec limit 0.015)
Overall mean error = 0.004869  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero



---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -300, +H = 300, sign = 1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 1  (meets spec limit 1)

Mean square errors:
   0.0487   0.0423   0.0458   0.0491   0.0475   0.0429   0.0482   0.0467
   0.0455   0.0469   0.0430   0.0463   0.0456   0.0488   0.0469   0.0483
   0.0383   0.0437   0.0434   0.0418   0.0464   0.0434   0.0450   0.0461
   0.0481   0.0426   0.0461   0.0482   0.0463   0.0443   0.0449   0.0508
   0.0478   0.0447   0.0396   0.0481   0.0457   0.0472   0.0472   0.0478
   0.0450   0.0449   0.0452   0.0470   0.0413   0.0454   0.0481   0.0477
   0.0449   0.0461   0.0475   0.0460   0.0473   0.0430   0.0458   0.0481
   0.0476   0.0459   0.0500   0.0485   0.0457   0.0459   0.0500   0.0481
Worst pmse = 0.050800  (meets spec limit 0.06)
Overall mse = 0.046016  (FAILS spec limit 0.02)

Mean errors:
   0.0145   0.0011   0.0032   0.0169   0.0193  -0.0003  -0.0028   0.0177
  -0.0037  -0.0003   0.0016   0.0009  -0.0004   0.0004   0.0013  -0.0019
  -0.0009  -0.0011   0.0002   0.0008  -0.0002  -0.0006  -0.0022   0.0005
   0.0157  -0.0034   0.0009   0.0174   0.0177  -0.0043   0.0005   0.0176
   0.0180  -0.0003   0.0012   0.0151   0.0181   0.0002   0.0004   0.0180
   0.0024   0.0017  -0.0004  -0.0042  -0.0005   0.0016   0.0001  -0.0027
  -0.0021   0.0027   0.0003   0.0008  -0.0027   0.0000   0.0012   0.0015
   0.0176  -0.0003   0.0000   0.0185   0.0145   0.0005   0.0024   0.0161
Worst mean error = 0.019300  (FAILS spec limit 0.015)
Overall mean error = 0.004153  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero



---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -256, +H = 255, sign = -1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 1  (meets spec limit 1)

Mean square errors:
   0.0557   0.0523   0.0518   0.0583   0.0510   0.0528   0.0523   0.0572
   0.0530   0.0579   0.0539   0.0537   0.0503   0.0577   0.0521   0.0536
   0.0556   0.0550   0.0553   0.0469   0.0517   0.0537   0.0545   0.0536
   0.0538   0.0543   0.0554   0.0557   0.0593   0.0549   0.0509   0.0543
   0.0531   0.0570   0.0488   0.0551   0.0593   0.0567   0.0539   0.0563
   0.0503   0.0558   0.0545   0.0527   0.0505   0.0543   0.0553   0.0532
   0.0532   0.0559   0.0573   0.0552   0.0516   0.0513   0.0571   0.0504
   0.0556   0.0519   0.0544   0.0587   0.0557   0.0517   0.0537   0.0568
Worst pmse = 0.059300  (meets spec limit 0.06)
Overall mse = 0.054153  (FAILS spec limit 0.02)

Mean errors:
   0.0181   0.0017  -0.0012   0.0195   0.0180  -0.0010   0.0009   0.0218
   0.0032   0.0039  -0.0017  -0.0027  -0.0013  -0.0019   0.0035   0.0010
  -0.0006   0.0030   0.0027  -0.0009   0.0031  -0.0001   0.0003  -0.0018
   0.0214  -0.0019   0.0004   0.0223   0.0179   0.0003   0.0017   0.0217
   0.0213  -0.0020   0.0008   0.0171   0.0205   0.0007   0.0003   0.0171
  -0.0009  -0.0020  -0.0023  -0.0013  -0.0015   0.0009   0.0005   0.0014
  -0.0006  -0.0009  -0.0021  -0.0022   0.0056  -0.0007   0.0019  -0.0024
   0.0168  -0.0011   0.0022   0.0147   0.0169  -0.0023  -0.0027   0.0182
Worst mean error = 0.022300  (FAILS spec limit 0.015)
Overall mean error = 0.004738  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero



---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -5, +H = 5, sign = -1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 1  (meets spec limit 1)

Mean square errors:
   0.0544   0.0495   0.0516   0.0561   0.0573   0.0489   0.0487   0.0536
   0.0543   0.0527   0.0522   0.0535   0.0519   0.0500   0.0515   0.0523
   0.0513   0.0553   0.0479   0.0523   0.0539   0.0520   0.0522   0.0509
   0.0564   0.0506   0.0489   0.0555   0.0557   0.0470   0.0488   0.0569
   0.0563   0.0501   0.0523   0.0572   0.0571   0.0527   0.0519   0.0591
   0.0554   0.0533   0.0507   0.0546   0.0511   0.0499   0.0490   0.0510
   0.0542   0.0497   0.0492   0.0486   0.0510   0.0532   0.0520   0.0514
   0.0588   0.0519   0.0512   0.0575   0.0515   0.0527   0.0515   0.0537
Worst pmse = 0.059100  (meets spec limit 0.06)
Overall mse = 0.052561  (FAILS spec limit 0.02)

Mean errors:
   0.0218  -0.0023   0.0004   0.0181   0.0229   0.0043  -0.0017   0.0194
   0.0005   0.0007   0.0008   0.0003  -0.0011   0.0020  -0.0019  -0.0015
   0.0023   0.0011  -0.0021  -0.0011   0.0011  -0.0056  -0.0020   0.0017
   0.0220  -0.0028  -0.0009   0.0229   0.0177   0.0030   0.0012   0.0225
   0.0191  -0.0051   0.0049   0.0188   0.0193   0.0033   0.0025   0.0247
   0.0016   0.0003   0.0009   0.0028   0.0007   0.0019  -0.0006  -0.0020
  -0.0010  -0.0057   0.0020   0.0018   0.0010  -0.0014   0.0004  -0.0006
   0.0186   0.0039  -0.0004   0.0233   0.0189  -0.0019  -0.0023   0.0171
Worst mean error = 0.024700  (FAILS spec limit 0.015)
Overall mean error = 0.005164  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero



---------------------------------------------------
IEEE-1180 test, for [fdctam32.c]  'AP922 MMX+3DNow FDCT'
---------------------------------------------------
IEEE test conditions: -L = -300, +H = 300, sign = -1, #iters = 10000
Peak absolute values of errors:
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
   1   1   1   1   1   1   1   1
Worst peak error = 1  (meets spec limit 1)

Mean square errors:
   0.0495   0.0442   0.0445   0.0488   0.0471   0.0424   0.0465   0.0472
   0.0459   0.0476   0.0424   0.0439   0.0457   0.0493   0.0461   0.0476
   0.0392   0.0436   0.0448   0.0425   0.0454   0.0447   0.0449   0.0443
   0.0499   0.0443   0.0459   0.0462   0.0459   0.0435   0.0436   0.0472
   0.0442   0.0465   0.0396   0.0489   0.0454   0.0455   0.0481   0.0496
   0.0447   0.0441   0.0465   0.0452   0.0410   0.0450   0.0472   0.0462
   0.0448   0.0419   0.0485   0.0470   0.0474   0.0437   0.0461   0.0486
   0.0475   0.0448   0.0472   0.0484   0.0448   0.0433   0.0486   0.0455
Worst pmse = 0.049900  (meets spec limit 0.06)
Overall mse = 0.045631  (FAILS spec limit 0.02)

Mean errors:
   0.0177  -0.0010  -0.0011   0.0166   0.0157   0.0012   0.0033   0.0178
   0.0031  -0.0010   0.0006  -0.0003  -0.0009  -0.0027  -0.0001   0.0014
   0.0004   0.0014   0.0012  -0.0003  -0.0026   0.0005   0.0019  -0.0041
   0.0177   0.0017  -0.0007   0.0128   0.0155   0.0035  -0.0010   0.0166
   0.0152   0.0005   0.0016   0.0197   0.0150  -0.0005   0.0015   0.0144
  -0.0027   0.0007   0.0001   0.0016   0.0020  -0.0018  -0.0002   0.0032
   0.0014   0.0001  -0.0023  -0.0002   0.0020  -0.0015  -0.0005  -0.0014
   0.0151   0.0010  -0.0022   0.0158   0.0194  -0.0001  -0.0022   0.0171
Worst mean error = 0.019700  (FAILS spec limit 0.015)
Overall mean error = 0.004166  (FAILS spec limit 0.0015)

0 elements of IDCT(0) were not zero


Profile: Function timing, sorted by time
Date:    Sat Jul 22 21:27:09 2000


Program Statistics
------------------
    Command line at 2000 Jul 22 21:26: "C:\work\proj\fdct1180\Debug\ieee1180" -256 255 1 10000
    Total time: 8998.574 millisecond
    Time outside of functions: 5.621 millisecond
    Call depth: 2
    Total functions: 26
    Total hits: 680010
    Function coverage: 38.5%
    Overhead Calculated 6
    Overhead Average 6

Module Statistics for ieee1180.exe
----------------------------------
    Time in module: 8992.952 millisecond
    Percent of time in module: 100.0%
    Functions in module: 26
    Hits in module: 680010
    Module function coverage: 38.5%

        Func          Func+Child           Hit
        Time   %         Time      %      Count  Function
---------------------------------------------------------
    5448.346  60.6     5448.346  60.6    20000 _ref_idct (ieeetest.obj)
    2710.997  30.1     2710.997  30.1    10000 _ref_fdct (ieeetest.obj)
     452.539   5.0      452.539   5.0   640000 _ieeerand (ieeetest.obj)
     359.764   4.0     8992.952 100.0        1 _main (ieeetest.obj)
      21.068   0.2       21.068   0.2    10000 _fdct_am32 (fdctam32.obj)
       0.189   0.0        0.189   0.0        1 _init_fdct_3dnow (fdct3dn.obj)
       0.024   0.0        0.024   0.0        1 _ref_dct_init (ieeetest.obj)
       0.011   0.0        0.011   0.0        1 _intfdct (fdctint.obj)
       0.009   0.0        0.009   0.0        1 _init_fdctdaan (fdctref.obj)
       0.005   0.0        0.005   0.0        5 _meets (ieeetest.obj)