This project has moved and is read-only. For the latest updates, please go here.
x64 Release ( 128 threads_per * 768 blocks = 98304 threads)
Cities  11;   Permutations:   39916800:
---------------------------------------
With disk cache empty and only a single class ...
           Total     Load      Run
              ms       ms       ms  distance
CpuTsp     14693 =      0 +  14693; 110.7368: 
MpuTsp      7404 =      0 +   7404; 110.7368: 
MpuTspA     5148 =      0 +   5148; 110.7368: - MpuTsp_Better
GpuTsp0     3012 =   2576 +    436; 110.7368: - cold
GpuTsp0     1832 =   1453 +    379; 110.7368: - warm

... now with separate classes
           Total     Load      Run
              ms       ms       ms  distance
GpuTsp1     1540 =   1159 +    381; 110.7368: - 1_SeparateClass
GpuTsp2     1359 =   1194 +    165; 110.7368: - 2_StructArray

GpuTsp3     2021 =   1826 +    195; 110.7368: - 3_Architecture_x64_2_1
GpuTsp3a    1994 =   1798 +    196; 110.7368: - 3_PathArrayStrided
GpuTsp3b    2028 =   1827 +    201; 110.7368: - 3_DivisorsCachedGlobal

GpuTsp4     2451 =   1796 +    655; 110.7368: - 4_Long
GpuTsp4a    2367 =   1792 +    575; 110.7368: - 4_PathArrayStrided
GpuTsp4b    2276 =   1758 +    518; 110.7368: - 4_DivisorsCachedGlobal

GpuTsp3c    1886 =   1743 +    143; 110.7368: - 3_MultiplyInstead
GpuTsp4c    1936 =   1742 +    194; 110.7368: - 4_MultiplyInstead

... and now with disk cache populated.
           Total     Load      Run
              ms       ms       ms  distance
GpuTsp1      430 =     88 +    342; 110.7368: - 1_SeparateClass
GpuTsp2      231 =     90 +    141; 110.7368: - 2_StructArray

GpuTsp3      264 =     93 +    171; 110.7368: - 3_Architecture_x64_2_1
GpuTsp3a     263 =     93 +    170; 110.7368: - 3_PathArrayStrided
GpuTsp3b     269 =     98 +    171; 110.7368: - 3_DivisorsCachedGlobal

GpuTsp4      641 =     97 +    544; 110.7368: - 4_Long
GpuTsp4a     640 =    100 +    540; 110.7368: - 4_PathArrayStrided
GpuTsp4b     615 =    106 +    509; 110.7368: - 4_DivisorsCachedGlobal

GpuTsp3c     240 =    105 +    135; 110.7368: - 3_MultiplyInstead
GpuTsp4c     290 =    103 +    187; 110.7368: - 4_MultiplyInstead
Next: Complete Results for 12 Cities

Last edited Nov 5, 2012 at 12:28 AM by pgeerkens, version 2