Difference between revisions of "SPEC CPU2006 benchmarks"
m (→Create the SPEC CPU2006 processes for M5 SE mode) |
|||
(54 intermediate revisions by 7 users not shown) | |||
Line 1: | Line 1: | ||
− | '''This is a work in-progress. Everyone should feel free to extend this page with their experiences to help new users get started.''' | + | '''This is a work in-progress. Everyone should feel free to extend this page with their experiences to help new users get started.''' |
===== Input sets and Binaries ===== | ===== Input sets and Binaries ===== | ||
− | We can't provide the binaries or input files because of licensing restrictions, but It's not hard to build the binaries by yourself. In this short article, we will share our experiences about what we have done so far. | + | We can't provide the binaries or input files because of licensing restrictions, but It's not hard to build the binaries by yourself. In this short article, we will share our experiences about what we have done so far. |
===== Build the cross-compiler for alpha machine ===== | ===== Build the cross-compiler for alpha machine ===== | ||
− | Download the crosstool-0.43.tar.gz from [http://kegel.com/crosstool/ http://kegel.com/crosstool] and modify these three lines in the demo-alpha.sh : | + | It is suggested that you use crosstool-ng available [http://ymorin.is-a-geek.org/dokuwiki/projects/crosstool#download here], a new tool based on Dan Kegel's cross tool that is more up to date. Follow the instructions available on that page for building the cross-compiler. |
+ | |||
+ | If you do not wish to build this tool yourself, you may be able to use one of the pre-compiled cross-compilers available on the [[Download|M5 Download Page]]. | ||
+ | |||
+ | Or, you can use the older crosstool. Download the crosstool-0.43.tar.gz from [http://kegel.com/crosstool/ http://kegel.com/crosstool] and modify these three lines in the demo-alpha.sh : | ||
<pre> | <pre> | ||
Line 16: | Line 20: | ||
Then follow the steps in the [http://www.kegel.com/crosstool/current/doc/crosstool-howto.html crosstool-howto] page to build the cross compiler. | Then follow the steps in the [http://www.kegel.com/crosstool/current/doc/crosstool-howto.html crosstool-howto] page to build the cross compiler. | ||
− | ===== Build the | + | ===== Build the SPEC CPU2006 alpha binaries ===== |
− | Install the | + | Install the SPEC CPU2006 from DVD and modify the CC, CXX, and FC in config/alpha.cfg. If alpha.cfg is not in the config/ directory, you can use linux32-i386-gcc42.cfg and modify the CC, CXX, and FC variables. |
<pre> | <pre> | ||
For example: | For example: | ||
− | CC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gcc | + | CC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gcc |
− | CXX = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-g++ | + | CXX = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-g++ |
FC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gfortran | FC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gfortran | ||
</pre> | </pre> | ||
Line 30: | Line 34: | ||
<pre> | <pre> | ||
For example: | For example: | ||
− | runspec --config=alpha.cfg --action= | + | runspec --config=alpha.cfg --action=build --tune=base bzip2 |
+ | </pre> | ||
+ | |||
+ | ===== Create the SPEC CPU2006 processes for M5 SE mode===== | ||
+ | |||
+ | A good reference for the correct command line options can be found here: | ||
+ | [http://kejo.be/ELIS/spec_cpu2006/spec_cpu2006_command_lines.html SPEC_CPU2006_Commands]. | ||
+ | |||
+ | For your convenience, here is our benchmark python file for the M5 SE mode. | ||
+ | |||
+ | <pre> | ||
+ | #Mybench.py | ||
+ | |||
+ | #400.perlbench | ||
+ | perlbench = LiveProcess() | ||
+ | perlbench.executable = binary_dir+'400.perlbench_base.alpha-gcc' | ||
+ | perlbench.cmd = [perlbench.executable] + ['-I./lib', 'attrs.pl'] | ||
+ | perlbench.output = 'attrs.out' | ||
+ | |||
+ | #401.bzip2 | ||
+ | bzip2 = LiveProcess() | ||
+ | bzip2.executable = binary_dir+'401.bzip2_base.alpha-gcc' | ||
+ | data=data_dir+'401.bzip2/data/all/input/input.program' | ||
+ | bzip2.cmd = [bzip2.executable] + [data, '1'] | ||
+ | bzip2.output = 'input.program.out' | ||
+ | |||
+ | #403.gcc | ||
+ | gcc = LiveProcess() | ||
+ | gcc.executable = binary_dir+'403.gcc_base.alpha-gcc' | ||
+ | data=data_dir+'403.gcc/data/test/input/cccp.i' | ||
+ | output='/import/home1/mjwu/work_spec2006/403.gcc/m5/cccp.s' | ||
+ | gcc.cmd = [gcc.executable] + [data]+['-o',output] | ||
+ | gcc.output = 'ccc.out' | ||
+ | |||
+ | #410.bwaves | ||
+ | bwaves = LiveProcess() | ||
+ | bwaves.executable = binary_dir+'410.bwaves_base.alpha-gcc' | ||
+ | bwaves.cmd = [bwaves.executable] | ||
+ | |||
+ | #416.gamess | ||
+ | gamess=LiveProcess() | ||
+ | gamess.executable = binary_dir+'416.gamess_base.alpha-gcc' | ||
+ | gamess.cmd = [gamess.executable] | ||
+ | gamess.input='exam29.config' | ||
+ | gamess.output='exam29.output' | ||
+ | |||
+ | #429.mcf | ||
+ | mcf = LiveProcess() | ||
+ | mcf.executable = binary_dir+'429.mcf_base.alpha-gcc' | ||
+ | data=data_dir+'429.mcf/data/test/input/inp.in' | ||
+ | mcf.cmd = [mcf.executable] + [data] | ||
+ | mcf.output = 'inp.out' | ||
+ | |||
+ | #433.milc | ||
+ | milc=LiveProcess() | ||
+ | milc.executable = binary_dir+'433.milc_base.alpha-gcc' | ||
+ | stdin=data_dir+'433.milc/data/test/input/su3imp.in' | ||
+ | milc.cmd = [milc.executable] | ||
+ | milc.input=stdin | ||
+ | milc.output='su3imp.out' | ||
+ | |||
+ | #434.zeusmp | ||
+ | zeusmp=LiveProcess() | ||
+ | zeusmp.executable = binary_dir+'434.zeusmp_base.alpha-gcc' | ||
+ | zeusmp.cmd = [zeusmp.executable] | ||
+ | zeusmp.output = 'zeusmp.stdout' | ||
+ | |||
+ | #435.gromacs | ||
+ | gromacs = LiveProcess() | ||
+ | gromacs.executable = binary_dir+'435.gromacs_base.alpha-gcc' | ||
+ | data=data_dir+'435.gromacs/data/test/input/gromacs.tpr' | ||
+ | gromacs.cmd = [gromacs.executable] + ['-silent','-deffnm',data,'-nice','0'] | ||
+ | |||
+ | #436.cactusADM | ||
+ | cactusADM = LiveProcess() | ||
+ | cactusADM.executable = binary_dir+'436.cactusADM_base.alpha-gcc' | ||
+ | data=data_dir+'436.cactusADM/data/test/input/benchADM.par' | ||
+ | cactusADM.cmd = [cactusADM.executable] + [data] | ||
+ | cactusADM.output = 'benchADM.out' | ||
+ | |||
+ | #437.leslie3d | ||
+ | leslie3d=LiveProcess() | ||
+ | leslie3d.executable = binary_dir+'437.leslie3d_base.alpha-gcc' | ||
+ | stdin=data_dir+'437.leslie3d/data/test/input/leslie3d.in' | ||
+ | leslie3d.cmd = [leslie3d.executable] | ||
+ | leslie3d.input=stdin | ||
+ | leslie3d.output='leslie3d.stdout' | ||
+ | |||
+ | #444.namd | ||
+ | namd = LiveProcess() | ||
+ | namd.executable = binary_dir+'444.namd_base.alpha-gcc' | ||
+ | input=data_dir+'444.namd/data/all/input/namd.input' | ||
+ | namd.cmd = [namd.executable] + ['--input',input,'--iterations','1','--output','namd.out'] | ||
+ | namd.output='namd.stdout' | ||
+ | |||
+ | #445.gobmk | ||
+ | gobmk=LiveProcess() | ||
+ | gobmk.executable = binary_dir+'445.gobmk_base.alpha-gcc' | ||
+ | stdin=data_dir+'445.gobmk/data/test/input/capture.tst' | ||
+ | gobmk.cmd = [gobmk.executable]+['--quiet','--mode','gtp'] | ||
+ | gobmk.input=stdin | ||
+ | gobmk.output='capture.out' | ||
+ | |||
+ | #447.dealII | ||
+ | dealII=LiveProcess() | ||
+ | dealII.executable = binary_dir+'447.dealII_base.alpha-gcc' | ||
+ | dealII.cmd = [gobmk.executable]+['8'] | ||
+ | dealII.output='log' | ||
+ | |||
+ | |||
+ | #450.soplex | ||
+ | soplex=LiveProcess() | ||
+ | soplex.executable = binary_dir+'450.soplex_base.alpha-gcc' | ||
+ | data=data_dir+'450.soplex/data/test/input/test.mps' | ||
+ | soplex.cmd = [soplex.executable]+['-m10000',data] | ||
+ | soplex.output = 'test.out' | ||
+ | |||
+ | #453.povray | ||
+ | povray=LiveProcess() | ||
+ | povray.executable = binary_dir+'453.povray_base.alpha-gcc' | ||
+ | data=data_dir+'453.povray/data/test/input/SPEC-benchmark-test.ini' | ||
+ | #povray.cmd = [povray.executable]+['SPEC-benchmark-test.ini'] | ||
+ | povray.cmd = [povray.executable]+[data] | ||
+ | povray.output = 'SPEC-benchmark-test.stdout' | ||
+ | |||
+ | #454.calculix | ||
+ | calculix=LiveProcess() | ||
+ | calculix.executable = binary_dir+'454.calculix_base.alpha-gcc' | ||
+ | data='/import/RaidHome/mjwu/work_spec2006/454.calculix/m5/beampic' | ||
+ | calculix.cmd = [calculix.executable]+['-i',data] | ||
+ | calculix.output = 'beampic.log' | ||
+ | |||
+ | #456.hmmer | ||
+ | hmmer=LiveProcess() | ||
+ | hmmer.executable = binary_dir+'456.hmmer_base.alpha-gcc' | ||
+ | data=data_dir+'456.hmmer/data/test/input/bombesin.hmm' | ||
+ | hmmer.cmd = [hmmer.executable]+['--fixed', '0', '--mean', '325', '--num', '5000', '--sd', '200', '--seed', '0', data] | ||
+ | hmmer.output = 'bombesin.out' | ||
+ | |||
+ | #458.sjeng | ||
+ | sjeng=LiveProcess() | ||
+ | sjeng.executable = binary_dir+'458.sjeng_base.alpha-gcc' | ||
+ | data=data_dir+'458.sjeng/data/test/input/test.txt' | ||
+ | sjeng.cmd = [sjeng.executable]+[data] | ||
+ | sjeng.output = 'test.out' | ||
+ | |||
+ | #459.GemsFDTD | ||
+ | GemsFDTD=LiveProcess() | ||
+ | GemsFDTD.executable = binary_dir+'459.GemsFDTD_base.alpha-gcc' | ||
+ | GemsFDTD.cmd = [GemsFDTD.executable] | ||
+ | GemsFDTD.output = 'test.log' | ||
+ | |||
+ | #462.libquantum | ||
+ | libquantum=LiveProcess() | ||
+ | libquantum.executable = binary_dir+'462.libquantum_base.alpha-gcc' | ||
+ | libquantum.cmd = [libquantum.executable],'33','5' | ||
+ | libquantum.output = 'test.out' | ||
+ | |||
+ | #464.h264ref | ||
+ | h264ref=LiveProcess() | ||
+ | h264ref.executable = binary_dir+'464.h264ref_base.alpha-gcc' | ||
+ | data=data_dir+'464.h264ref/data/test/input/foreman_test_encoder_baseline.cfg' | ||
+ | h264ref.cmd = [h264ref.executable]+['-d',data] | ||
+ | h264ref.output = 'foreman_test_encoder_baseline.out' | ||
+ | |||
+ | #470.lbm | ||
+ | lbm=LiveProcess() | ||
+ | lbm.executable = binary_dir+'470.lbm_base.alpha-gcc' | ||
+ | data=data_dir+'470.lbm/data/test/input/100_100_130_cf_a.of' | ||
+ | lbm.cmd = [lbm.executable]+['20', 'reference.dat', '0', '1' ,data] | ||
+ | lbm.output = 'lbm.out' | ||
+ | |||
+ | #471.omnetpp | ||
+ | omnetpp=LiveProcess() | ||
+ | omnetpp.executable = binary_dir+'471.omnetpp_base.alpha-gcc' | ||
+ | data=data_dir+'471.omnetpp/data/test/input/omnetpp.ini' | ||
+ | omnetpp.cmd = [omnetpp.executable]+[data] | ||
+ | omnetpp.output = 'omnetpp.log' | ||
+ | |||
+ | #473.astar | ||
+ | astar=LiveProcess() | ||
+ | astar.executable = binary_dir+'473.astar_base.alpha-gcc' | ||
+ | astar.cmd = [astar.executable]+['lake.cfg'] | ||
+ | astar.output = 'lake.out' | ||
+ | |||
+ | #481.wrf | ||
+ | wrf=LiveProcess() | ||
+ | wrf.executable = binary_dir+'481.wrf_base.alpha-gcc' | ||
+ | wrf.cmd = [wrf.executable]+['namelist.input'] | ||
+ | wrf.output = 'rsl.out.0000' | ||
+ | |||
+ | #482.sphinx | ||
+ | sphinx3=LiveProcess() | ||
+ | sphinx3.executable = binary_dir+'482.sphinx_livepretend_base.alpha-gcc' | ||
+ | sphinx3.cmd = [sphinx3.executable]+['ctlfile', '.', 'args.an4'] | ||
+ | sphinx3.output = 'an4.out' | ||
+ | |||
+ | #483.xalancbmk | ||
+ | xalancbmk=LiveProcess() | ||
+ | xalancbmk.executable = binary_dir+'483.Xalan_base.alpha-gcc' | ||
+ | xalancbmk.cmd = [xalancbmk.executable]+['-v','test.xml','xalanc.xsl'] | ||
+ | xalancbmk.output = 'test.out' | ||
+ | |||
+ | #998.specrand | ||
+ | specrand_i=LiveProcess() | ||
+ | specrand_i.executable = binary_dir+'998.specrand_base.alpha-gcc' | ||
+ | specrand_i.cmd = [specrand_i.executable] + ['324342','24239'] | ||
+ | specrand_i.output = 'rand.24239.out' | ||
+ | |||
+ | #999.specrand | ||
+ | specrand_f=LiveProcess() | ||
+ | specrand_f.executable = binary_dir+'999.specrand_base.alpha-gcc' | ||
+ | specrand_f.cmd = [specrand_i.executable] + ['324342','24239'] | ||
+ | specrand_f.output = 'rand.24239.out' | ||
+ | |||
+ | </pre> | ||
+ | |||
+ | ===== M5 python configure script ===== | ||
+ | Here is our system configuration python file for the M5 SE mode. | ||
+ | |||
+ | <pre> | ||
+ | #cmp.py | ||
+ | # Simple configuration script | ||
+ | |||
+ | import m5 | ||
+ | from m5.objects import * | ||
+ | import os, optparse, sys | ||
+ | m5.AddToPath('./configs') | ||
+ | import Simulation | ||
+ | from Caches import * | ||
+ | import Mybench | ||
+ | |||
+ | # Get paths we might need. It's expected this file is in m5/configs/example. | ||
+ | config_path = os.path.dirname(os.path.abspath(__file__)) | ||
+ | print config_path | ||
+ | config_root = os.path.dirname(config_path)+"/configs" | ||
+ | print config_root | ||
+ | m5_root = os.path.dirname(config_root) | ||
+ | print m5_root | ||
+ | |||
+ | parser = optparse.OptionParser() | ||
+ | |||
+ | # Benchmark options | ||
+ | |||
+ | parser.add_option("-b", "--benchmark", default="", | ||
+ | help="The benchmark to be loaded.") | ||
+ | |||
+ | parser.add_option("-c", "--chkpt", default="", | ||
+ | help="The checkpoint to load.") | ||
+ | |||
+ | execfile(os.path.join(config_root, "configs", "Options.py")) | ||
+ | |||
+ | (options, args) = parser.parse_args() | ||
+ | |||
+ | if args: | ||
+ | print "Error: script doesn't take any positional arguments" | ||
+ | sys.exit(1) | ||
+ | |||
+ | if options.benchmark == 'perlbench': | ||
+ | process = Mybench.perlbench | ||
+ | elif options.benchmark == 'bzip2': | ||
+ | process = Mybench.bzip2 | ||
+ | elif options.benchmark == 'gcc': | ||
+ | process = Mybench.gcc | ||
+ | elif options.benchmark == 'bwaves': | ||
+ | process = Mybench.bwaves | ||
+ | elif options.benchmark == 'gamess': | ||
+ | process = Mybench.gamess | ||
+ | elif options.benchmark == 'mcf': | ||
+ | process = Mybench.mcf | ||
+ | elif options.benchmark == 'milc': | ||
+ | process = Mybench.milc | ||
+ | elif options.benchmark == 'zeusmp': | ||
+ | process = Mybench.zeusmp | ||
+ | elif options.benchmark == 'gromacs': | ||
+ | process = Mybench.gromacs | ||
+ | elif options.benchmark == 'cactusADM': | ||
+ | process = Mybench.cactusADM | ||
+ | elif options.benchmark == 'leslie3d': | ||
+ | process = Mybench.leslie3d | ||
+ | elif options.benchmark == 'namd': | ||
+ | process = Mybench.namd | ||
+ | elif options.benchmark == 'gobmk': | ||
+ | process = Mybench.gobmk; | ||
+ | elif options.benchmark == 'dealII': | ||
+ | process = Mybench.dealII | ||
+ | elif options.benchmark == 'soplex': | ||
+ | process = Mybench.soplex | ||
+ | elif options.benchmark == 'povray': | ||
+ | process = Mybench.povray | ||
+ | elif options.benchmark == 'calculix': | ||
+ | process = Mybench.calculix | ||
+ | elif options.benchmark == 'hmmer': | ||
+ | process = Mybench.hmmer | ||
+ | elif options.benchmark == 'sjeng': | ||
+ | process = Mybench.sjeng | ||
+ | elif options.benchmark == 'GemsFDTD': | ||
+ | process = Mybench.GemsFDTD | ||
+ | elif options.benchmark == 'libquantum': | ||
+ | process = Mybench.libquantum | ||
+ | elif options.benchmark == 'h264ref': | ||
+ | process = Mybench.h264ref | ||
+ | elif options.benchmark == 'tonto': | ||
+ | process = Mybench.tonto | ||
+ | elif options.benchmark == 'lbm': | ||
+ | process = Mybench.lbm | ||
+ | elif options.benchmark == 'omnetpp': | ||
+ | process = Mybench.omnetpp | ||
+ | elif options.benchmark == 'astar': | ||
+ | process = Mybench.astar | ||
+ | elif options.benchmark == 'wrf': | ||
+ | process = Mybench.wrf | ||
+ | elif options.benchmark == 'sphinx3': | ||
+ | process = Mybench.sphinx3 | ||
+ | elif options.benchmark == 'xalancbmk': | ||
+ | process = Mybench.xalancbmk | ||
+ | elif options.benchmark == 'specrand_i': | ||
+ | process = Mybench.specrand_i | ||
+ | elif options.benchmark == 'specrand_f': | ||
+ | process = Mybench.specrand_f | ||
+ | |||
+ | if options.chkpt != "": | ||
+ | process.chkpt = options.chkpt | ||
+ | |||
+ | (CPUClass, test_mem_mode, FutureClass) = Simulation.setCPUClass(options) | ||
+ | |||
+ | CPUClass.clock = '1.0GHz' | ||
+ | |||
+ | #np = options.num_cpus | ||
+ | np = 1 | ||
+ | |||
+ | system = System(cpu = [CPUClass(cpu_id=i) for i in xrange(np)], | ||
+ | physmem = PhysicalMemory(range=AddrRange("4096MB")), | ||
+ | membus = Bus(), mem_mode = 'timing') | ||
+ | |||
+ | system.physmem.port = system.membus.port | ||
+ | |||
+ | for i in xrange(np): | ||
+ | if options.caches: | ||
+ | system.cpu[i].addPrivateSplitL1Caches(L1Cache(size = '64kB'), | ||
+ | L1Cache(size = '64kB')) | ||
+ | if options.l2cache: | ||
+ | system.l2 = L2Cache(size='2MB') | ||
+ | system.tol2bus = Bus() | ||
+ | system.l2.cpu_side = system.tol2bus.port | ||
+ | system.l2.mem_side = system.membus.port | ||
+ | system.cpu[i].connectMemPorts(system.tol2bus) | ||
+ | else: | ||
+ | system.cpu[i].connectMemPorts(system.membus) | ||
+ | system.cpu[i].workload = process[i] | ||
+ | |||
+ | |||
+ | root = Root(system = system) | ||
+ | |||
+ | Simulation.run(options, root, system, FutureClass) | ||
+ | </pre> | ||
+ | |||
+ | ===== The SPEC CPU2006 testing dataset results ===== | ||
+ | We use the quard-core Xeon 2.5GHz with 16G memory machine. The operation system is 64bits CentOS 5.2. The timing results are from the simple cpu model and SPEC CPU2006 testing data set. | ||
+ | |||
+ | {| cellspacing="0" border="1" | ||
+ | |||
+ | |benchmark | ||
+ | |datatype | ||
+ | |language | ||
+ | |input data | ||
+ | |number of instructions | ||
+ | |host seconds | ||
+ | |comment | ||
+ | |- | ||
+ | |400.perlbench | ||
+ | |integer | ||
+ | |C | ||
+ | |attrs.out | ||
+ | |align="right"|- | ||
+ | |align="right"|- | ||
+ | |fatal: fault (unalign) detected @ PC 0x12009cedc | ||
+ | |- | ||
+ | |401.bzip2 | ||
+ | |integer | ||
+ | |C | ||
+ | |input.program | ||
+ | |align="right"|3171671617 | ||
+ | |align="right"|1353.56 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |403.gcc | ||
+ | |integer | ||
+ | |C | ||
+ | |cccp.i | ||
+ | |align="right"| - | ||
+ | |align="right"|- | ||
+ | |never end, but o.k. for smaller input | ||
+ | |- | ||
+ | |410.bwaves | ||
+ | |floating | ||
+ | |Fortran | ||
+ | |test | ||
+ | |align="right"|119365801487 | ||
+ | |align="right"|51703.94 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |416.gamess | ||
+ | |floating | ||
+ | |Fortran | ||
+ | |exam29 | ||
+ | |align="right"| - | ||
+ | |align="right"| - | ||
+ | |abormal exit | ||
+ | |- | ||
+ | |429.mcf | ||
+ | |integer | ||
+ | |C | ||
+ | |inp.in | ||
+ | |align="right"|5112705810 | ||
+ | |align="right"|3386.09 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |433.milc | ||
+ | |floating | ||
+ | |C | ||
+ | |su3imp.in | ||
+ | |align="right"|38027871822 | ||
+ | |align="right"|18402.06 | ||
+ | |output is different with the reference | ||
+ | |- | ||
+ | |434.zeusmp | ||
+ | |floating | ||
+ | |Fortran | ||
+ | |zmp_inp | ||
+ | |align="right"|62107158516 | ||
+ | |align="right"|27746.77 | ||
+ | |output is different with the reference | ||
+ | |- | ||
+ | |435.gromacs | ||
+ | |floating | ||
+ | |C/Fortran | ||
+ | |gromacs.tpr | ||
+ | |align="right"|10861507208 | ||
+ | |align="right"|4457.33 | ||
+ | |output is wrong, mremap has problem | ||
+ | |- | ||
+ | |436.cactusADM | ||
+ | |floating | ||
+ | |C/Fortran | ||
+ | |benchADM.par | ||
+ | |align="right"| - | ||
+ | |align="right"| - | ||
+ | |fatal: fault (unalign) detected @ PC 0x120026614 | ||
+ | |- | ||
+ | |437.leslie3d | ||
+ | |floating | ||
+ | |Fortran | ||
+ | |leslie3d.in | ||
+ | |align="right"|87402135744 | ||
+ | |align="right"|41635.96 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |444.namd | ||
+ | |floating point | ||
+ | |C++ | ||
+ | |namd.input | ||
+ | |align="right"|64449976020 | ||
+ | |align="right"|26798.88 | ||
+ | |o.k | ||
+ | |- | ||
+ | |445.gobmk | ||
+ | |integer | ||
+ | |C | ||
+ | |capture.tst | ||
+ | |align="right"|494502991 | ||
+ | |align="right"|260.29 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |447.dealII | ||
+ | |floating | ||
+ | |C++ | ||
+ | |8 | ||
+ | |align="right"| - | ||
+ | |align="right"| - | ||
+ | |output is wrong | ||
+ | |- | ||
+ | |450.soplex | ||
+ | |floating | ||
+ | |C++ | ||
+ | |test.mps | ||
+ | |align="right"|72422927 | ||
+ | |align="right"|31.95 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |453.povray | ||
+ | |floating point | ||
+ | |C++ | ||
+ | |test.ini | ||
+ | |align="right"|3597778011 | ||
+ | |align="right"|1737.24 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |454.calculix | ||
+ | |floating point | ||
+ | |C | ||
+ | |beampic.inp | ||
+ | |align="right"|251699786 | ||
+ | |align="right"|101.04 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |456.hmmer | ||
+ | |integer | ||
+ | |C | ||
+ | |bombesin.hmm | ||
+ | |align="right"|2386768547 | ||
+ | |align="right"|997.97 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |458.sjeng | ||
+ | |integer | ||
+ | |C | ||
+ | |test.txt | ||
+ | |align="right"|21682684235 | ||
+ | |align="right"|9406.80 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |459.GemsFDTD | ||
+ | |floating | ||
+ | |Fortran | ||
+ | |test.in | ||
+ | |align="right"|11046857318 | ||
+ | |align="right"|5289.88 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |462.libquantum | ||
+ | |integer | ||
+ | |C | ||
+ | |33 5 | ||
+ | |align="right"|292639209 | ||
+ | |align="right"|111.64 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |464.h264ref | ||
+ | |integer | ||
+ | |C | ||
+ | |foreman_test_encoder_baseline.cfg | ||
+ | |align="right"|154340641371 | ||
+ | |align="right"|67426.00 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |465.tonto | ||
+ | |floating | ||
+ | |Fortran | ||
+ | | - | ||
+ | |align="right"| - | ||
+ | |align="right"| - | ||
+ | |compile error | ||
+ | |- | ||
+ | |470.lbm | ||
+ | |floating | ||
+ | |C | ||
+ | |100_100_130_cf_a.of | ||
+ | |align="right"|7058506019 | ||
+ | |align="right"|4599.69 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |471.omnetpp | ||
+ | |integer | ||
+ | |C++ | ||
+ | |omnetpp.ini | ||
+ | |align="right"|2450821721 | ||
+ | |align="right"|1153.36 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |473.astar | ||
+ | |integer | ||
+ | |C++ | ||
+ | |lake.cfg | ||
+ | |align="right"|35796103621 | ||
+ | |align="right"|16433.83 | ||
+ | |output is different with the reference | ||
+ | |- | ||
+ | |481.wrf | ||
+ | |floating | ||
+ | |C/Fortran | ||
+ | | - | ||
+ | |align="right"| - | ||
+ | |align="right"| - | ||
+ | |STOP wrf_abort. Need library | ||
+ | |- | ||
+ | |482.sphinx3 | ||
+ | |floating | ||
+ | |C | ||
+ | |args.an4 | ||
+ | |align="right"|9352006427 | ||
+ | |align="right"|4011.67 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |483.xalancbmk | ||
+ | |integer | ||
+ | |C++ | ||
+ | |test.xml | ||
+ | |align="right"|501493417 | ||
+ | |align="right"|276.77 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |998.specrand | ||
+ | |integer | ||
+ | |C | ||
+ | |324342 24239 | ||
+ | |align="right"|71348559 | ||
+ | |align="right"|32.93 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |999.specrand | ||
+ | |floating | ||
+ | |C | ||
+ | |324342 24239 | ||
+ | |align="right"|71348559 | ||
+ | |align="right"|31.58 | ||
+ | |o.k. | ||
+ | |- | ||
+ | |} | ||
+ | |||
+ | ===== Trouble shooting ===== | ||
+ | You may encounter errors while executing the SPEC CPU2006, and these two errors are common on 32-bits machine. | ||
+ | |||
+ | 1. terminate called after throwing an instance of 'std::bad_alloc': | ||
+ | |||
+ | The M5 cannot allocate memory from you system. This happens a lot in the 32-bits machine, because only 3G memory can be used. To make life easier, you need a 64-bits machine. | ||
+ | |||
+ | 2. bus error: | ||
+ | |||
+ | The same as above. | ||
+ | |||
+ | |||
+ | <pre> | ||
+ | If you have any questions, please email to M5 Mailing-List. | ||
</pre> | </pre> |
Latest revision as of 16:09, 17 September 2016
This is a work in-progress. Everyone should feel free to extend this page with their experiences to help new users get started.
Contents
Input sets and Binaries
We can't provide the binaries or input files because of licensing restrictions, but It's not hard to build the binaries by yourself. In this short article, we will share our experiences about what we have done so far.
Build the cross-compiler for alpha machine
It is suggested that you use crosstool-ng available here, a new tool based on Dan Kegel's cross tool that is more up to date. Follow the instructions available on that page for building the cross-compiler.
If you do not wish to build this tool yourself, you may be able to use one of the pre-compiled cross-compilers available on the M5 Download Page.
Or, you can use the older crosstool. Download the crosstool-0.43.tar.gz from http://kegel.com/crosstool and modify these three lines in the demo-alpha.sh :
RESULT_TOP=where_you_want_to_put_the_compiler GCC_LANGUAGES="c,c++,fortran" eval `cat alpha.dat gcc-4.1.0-glibc-2.3.6.dat` sh all.sh --notest
Then follow the steps in the crosstool-howto page to build the cross compiler.
Build the SPEC CPU2006 alpha binaries
Install the SPEC CPU2006 from DVD and modify the CC, CXX, and FC in config/alpha.cfg. If alpha.cfg is not in the config/ directory, you can use linux32-i386-gcc42.cfg and modify the CC, CXX, and FC variables.
For example: CC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gcc CXX = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-g++ FC = /home/mjwu/crosstool/gcc-4.1.0-glibc-2.3.6/alpha/bin/alpha-gfortran
Then follow the instructions in the ./Docs/install-guide-unix.html to build the binaries
For example: runspec --config=alpha.cfg --action=build --tune=base bzip2
Create the SPEC CPU2006 processes for M5 SE mode
A good reference for the correct command line options can be found here: SPEC_CPU2006_Commands.
For your convenience, here is our benchmark python file for the M5 SE mode.
#Mybench.py #400.perlbench perlbench = LiveProcess() perlbench.executable = binary_dir+'400.perlbench_base.alpha-gcc' perlbench.cmd = [perlbench.executable] + ['-I./lib', 'attrs.pl'] perlbench.output = 'attrs.out' #401.bzip2 bzip2 = LiveProcess() bzip2.executable = binary_dir+'401.bzip2_base.alpha-gcc' data=data_dir+'401.bzip2/data/all/input/input.program' bzip2.cmd = [bzip2.executable] + [data, '1'] bzip2.output = 'input.program.out' #403.gcc gcc = LiveProcess() gcc.executable = binary_dir+'403.gcc_base.alpha-gcc' data=data_dir+'403.gcc/data/test/input/cccp.i' output='/import/home1/mjwu/work_spec2006/403.gcc/m5/cccp.s' gcc.cmd = [gcc.executable] + [data]+['-o',output] gcc.output = 'ccc.out' #410.bwaves bwaves = LiveProcess() bwaves.executable = binary_dir+'410.bwaves_base.alpha-gcc' bwaves.cmd = [bwaves.executable] #416.gamess gamess=LiveProcess() gamess.executable = binary_dir+'416.gamess_base.alpha-gcc' gamess.cmd = [gamess.executable] gamess.input='exam29.config' gamess.output='exam29.output' #429.mcf mcf = LiveProcess() mcf.executable = binary_dir+'429.mcf_base.alpha-gcc' data=data_dir+'429.mcf/data/test/input/inp.in' mcf.cmd = [mcf.executable] + [data] mcf.output = 'inp.out' #433.milc milc=LiveProcess() milc.executable = binary_dir+'433.milc_base.alpha-gcc' stdin=data_dir+'433.milc/data/test/input/su3imp.in' milc.cmd = [milc.executable] milc.input=stdin milc.output='su3imp.out' #434.zeusmp zeusmp=LiveProcess() zeusmp.executable = binary_dir+'434.zeusmp_base.alpha-gcc' zeusmp.cmd = [zeusmp.executable] zeusmp.output = 'zeusmp.stdout' #435.gromacs gromacs = LiveProcess() gromacs.executable = binary_dir+'435.gromacs_base.alpha-gcc' data=data_dir+'435.gromacs/data/test/input/gromacs.tpr' gromacs.cmd = [gromacs.executable] + ['-silent','-deffnm',data,'-nice','0'] #436.cactusADM cactusADM = LiveProcess() cactusADM.executable = binary_dir+'436.cactusADM_base.alpha-gcc' data=data_dir+'436.cactusADM/data/test/input/benchADM.par' cactusADM.cmd = [cactusADM.executable] + [data] cactusADM.output = 'benchADM.out' #437.leslie3d leslie3d=LiveProcess() leslie3d.executable = binary_dir+'437.leslie3d_base.alpha-gcc' stdin=data_dir+'437.leslie3d/data/test/input/leslie3d.in' leslie3d.cmd = [leslie3d.executable] leslie3d.input=stdin leslie3d.output='leslie3d.stdout' #444.namd namd = LiveProcess() namd.executable = binary_dir+'444.namd_base.alpha-gcc' input=data_dir+'444.namd/data/all/input/namd.input' namd.cmd = [namd.executable] + ['--input',input,'--iterations','1','--output','namd.out'] namd.output='namd.stdout' #445.gobmk gobmk=LiveProcess() gobmk.executable = binary_dir+'445.gobmk_base.alpha-gcc' stdin=data_dir+'445.gobmk/data/test/input/capture.tst' gobmk.cmd = [gobmk.executable]+['--quiet','--mode','gtp'] gobmk.input=stdin gobmk.output='capture.out' #447.dealII dealII=LiveProcess() dealII.executable = binary_dir+'447.dealII_base.alpha-gcc' dealII.cmd = [gobmk.executable]+['8'] dealII.output='log' #450.soplex soplex=LiveProcess() soplex.executable = binary_dir+'450.soplex_base.alpha-gcc' data=data_dir+'450.soplex/data/test/input/test.mps' soplex.cmd = [soplex.executable]+['-m10000',data] soplex.output = 'test.out' #453.povray povray=LiveProcess() povray.executable = binary_dir+'453.povray_base.alpha-gcc' data=data_dir+'453.povray/data/test/input/SPEC-benchmark-test.ini' #povray.cmd = [povray.executable]+['SPEC-benchmark-test.ini'] povray.cmd = [povray.executable]+[data] povray.output = 'SPEC-benchmark-test.stdout' #454.calculix calculix=LiveProcess() calculix.executable = binary_dir+'454.calculix_base.alpha-gcc' data='/import/RaidHome/mjwu/work_spec2006/454.calculix/m5/beampic' calculix.cmd = [calculix.executable]+['-i',data] calculix.output = 'beampic.log' #456.hmmer hmmer=LiveProcess() hmmer.executable = binary_dir+'456.hmmer_base.alpha-gcc' data=data_dir+'456.hmmer/data/test/input/bombesin.hmm' hmmer.cmd = [hmmer.executable]+['--fixed', '0', '--mean', '325', '--num', '5000', '--sd', '200', '--seed', '0', data] hmmer.output = 'bombesin.out' #458.sjeng sjeng=LiveProcess() sjeng.executable = binary_dir+'458.sjeng_base.alpha-gcc' data=data_dir+'458.sjeng/data/test/input/test.txt' sjeng.cmd = [sjeng.executable]+[data] sjeng.output = 'test.out' #459.GemsFDTD GemsFDTD=LiveProcess() GemsFDTD.executable = binary_dir+'459.GemsFDTD_base.alpha-gcc' GemsFDTD.cmd = [GemsFDTD.executable] GemsFDTD.output = 'test.log' #462.libquantum libquantum=LiveProcess() libquantum.executable = binary_dir+'462.libquantum_base.alpha-gcc' libquantum.cmd = [libquantum.executable],'33','5' libquantum.output = 'test.out' #464.h264ref h264ref=LiveProcess() h264ref.executable = binary_dir+'464.h264ref_base.alpha-gcc' data=data_dir+'464.h264ref/data/test/input/foreman_test_encoder_baseline.cfg' h264ref.cmd = [h264ref.executable]+['-d',data] h264ref.output = 'foreman_test_encoder_baseline.out' #470.lbm lbm=LiveProcess() lbm.executable = binary_dir+'470.lbm_base.alpha-gcc' data=data_dir+'470.lbm/data/test/input/100_100_130_cf_a.of' lbm.cmd = [lbm.executable]+['20', 'reference.dat', '0', '1' ,data] lbm.output = 'lbm.out' #471.omnetpp omnetpp=LiveProcess() omnetpp.executable = binary_dir+'471.omnetpp_base.alpha-gcc' data=data_dir+'471.omnetpp/data/test/input/omnetpp.ini' omnetpp.cmd = [omnetpp.executable]+[data] omnetpp.output = 'omnetpp.log' #473.astar astar=LiveProcess() astar.executable = binary_dir+'473.astar_base.alpha-gcc' astar.cmd = [astar.executable]+['lake.cfg'] astar.output = 'lake.out' #481.wrf wrf=LiveProcess() wrf.executable = binary_dir+'481.wrf_base.alpha-gcc' wrf.cmd = [wrf.executable]+['namelist.input'] wrf.output = 'rsl.out.0000' #482.sphinx sphinx3=LiveProcess() sphinx3.executable = binary_dir+'482.sphinx_livepretend_base.alpha-gcc' sphinx3.cmd = [sphinx3.executable]+['ctlfile', '.', 'args.an4'] sphinx3.output = 'an4.out' #483.xalancbmk xalancbmk=LiveProcess() xalancbmk.executable = binary_dir+'483.Xalan_base.alpha-gcc' xalancbmk.cmd = [xalancbmk.executable]+['-v','test.xml','xalanc.xsl'] xalancbmk.output = 'test.out' #998.specrand specrand_i=LiveProcess() specrand_i.executable = binary_dir+'998.specrand_base.alpha-gcc' specrand_i.cmd = [specrand_i.executable] + ['324342','24239'] specrand_i.output = 'rand.24239.out' #999.specrand specrand_f=LiveProcess() specrand_f.executable = binary_dir+'999.specrand_base.alpha-gcc' specrand_f.cmd = [specrand_i.executable] + ['324342','24239'] specrand_f.output = 'rand.24239.out'
M5 python configure script
Here is our system configuration python file for the M5 SE mode.
#cmp.py # Simple configuration script import m5 from m5.objects import * import os, optparse, sys m5.AddToPath('./configs') import Simulation from Caches import * import Mybench # Get paths we might need. It's expected this file is in m5/configs/example. config_path = os.path.dirname(os.path.abspath(__file__)) print config_path config_root = os.path.dirname(config_path)+"/configs" print config_root m5_root = os.path.dirname(config_root) print m5_root parser = optparse.OptionParser() # Benchmark options parser.add_option("-b", "--benchmark", default="", help="The benchmark to be loaded.") parser.add_option("-c", "--chkpt", default="", help="The checkpoint to load.") execfile(os.path.join(config_root, "configs", "Options.py")) (options, args) = parser.parse_args() if args: print "Error: script doesn't take any positional arguments" sys.exit(1) if options.benchmark == 'perlbench': process = Mybench.perlbench elif options.benchmark == 'bzip2': process = Mybench.bzip2 elif options.benchmark == 'gcc': process = Mybench.gcc elif options.benchmark == 'bwaves': process = Mybench.bwaves elif options.benchmark == 'gamess': process = Mybench.gamess elif options.benchmark == 'mcf': process = Mybench.mcf elif options.benchmark == 'milc': process = Mybench.milc elif options.benchmark == 'zeusmp': process = Mybench.zeusmp elif options.benchmark == 'gromacs': process = Mybench.gromacs elif options.benchmark == 'cactusADM': process = Mybench.cactusADM elif options.benchmark == 'leslie3d': process = Mybench.leslie3d elif options.benchmark == 'namd': process = Mybench.namd elif options.benchmark == 'gobmk': process = Mybench.gobmk; elif options.benchmark == 'dealII': process = Mybench.dealII elif options.benchmark == 'soplex': process = Mybench.soplex elif options.benchmark == 'povray': process = Mybench.povray elif options.benchmark == 'calculix': process = Mybench.calculix elif options.benchmark == 'hmmer': process = Mybench.hmmer elif options.benchmark == 'sjeng': process = Mybench.sjeng elif options.benchmark == 'GemsFDTD': process = Mybench.GemsFDTD elif options.benchmark == 'libquantum': process = Mybench.libquantum elif options.benchmark == 'h264ref': process = Mybench.h264ref elif options.benchmark == 'tonto': process = Mybench.tonto elif options.benchmark == 'lbm': process = Mybench.lbm elif options.benchmark == 'omnetpp': process = Mybench.omnetpp elif options.benchmark == 'astar': process = Mybench.astar elif options.benchmark == 'wrf': process = Mybench.wrf elif options.benchmark == 'sphinx3': process = Mybench.sphinx3 elif options.benchmark == 'xalancbmk': process = Mybench.xalancbmk elif options.benchmark == 'specrand_i': process = Mybench.specrand_i elif options.benchmark == 'specrand_f': process = Mybench.specrand_f if options.chkpt != "": process.chkpt = options.chkpt (CPUClass, test_mem_mode, FutureClass) = Simulation.setCPUClass(options) CPUClass.clock = '1.0GHz' #np = options.num_cpus np = 1 system = System(cpu = [CPUClass(cpu_id=i) for i in xrange(np)], physmem = PhysicalMemory(range=AddrRange("4096MB")), membus = Bus(), mem_mode = 'timing') system.physmem.port = system.membus.port for i in xrange(np): if options.caches: system.cpu[i].addPrivateSplitL1Caches(L1Cache(size = '64kB'), L1Cache(size = '64kB')) if options.l2cache: system.l2 = L2Cache(size='2MB') system.tol2bus = Bus() system.l2.cpu_side = system.tol2bus.port system.l2.mem_side = system.membus.port system.cpu[i].connectMemPorts(system.tol2bus) else: system.cpu[i].connectMemPorts(system.membus) system.cpu[i].workload = process[i] root = Root(system = system) Simulation.run(options, root, system, FutureClass)
The SPEC CPU2006 testing dataset results
We use the quard-core Xeon 2.5GHz with 16G memory machine. The operation system is 64bits CentOS 5.2. The timing results are from the simple cpu model and SPEC CPU2006 testing data set.
benchmark | datatype | language | input data | number of instructions | host seconds | comment |
400.perlbench | integer | C | attrs.out | - | - | fatal: fault (unalign) detected @ PC 0x12009cedc |
401.bzip2 | integer | C | input.program | 3171671617 | 1353.56 | o.k. |
403.gcc | integer | C | cccp.i | - | - | never end, but o.k. for smaller input |
410.bwaves | floating | Fortran | test | 119365801487 | 51703.94 | o.k. |
416.gamess | floating | Fortran | exam29 | - | - | abormal exit |
429.mcf | integer | C | inp.in | 5112705810 | 3386.09 | o.k. |
433.milc | floating | C | su3imp.in | 38027871822 | 18402.06 | output is different with the reference |
434.zeusmp | floating | Fortran | zmp_inp | 62107158516 | 27746.77 | output is different with the reference |
435.gromacs | floating | C/Fortran | gromacs.tpr | 10861507208 | 4457.33 | output is wrong, mremap has problem |
436.cactusADM | floating | C/Fortran | benchADM.par | - | - | fatal: fault (unalign) detected @ PC 0x120026614 |
437.leslie3d | floating | Fortran | leslie3d.in | 87402135744 | 41635.96 | o.k. |
444.namd | floating point | C++ | namd.input | 64449976020 | 26798.88 | o.k |
445.gobmk | integer | C | capture.tst | 494502991 | 260.29 | o.k. |
447.dealII | floating | C++ | 8 | - | - | output is wrong |
450.soplex | floating | C++ | test.mps | 72422927 | 31.95 | o.k. |
453.povray | floating point | C++ | test.ini | 3597778011 | 1737.24 | o.k. |
454.calculix | floating point | C | beampic.inp | 251699786 | 101.04 | o.k. |
456.hmmer | integer | C | bombesin.hmm | 2386768547 | 997.97 | o.k. |
458.sjeng | integer | C | test.txt | 21682684235 | 9406.80 | o.k. |
459.GemsFDTD | floating | Fortran | test.in | 11046857318 | 5289.88 | o.k. |
462.libquantum | integer | C | 33 5 | 292639209 | 111.64 | o.k. |
464.h264ref | integer | C | foreman_test_encoder_baseline.cfg | 154340641371 | 67426.00 | o.k. |
465.tonto | floating | Fortran | - | - | - | compile error |
470.lbm | floating | C | 100_100_130_cf_a.of | 7058506019 | 4599.69 | o.k. |
471.omnetpp | integer | C++ | omnetpp.ini | 2450821721 | 1153.36 | o.k. |
473.astar | integer | C++ | lake.cfg | 35796103621 | 16433.83 | output is different with the reference |
481.wrf | floating | C/Fortran | - | - | - | STOP wrf_abort. Need library |
482.sphinx3 | floating | C | args.an4 | 9352006427 | 4011.67 | o.k. |
483.xalancbmk | integer | C++ | test.xml | 501493417 | 276.77 | o.k. |
998.specrand | integer | C | 324342 24239 | 71348559 | 32.93 | o.k. |
999.specrand | floating | C | 324342 24239 | 71348559 | 31.58 | o.k. |
Trouble shooting
You may encounter errors while executing the SPEC CPU2006, and these two errors are common on 32-bits machine.
1. terminate called after throwing an instance of 'std::bad_alloc':
The M5 cannot allocate memory from you system. This happens a lot in the 32-bits machine, because only 3G memory can be used. To make life easier, you need a 64-bits machine.
2. bus error:
The same as above.
If you have any questions, please email to M5 Mailing-List.