next up previous
Next: Use of Scattergrams for Defining Up: EMME/2 News 8 October 1989 Previous: Laser Printer Support for GPR

EMME/2 BENCHMARKS

In past EMME/2 News issues, several benchmarks have been published each time EMME/2 was ported to a new product. These benchmarks reported the CPU time for auto and transit assignment with the standard Winnipeg demonstration data base. In an attempt to give a general picture of the relative machine performance and evolution over time, we have gathered here all the benchmark results accumulated over the years. We will also compare some of these results with those obtained by running well known CPU and FPU benchmarks.

The following table gives all the EMME/2 benchmark results sorted according to auto assignment performance. The lines marked with "*" give results for the 80386 using native 32-bit protected mode.

Computer ModelProcessorSpeedDateAuto Assignment Trans.
CPU/FPUMHz1 it.TotalAss.
IBM 3090NEC NAS AS/EX90n/a09/898.492.719.1
HP 9000-835HP RISC3009/8910.3113.623.4
SUN SPARCserver 330 SPARC2508/8911.1n/an/a
SUN SPARCstation1SPARC2009/8914.9163.228.9
HP 9000-825HP RISC2509/8918.6202.556.5
VAX 8600DECn/a03/8723.1n/an/a
Interpro 340Clipper2503/8929.8325.992.2
Definicon PM-03068030/8823309/8934.1382.180.5
Interpro 120Clipper2003/8936.3412.8116.8
COMPAQ 20e (*)80386/3872009/8943.7491.3101.7
Definicon DSI-78068020/8812009/8951.6580.6123.5
COMPAQ 20 (*)80386/3872009/8958.3653.4135.0
SUN 3/6068020/8812009/8963.4702.9132.3
IBM PS2/70 (*)80386/3872009/8964.8721.8147.6
COMPAQ 20e80386/3872009/8966.2737.1214.4
Definicon DSI-78068020/8811703/8767.5n/a152.5
Masscomp 552068020/lightningn/a09/8973.2806.0172.3
COMPAQ 20 (*)80386/3871607/8875.2n/a183.6
Masscomp 540068020/881n/a11/8686.0937.9190.2
COMPAQ 2080386/3872009/8986.1957.0267.7
IBM PS2/7080386/3872009/8991.81018.2297.5
VAX 11/780DECn/a03/87119.2n/a300.5
microVAX IIDECn/a03/87151.2n/a450.3
VAX station 2000DECn/a09/89167.91842.4392.2
CLUB 28680286/2871009/89177.21947.9477.5
HP 9000-500HPn/a08/85186.52028.8292.0
AT clone (Taiwan)80286/2871209/89199.82200.7548.6
SUN 3/5068020/8811503/87202.0n/a247.7
Definicon DSI-3232032/0811011/85223.42455.9534.8
IBM AT80286/287 802/88262.5n/a1193.4
Masscomp 50068010n/a06/85291.33166.0499.6
Symmetric 37532016/0811012/86417.34526.9862.0
AT&T Unix-PC 7300680101001/84463.55147.6n/a
Burroughs XT550680101010/85522.45695.71503.5
Pixel 100/AP680001006/85549.55959.0715.8
SUN 2/50680101006/85584.66391.1730.5

Not surprisingly, the above table is also nearly in reverse chronological order. In the five year span covered by the benchmarks, a tremendous progress has been made at the hardware level. Consider the 68000 microprocessors family for example: a factor of 15 has been gained from the 10 MHz 68000 with an iteration time of 550 sec to the 33 MHz 68030 with an iteration time of 34.1 sec! Similarly, the 20 MHz 80386 is 6 times faster than its 8 MHz 80286 predecessor. In this last case, the difference would even be greater if we could have included results from 25 and 33 MHz 80386 or the new generation of 80486 machines which are now available. Then, at the top of the table, we have RISC microprocessors nearing the IBM 3090 mainframe results. At these speeds, we can almost speak of interactive assignments!

For all the 80386 machines tested, the improvement obtained when using the native protected mode (lines marked with "*" in the above table) is about 30% for the auto assignment and 50% for the transit assignment.

Different computer models are usually compared using small standard benchmarks that try to isolate one system component (i.e CPU, FPU, disk I/O, etc). An interesting question is: if machine X is 10 times faster than machine Y according to benchmark Z, will EMME/2 also run 10 times faster? In other words, can we predict the EMME/2 performance on a machine based solely on a few standard benchmark results? In a search for an answer to this last question, we have conducted a small study with three benchmarks testing the performance of the CPU, the FPU and a mix of both. To be even more complete, the study could also have included disk I/O and memory access benchmarks, but we decided to concentrate on CPU and FPU benchmarks, since EMME/2 is known to be processor bound. The following benchmarks were used:

The following table gives the benchmark results along with some EMME/2 results taken from the previous table. All the tests where done with the basic hardware configuration, without software accelerators such as high level memory cacheing (memcache) or virtual disk (vdisk). For multi-user operating systems, the programs where run at night to avoid as much as possible interference from other users. (Although it would be interesting in itself to observe the degradation under normal and heavy workloads.) Except for swhet and dwhet, all results are expressed in seconds.

ComputerswhetdwhetsfloatdfloatssievelsieveAutoTransitBuildwpg
SPARCstation1630342493.04.81.751.27163.228.8382
HP 9000-825349126315.36.01.831.49202.556.5450
PM-030234521298.19.33.322.89382.180.5870
SUN 3/601186117117.718.34.684.93702.9132.31147
DSI-7801220112015.317.95.244.60580.6123.51146
Masscomp 5520250016976.110.45.455.88806.0172.31124
COMPAQ 20e (*)1388130315.016.85.333.73491.3101.61044
COMPAQ 20e103692414.214.95.8910.22737.1214.41359
COMPAQ 20 (*)1282117515.017.46.534.15653.4135.01294
COMPAQ 2095084214.415.48.0313.84957.0267.71804
IBM PS2/70 (*)1158106418.120.86.704.16721.8147.61453
IBM PS2/7088378216.317.78.0813.741018.2297.51818
CLUB 28628125468.974.015.1625.351947.9477.52981
AT clone28525764.870.017.7828.742200.7548.63399
DSI-3231227734.442.314.6012.602415.0513.03826
Symmetric 37519619661.754.636.3926.384175.2817.06253
VAX st. 2000n/an/a13.319.319.3013.301842.4392.22462

The following table gives the sample correlation matrix between the benchmark times (note that for this the whetstone/sec results of swhet and dwhet were first inverted to obtain the corresponding time values):

swhetdwhetsfloatdfloatssievelsieveAutoTransit
dwhet.99
sfloat.94.95
dfloat.91.93.99
ssieve.95.92.76.71
lsieve.90.91.91.89.82
Auto.97.96.80.77.98.84
Transit.98.97.85.82.95.91.98
Buildwpg.97.95.82.78.96.84.99.98

Four points are worth noting in this last table:

Keeping in mind the small sample size (17 observations), this small study would suggest that even if depending on the FPU throughput (high correlation with sfloat) EMME/2 seems more CPU bound than FPU bound even for the auto assignment.

In a next article, we will compare the speed of the various graphic displays and also run benchmarks on the GPL and GPR utilities, using various plotters and printers.


next up previous
Next: Use of Scattergrams for Defining Up: EMME/2 News 8 October 1989 Previous: Laser Printer Support for GPR


Heinz Spiess, EMME/2 Support Center, Thu Jun 6 14:19:19 MET DST 1996