IOPerf results

Results were obtained using two sets of IO tests All results are averaged over 10 passes. The test order in that table is :

Block IO Character Random
fwrite fread putc getc seek
Test tag & FSKB KB/sec RealKB/sec CPU%CPU KB/sec RealKB/sec CPU%CPU KB/sec RealKB/sec CPU%CPU KB/sec RealKB/sec CPU%CPU KB/sec RealKB/sec CPU%CPU
Ll__star_dataFC_test320000 50879.71204537.65 25.99 206727.46237766.26 88.94 38528.0244808.43 86.42 49936.7950601.39 98.69 2655.352785.8895.45
Ls__star_dataFC_test320000 60320.86198456.02 30.81 225273.65235172.93 95.81 38860.5543495.86 89.44 49879.6450565.85 98.65 2704.972741.0298.71
NPV_rcas6005Ll__star_dataFC_test320000 7097.31119042.595.98 4060.87165427.702.49 4819.5144270.7510.89 4254.9447315.388.98 1967.272492.4279.12
NPV_rcas6005Ls__star_dataFC_test320000 6254.49128243.964.88 4672.36161470.982.91 3989.3943570.389.16 4405.5347399.879.30 2095.292469.6484.96

In the results above, there do not seem any major differences between 4 kBytes and 64 KBytes IO based tests. The multi thread tests show an almos linear scaling in write but a concerning factor of 50 less in block IO read.

Network aggregates

Further tests were performed from two networks. The results below
STAR CAS networkReference

This test was performed by using maximum number of nodes from the CAS nodes and performing any of the test above (we already noted we did not see any major differences between small or larger chunks within 4 / 64 kBytes).
STAR's peak is at best 4 Gbits/sec (512 MByte/sec) while the reference plot (Phen-ix nodes) would indicate up to 6 Gbit/sec (768 MBytes/sec). We are 50% off peak performance and seem to be saturating. We currently attribute this discrepency to a network topology problem and investigating this.

The below graph gives a baseline comparing to the PANFS system which out-performed our previous NFS solutions.

At a maximum of 300 MB/sec at best, we would still exceed the previous solution performance. We also performed tests where both read and write would be ongoing simultaneously. The results could be seen below

IOzone profiling

The profiling gives a gross performance view for several kByte buffer and several file size.
The Read/Write tests are unbuffered while FRead/FWrite tests are buffered.