\noindent {\bf Amdahl Vector Processors (Fujitsu VP)} \noindent {\bf Vector Register Architecture} \vspace {.1in} \noindent {\bf Architecture:} The Amdahl 500, 1100, 1200, and 1400 Vector Processors are marketed by Amdahl Corp. in the U.S., Canada, Europe, and the Pacific Basin. These products are manufactured by Fujitsu, and similar models are marketed in Japan as the VP-50, VP-100, VP-200, and VP-400. The VP-100 and 200 is also marketed by Siemens in mainland Europe. In 1987, the range was upgraded by the addition of E models. The main change was to the functional pipes. \vspace {.1in} \noindent These are all register-to-register machines. All models have one scalar and one vector unit which can execute computations independently. The scalar unit fetches all instructions and passes each instruction to the appropriate unit for execution. The scalar processor is based on the Fujitsu M380/382 series mainframes and runs the IBM S/370 extended architecture instruction set. A recent Amdahl proprietary software program product, called VP/XA, allows Amdahl vector processors to run current MVS/XA releases, and permits Amdahl supercomputers to use standard operating environments. \vspace {.1in} \noindent {\bf Configuration:} The vector unit consists of 5 or 6 pipelines, a vector register memory, and a mask memory. The 5 or 6 pipelines comprise 1 or 2 load/store pipelines, plus 1 mask pipeline, 1 add/logical pipeline, 1 multiply pipeline, and 1 divide pipeline. In the E models, the multiply pipe is replaced by a multifunctional pipe for floating-point addition, multiplication, or concurrent multiplication/addition. The number of concurrent pipelines, vector register size, and mask register size differ for each model, as shown below. Main memory capacity ranges from 32 Mbytes to 1024 Mbytes (4 to 128 M 64-bit words). \vspace{.15in} \begin{center} \begin {tabular} {l r r r r} & \multicolumn{4} {c}{Model} \\ Configuration & 500 & 1100 & 1200 & 1400 \\ \hline \# pipes total & 5 & 6 & 6 & 5 \\ \# concurrent load/store pipes & 1 & 2 & 2 & 1 \\ \# 64 bit words/vect cyc/pipe & 1 & 1 & 2 & 4 \\ Scalar cycle time (nsec)& 14 & 14 & 14 & 14 \\ Vector cycle time (nsec)& 7 & 7 & 7 & 7 \\ \# concurrent arith pipes & 2 & 3 & 3 & 3 \\ \# 64-bit results/vect cyc/pipe & 1 & 1 & 2 & 4 \\ Vect. reg. size (Kbytes) & 32 & 32 & 64 & 128 \\ Mask reg. size (Bytes) & 512 & 512 & 1024 & 2048 \\ Max. main memory (Mbytes) & 512 & 512 & 1024 & 1024 \\ Min. main memory (Mbytes) & 32 & 32 & 64 & 64 \\ Max. interleaving (ways) & 128 & 128 & 256 & 256 \\ \end{tabular} \end{center} \vspace{.15in} The total vector register capacity is 32-128 Kbytes. The registers can be reconfigured dynamically to 6 different combinations with varying vector register lengths, as shown below: \vspace{.15in} \begin{center} \begin {tabular} {l r r r r} \multicolumn{5} {c}{Configuration of Vector Registers}\\ & \multicolumn{4} {c}{ Register Length by Model} \\ & \multicolumn{4} {c}{(\# of 64-bit word elements)} \\ \# registers & 500 & 1100 & 1200 & 1400 \\ \hline 8 & 512 & 512 & 1024 & 2048 \\ 16 & 256 & 256 & 512 & 1024 \\ 32 & 128 & 128 & 256 & 512 \\ 64 & 64 & 64 & 128 & 256 \\ 128 & 32 & 32 & 64 & 128 \\ 256 & 16 & 16 & 32 & 64 \\ \end{tabular} \end{center} \vspace{.15in} \vspace {.1in} \noindent Other features: \begin{tabbing} aaa\= \kill \> 400 and 1300 gate ECL, 350-picosecond delay\\ \> main memory - 256 KB, 55 nsec, MOS static RAM\\ \> 380-470 square feet\\ \> 36-62 KVA power consumption\\ \> air cooled \end{tabbing} \vspace {.1in} \noindent {\bf Performance:} \vspace{.1in} \noindent The vector performance varies according to model as follows: \vspace{.15in} \begin{center} \begin {tabular} {c c c c} Model & Peak Mflops & Model & Peak Mflops \\ \hline 500 & 143 & 500E & 286 \\ 1100 & 286 & 1100E & 429 \\ 1200 & 571 & 1200E & 857 \\ 1400 & 1143 & 1400E & 1714 \\ 2000 & 1600+ & & \end{tabular} \end{center} \vspace{.15in} \vspace {.1in} \noindent The scalar processor cycle time is 14 nsec, compared to the CRAY X-MP's 8.5 nsec, but a sampling of scalar instructions indicates that the VP operations may be slightly faster than the X-MP's. All scalar work can overlap vector operations. \vspace {.1in} \noindent {\bf Software:} \vspace{.1in} \noindent \begin{tabbing} aaa\= \kill \> VP/XA operating system offering IBM MVS/XA system support\\ \> Automatic vectorizing Fortran compiler (Fortran 77/VP)\\ \> Scalar Fortran compiler\\ \> Interactive debugger\\ \> Performance measurement tools\\ \> Interactive vectorizer\\ \> STREAM77 Language Converter\\ \> SIMUL38 IBM 3838 array processor simulator\\ \> Scientific subroutine library (223 routines) \end{tabbing} \vspace {.1in} \noindent {\bf Contact:} \vspace{.1in} \begin{flushleft} Phil Howell \\ Amdahl Corp. \\ 1250 East Arques Ave. \\ P.O. Box 3470 \\ Sunnyvale, CA 94088 \\ 408-746-6880 \\ \vspace {.1in} Dr. Horst-Peter Rother \\ Amdahl International Management Services Ltd. \\ Dogmersfield Park \\ Hartley Wintney \\ Hampshire RG27 8TE \\ England\\ 0252-24555 Telex 858486 G \end{flushleft} .